GitHub Chinese Community
great-expectations

iusztinpaul / energy-forecasting

🌀 The Full Stack 7-Steps MLOps Framework | Learn MLE & MLOps for free by designing, building and deploying an end-to-end ML batch system ~ source c...

airflow, batch-processing, feature-store, Actions, mlops, Python, CI/CD, Docker, FastAPI, Google Cloud, great-expectations, ml-monitoring, poetry, sktime, Streamlit, weights-and-biases, data-versioning
Python 923
1 year ago
adidas / lakehouse-engine

The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for several lakehouse algorithms, data flows and utilities for Data Prod...

big-data, configuration-driven, data-engineering, data-quality, databricks, delta-lake, framework, great-expectations, lakehouse, Apache Spark
Python 252
4 months ago
josephmachado / data_engineering_best_practices

Sample project to demonstrate data engineering best practices

data-engineering, delta-lake, etl, great-expectations, minio, pyspark, Apache Spark
Python 193
1 year ago
trannhatnguyen2 / NYC_Taxi_Data_Pipeline

Nyc_Taxi_Data_Pipeline - DE Project

airflow, dbt, debezium, Docker, great-expectations, kafka, minio, PostgreSQL, Apache Spark, trino
Python 108
8 months ago
GokuMohandas / testing-ml

#Computer Science# Learn how to create reliable ML systems by testing code, data and models.

great-expectations, machine learning, mlops, pytest, Testing
Jupyter Notebook 87
3 years ago
provectus / data-quality-gate

Data Quality Gate based on AWS

Amazon Web Services, data-quality, great-expectations, redshift, s3, athena, aws-lambda, data-governance, Terraform
Python 56
1 year ago
hoangsonww / End-to-End-Data-Pipeline

📈 A scalable, production-ready data pipeline for real-time streaming & batch processing, integrating Kafka, Spark, Airflow, AWS, Kubernetes, and MLflow. Supports end-to-end data ingestion, transforma...

airflow, apache, Docker, elasticsearch, flink, Grafana, great-expectations, hadoop, influxdb, kafka, Kubernetes, minio, mlflow, PostgreSQL, prometheus, Python, Apache Spark, SQL, Terraform
Python 40
21 days ago
NatanMish / data_validation

#Computer Science# Tutorial for implementing data validation in data science pipelines

data science, data-validation, great-expectations, machine learning, pydantic, Python
Jupyter Notebook 33
3 years ago
MDS-BD / hands-on-great-expectations-with-spark

How to evaluate the Quality of your Data with Great Expectations and Spark.

data-quality, great-expectations, Apache Spark
Jupyter Notebook 31
2 years ago
luatnc87 / modern-data-warehouse-modeling-and-data-quality-with-dbt-openmetadata

This repository serves as a comprehensive guide to effective data modeling and robust data quality assurance using popular open-source tools

airflow, dbt, great-expectations, openmetadata, Slack, data-modeling, duckdb, data-quality
Python 30
2 years ago
PrefectHQ / prefect-great-expectations

Prefect integrations for interacting with Great Expectations

prefect, expectations, great-expectations
Python 28
10 months ago
dain55788 / ELT-Data-Pipeline

ELT Data Pipeline implementation in Data Warehousing environment

apache-airflow, data-engineering, dbt, great-expectations, PostgreSQL, powerbi, Apache Spark, minio, trino
Jupyter Notebook 26
1 month ago
moritzkoerber / covid-19-data-engineering-pipeline

A Covid-19 data pipeline on AWS featuring PySpark/Glue, Docker, Great Expectations, Airflow, and Redshift, templated in CloudFormation and CDK, deployable via Github Actions.

Amazon Web Services, aws-lambda, aws-s3, Docker, great-expectations, pyspark, Apache Spark, API, aws-cdk, aws-cloudformation, apache-airflow
Python 23
2 years ago
ismaildawoodjee / GreatEx

A project for exploring how Great Expectations can be used to ensure data quality and validate batches within a data pipeline defined in Airflow.

Docker, airflow, great-expectations, Python, data-quality, ecommerce, CSV, SQL, PostgreSQL, parquet, elt, data-pipeline, pipeline, data-engineering, etl, data-profiling
Python 21
3 years ago
BirdiD / BirdiDQ

BirdiDQ leverages the power of the Python Great Expectations open-source library and combines it with the simplicity of natural language queries to effortlessly identify and report data quality issues...

artificial intelligence, dataquality, great-expectations, large-language-models
Jupyter Notebook 20
2 years ago
josephmachado / data_engineering_best_practices_log

Code to demonstrate data engineering metadata & logging best practices

Grafana, great-expectations, Logging, minio, PostgreSQL, prometheus, Apache Spark, metadata
Python 16
1 year ago
grillazz / fastapi-greatexpectations

Run greatexpectations.io on ANY SQL Engine using REST API. Supported by FastAPI, Pydantic and SQLAlchemy as best data quality tool

FastAPI, great-expectations, Python, sqlalchemy, pydantic, dataquality, SQL
Python 12
10 days ago
luchonaveiro / open-source-data-stack

Integrating Apache Airflow, dbt, Great Expectations and Apache Superset to develop a modern open source data stack.

airflow, dbt, etl, great-expectations, PostgreSQL, superset, Docker
HTML 12
3 years ago
serialbandicoot / great-assertions

This library is inspired by the Great Expectations library. The library has made the various expectations found in Great Expectations available when using the inbuilt python unittest assertions.

Testing, great-expectations, Python, databricks, data science, Jupyter Notebook, quality-assurance, data-testing
Python 10
3 years ago
datarootsio / notion-dbs-data-quality

Using Great Expectations and Notion's API, this repo aims to provide data quality for our databases in Notion.

data-quality, Notion, notion-api, great-expectations, data-engineering-pipeline
Python 9
4 years ago