GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

data-quality-checks

Website
Wikipedia
open-metadata/OpenMetadata
https://static.github-zh.com/github_avatars/open-metadata?size=40
open-metadata / OpenMetadata

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team colla...

metadatadatadiscovery数据科学dataqualitydata-profilingmetadata-managementdataengineeringdata-catalogdata-observabilitydbtdata-discoverydata-contractsdata-governancedata-lineagedata-validationsnowflakedata-qualitydata-quality-checksdata-collaboration
TypeScript 6.9 k1
7 小时前
sodadata/soda-core
https://static.github-zh.com/github_avatars/sodadata?size=40
sodadata / soda-core

⚡ Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io

Pythondata-engineeringdata-governancedata-monitoringdata-observabilitydata-profilingdata-qualitydata-quality-checksdata-quality-monitoringdata-reliabilitydata-testingdata-validationdataqualitydbtpipeline-testingsnowflakedata-contracts
Python 2.11 k
4 天前
https://static.github-zh.com/github_avatars/re-data?size=40
re-data / re-data

re_data - fix data issues before your users & CEO would discover them 😊

data-monitoring数据分析data-qualitydata-quality-monitoringopen-source-toolingdata-observabilitydataqualitydata-testingdata-quality-checksdbtdata-reliability
HTML 1.56 k
1 年前
https://static.github-zh.com/github_avatars/datavane?size=40
datavane / datavines

Know your data better!Datavines is Next-gen Data Observability Platform, support metadata manage and data quality.

dataqualitydatasciencedorisApache Sparkmetadatacleandatadata-engineeringdata-profilingdata-qualitydata-quality-checksdata-quality-monitoring数据科学flink
Java 624
1 个月前
https://static.github-zh.com/github_avatars/polyaxon?size=40
polyaxon / traceml

Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.

pandasdataframes数据科学Apache Sparkdaskplotly统计matplotlibdata-profiling数据可视化data-explorationDataOpsmlopsdata-qualitydata-quality-checksexplainable-aiPyTorchTensorflowtracking
Python 517
2 个月前
https://static.github-zh.com/github_avatars/databrickslabs?size=40
databrickslabs / dqx

Databricks framework to validate Data Quality of pySpark DataFrames

data-profilingdata-qualitydata-quality-checksdata-quality-monitoringdatabricksApache Sparkspark-streamingdlt
Python 275
9 天前
https://static.github-zh.com/github_avatars/ubisoft?size=40
ubisoft / mobydq

🐳 Tool to automate data quality checks on data pipelines

data-qualitydata-pipelinedata-warehousebig-datadata-quality-checksdata-quality-monitoring
Vue 255
3 年前
https://static.github-zh.com/github_avatars/canimus?size=40
canimus / cuallee

Possibly the fastest DataFrame-agnostic quality check library in town.

bigdataperformance-metricspysparkPythonUnit testingpandasdataqualitydata-qualitydata-quality-checks
Python 192
6 天前
https://static.github-zh.com/github_avatars/Hyhyhyhyhyhyh?size=40
Hyhyhyhyhyhyh / Django-Data-quality-system

数据治理、数据质量检核/监控平台(Django+jQuery+MySQL)

data-qualitydata-quality-checksdata-quality-monitoring
Python 186
3 年前
https://static.github-zh.com/github_avatars/AKSW?size=40
AKSW / RDFUnit

An RDF Unit Testing Suite

RDF (Resource Description Framework)data-qualitydata-quality-checksschemaschema-validationvalidationUnit testingdata-validation
Java 159
2 年前
https://static.github-zh.com/github_avatars/dqops?size=40
dqops / dqo

Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observability. Configure data quality checks from the UI or in YAML f...

DataOpsdata-qualitydata-quality-checksdata-quality-measurementdata-quality-monitoring监控data-observabilitydata-profiling
Java 151
14 天前
https://static.github-zh.com/github_avatars/Seddryck?size=40
Seddryck / NBi

NBi is a testing framework (add-on to NUnit) for Business Intelligence and Data Access. The main goal of this framework is to let users create tests with a declarative approach based on an Xml syntax....

nunit数据库cubeetlbusiness-intelligenceTest automationtest-frameworkdata-qualitydata-quality-checks
C# 107
2 个月前
https://static.github-zh.com/github_avatars/evidentlyai?size=40
evidentlyai / ml_observability_course

Free Open-source ML observability course for data scientists and ML engineers. Learn how to monitor and debug your ML models in production.

data-driftdata-qualitydata-quality-checksllmopsmachine-learning-operationsml-monitoringml-observabilityml-pipelinesmlopsmodel-monitoringproduction-machine-learning
Jupyter Notebook 86
1 年前
https://static.github-zh.com/github_avatars/Swiple?size=40
Swiple / swiple

Swiple enables you to easily observe, understand, validate and improve the quality of your data

datadata-observabilitydata-qualityobservabilityPythonvalidationFastAPIdata-profiling数据科学data-analyticsdata-quality-checksdata-quality-monitoringdata-reliability
Python 84
5 天前
https://static.github-zh.com/github_avatars/PovertyAction?size=40
PovertyAction / high-frequency-checks

A Stata template for running high frequency checks of incoming research data at Innovations for Poverty Action

researchresearch-tooldata-quality-checks
Stata 83
4 个月前
https://static.github-zh.com/github_avatars/josephmachado?size=40
josephmachado / python_essentials_for_data_engineers

Code for blog at https://www.startdataengineering.com/post/python-for-de/

data-engineeringdata-quality-checksduckdbpolarsPythontransformations
Python 77
1 年前
https://static.github-zh.com/github_avatars/socialpoint-labs?size=40
socialpoint-labs / sqlbucket

Lightweight library to write, orchestrate and test your SQL ETL. Writing ETL with data integrity in mind.

SQLetletl-frameworkdata-quality-checksdata-quality
Python 74
1 年前
https://static.github-zh.com/github_avatars/google?size=40
google / data-quality-monitor

Data Quality Monitor (DQM) - Continuously validate your data with easy, customizable rules.

BigQuerycloudstoragedata-quality-checksGoogle 云PythonTerraform
TypeScript 37
1 年前
https://static.github-zh.com/github_avatars/mfcabrera?size=40
mfcabrera / hooqu

hooqu is a library built on top of Pandas-like Dataframes for defining "unit tests for data". This is a spiritual port of Apache Deequ to Python

数据科学data-qualitydata-quality-checks
Python 29
6 个月前
https://static.github-zh.com/github_avatars/PEDSnet?size=40
PEDSnet / Data-Quality-Analysis

The PEDSnet Data Quality Assessment Toolkit (OMOP CDM)

data-qualitydata-quality-checks
R 24
4 年前
loading...