#

DataOps

DataOps is an automated, process-oriented methodology, used by analytic and data teams, to improve the quality and reduce the cycle time of data analytics. While DataOps began as a set of best practices, it has now matured to become a new and independent approach to data analytics. DataOps applies to the entire data lifecycle from data preparation to reporting, and recognizes the interconnected nature of the data analytics team and information technology operations.

https://static.github-zh.com/github_avatars/PrefectHQ?size=40

Prefect 是一个现代化工作流编排工具,使开发人员能够构建、观察数据管道并对其做出反应

Python 20.36 k
3 小时前
https://static.github-zh.com/github_avatars/lancedb?size=40

#计算机科学#Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckD...

Rust 5.38 k
6 小时前
redpanda-data/console
https://static.github-zh.com/github_avatars/redpanda-data?size=40

Redpanda Console is a developer-friendly UI for managing your Kafka/Redpanda workloads. Console gives you a simple, interactive approach for gaining visibility into your topics, masking data, managing...

TypeScript 4.12 k
6 小时前
https://static.github-zh.com/github_avatars/whylabs?size=40

#计算机科学#An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model performance over time. 🛡️ Supports privacy-preserving data collect...

Jupyter Notebook 2.75 k
8 个月前
https://static.github-zh.com/github_avatars/TobikoData?size=40

Scalable and efficient data transformation framework - backwards compatible with dbt.

Python 2.61 k
6 小时前
https://static.github-zh.com/github_avatars/meltano?size=40

Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to writing, maintaining, and scaling your own API integrations.

Python 2.2 k
3 天前
elementary-data/elementary
https://static.github-zh.com/github_avatars/elementary-data?size=40

The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.

HTML 2.15 k
2 天前
https://static.github-zh.com/github_avatars/lensesio?size=40

Kafka Docker for development. Kafka, Zookeeper, Schema Registry, Kafka-Connect, , 20+ connectors

Shell 2.06 k
24 天前
https://static.github-zh.com/github_avatars/MarquezProject?size=40
Java 2.02 k
9 天前
https://static.github-zh.com/github_avatars/alibaba?size=40
Java 1.91 k
1 年前
datavane/tis
https://static.github-zh.com/github_avatars/datavane?size=40

Support agile DataOps Based on Flink, DataX and Flink-CDC, Chunjun with Web-UI

Java 1.19 k
3 天前
https://static.github-zh.com/github_avatars/raystack?size=40

Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.

Go 752
1 年前
loading...
Website
Wikipedia
维基百科

相关主题

Open Data