DataOps

DataOps is an automated, process-oriented methodology, used by analytic and data teams, to improve the quality and reduce the cycle time of data analytics. While DataOps began as a set of best practices, it has now matured to become a new and independent approach to data analytics. DataOps applies to the entire data lifecycle from data preparation to reporting, and recognizes the interconnected nature of the data analytics team and information technology operations.


Related topics

Open Data
PrefectHQ / prefect

Prefect is a modern workflow orchestration tool that lets developers build, observe, and react to data pipelines.

Python, workflow, data-engineering, data-science, workflow-engine, prefect, infrastructure, ml-ops, DataOps, automation, orchestration, data, observability, pipeline
Python 19.52 k
1 day ago
Avaiga / taipy

Turns Data and AI algorithms into production-ready web applications in no time.

automation, data-engineering, DataOps, data-visualization, datascience, developer-tools, mlops, orchestration, pipeline, pipelines, Python, taipy-gui, workflow, taipy-core, Hacktoberfest, hacktoberfest2023, data-integration, job-scheduler, scenario, scenario-analysis
Python 18.12 k
2 days ago
cleanlab / cleanlab

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

weak-supervision, data-cleaning, data-quality, data-science, noisy-labels, data-centric-ai, out-of-distribution-detection, outlier-detection, active-learning, data-labeling, data-profiling, data-validation, labeling, data-curation, annotation, DataOps, dataquality, large-language-models, datasets, exploratory-data-analysis
Python 10.61 k
12 days ago
redpanda-data / connect

Fancy stream processing made operationally mundane

message-queue, stream-processing, streaming-data, message-bus, logs, stream-processor, cqrs, event-sourcing, Go, kafka, amqp, rabbitmq, nats, etl, data-engineering, DataOps
Go 8.38 k
2 days ago
flyteorg / flyte

Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.

flyte, machine-learning, Go, scale, workflow, data-science, data-analysis, data, kubernetes-operator, Kubernetes, orchestration-engine, mlops, DataOps, gRPC, Python, production, declarative, fine-tuning, large-language-models, Hacktoberfest
Go 6.29 k
6 days ago
lancedb / lance

Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckD...

machine-learning, computer-vision, data-format, deep-learning, Python, apache-arrow, duckdb, mlops, data-analysis, data-analytics, data-science, DataOps, data-centric, embeddings, Rust, large-language-models
Rust 4.77 k
2 days ago
redpanda-data / console

Redpanda Console is a developer-friendly UI for managing your Kafka/Redpanda workloads. Console gives you a simple, interactive approach for gaining visibility into your topics, masking data, managing...

apache-kafka, DataOps, React, TypeScript, kafka-ui, kafka-gui, web-ui, Go, kafka
TypeScript 4.04 k
3 days ago
Netflix / maestro

Maestro: Netflix’s Workflow Orchestrator

analytics, automation, batch-processing, dag, data-engineering, DataOps, data-orchestrator, data-pipelines, data-science, elt, etl, Java, machine-learning, mlops, orchestration, scheduler, workflow, workflow-engine, workflow-orchestration
Java 3.48 k
3 days ago
whylabs / whylogs

An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model performance over time. 🛡️ Supports privacy-preserving data collect...

ai-pipelines, approximate-statistics, statistical-properties, data-quality, calculate-statistics, Python, Logging, mlops, DataOps, ml-pipelines, data-pipeline, dataset, machine-learning, data-science, analytics, constraints
Jupyter Notebook 2.72 k
5 months ago
TobikoData / sqlmesh

Scalable and efficient data transformation framework - backwards compatible with dbt.

DataOps, elt, etl, SQL, Python, dataengineering, transformation, dbt
Python 2.4 k
1 day ago
meltano / meltano

Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to writing, maintaining, and scaling your own API integrations.

DataOps, elt, Open Source, data, pipelines, extract-data, connectors, integration, tap, loaders, data-pipelines, data-engineering
Python 2.1 k
5 days ago
elementary-data / elementary

The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.

data-lineage, data-governance, data-warehouse, snowflake, BigQuery, data-analysis, data-pipelines, data-pipeline, lineage, data-reliability, data-observability, DataOps, dbt, redshift
HTML 2.08 k
3 days ago
lensesio / fast-data-dev

Kafka Docker for development. Kafka, Zookeeper, Schema Registry, Kafka-Connect, , 20+ connectors

kafka-rest-proxy, schema-registry, kafka, Docker, DataOps
Shell 2.05 k
2 months ago
MarquezProject / marquez

Collect, aggregate, and visualize a data ecosystem's metadata

data-lineage, data-discovery, data-governance, data-dictionary, metadata, DataOps
Java 1.93 k
4 days ago
alibaba / SREWorks

Cloud Native DataOps & AIOps Platform

Kubernetes, SRE, application, Software as a service, cloudnative, DataOps, aiops, oam, engineering, maintenance, ops, DevOps, flink
Java 1.89 k
1 year ago
datavane / tis

Supports agile DataOps based on Flink, DataX, Flink-CDC, and Chunjun, with a web UI

Java, datax, etl, flink, cdc, DataOps
Java 1.16 k
8 days ago
automaticmode / active_workflow

Polyglot workflows without leaving the comfort of your technology stack.

orchestration-framework, scheduling, workflow, event-driven, ifttt, scheduler, services-platform, data-engineering, DataOps, agents, self-hosted
Ruby 860
2 years ago
opendatadiscovery / awesome-data-catalogs

📙 Awesome Data Catalogs and Observability Platforms.

data-catalog, data-discovery, metadata, DataOps, Awesome Lists, observability, data-engineering, data-quality, big-data, Open Source, machine-learning, Open Data, datadiscovery, metadata-management
858
2 months ago
raystack / optimus

Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.

airflow, etl, workflows, automation, Go, BigQuery, data-warehouse, analytics, data-modelling, analytics-engineering, data-transformation, data-pipelines, elt, business-intelligence, DataOps
Go 749
1 year ago
tenzir / tenzir

Tenzir is the data pipeline engine for security teams.

incident-response, threathunting, siem, soc, security, DataOps, investigation, pcap, netflow, suricata, zeek, pipelines, sigma, Hacktoberfest
C++ 681
1 day ago