GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

data-engineering

Website
Wikipedia
apache/superset
https://static.github-zh.com/github_avatars/apache?size=40
apache / superset

Apache Superset 是一个企业级数据可视化和数据分析的平台。

supersetapacheapache-superset数据可视化data-vizanalyticsbusiness-intelligence数据科学data-engineeringasfbibusiness-analyticsdata-analytics数据分析PythonReactsql-editorFlask
Jupyter Notebook 66.65 k
1 天前
apache/airflow
https://static.github-zh.com/github_avatars/apache?size=40
apache / airflow

#计算机科学#Apache Airflow 是一个workflow工作流调度、编排、监控平台

airflowapacheapache-airflowPythonschedulerworkflow自动化dagdata-engineeringdata-integrationdata-orchestratordata-pipelines数据科学eltetl机器学习mlopsorchestrationworkflow-engineworkflow-orchestration
Python 40.56 k
3 小时前
GokuMohandas/Made-With-ML
https://static.github-zh.com/github_avatars/GokuMohandas?size=40
GokuMohandas / Made-With-ML

#自然语言处理#学习如何设计、开发、部署、和迭代生产级机器学习应用

机器学习深度学习PyTorch自然语言处理数据科学Pythonmlopsdata-engineeringdata-quality大语言模型raydistributed-training
Jupyter Notebook 38.91 k
10 个月前
https://static.github-zh.com/github_avatars/DataTalksClub?size=40
DataTalksClub / data-engineering-zoomcamp

免费数据工程师视频课程,共9周课时

data-engineeringkafkaApache SparkdbtDockerkestra
Jupyter Notebook 31.01 k
2 个月前
eugeneyan/applied-ml
https://static.github-zh.com/github_avatars/eugeneyan?size=40
eugeneyan / applied-ml

#自然语言处理#精选大公司分享他们在生产中关于数据科学 & 机器学习的论文和技术博客等资源

applied-machine-learningproductionapplied-data-science机器学习数据科学reinforcement-learningdata-engineeringrecsyssearch深度学习data-qualitydata-discovery机器视觉自然语言处理
28.03 k
1 年前
https://static.github-zh.com/github_avatars/PrefectHQ?size=40
PrefectHQ / prefect

Prefect 是一个现代化工作流编排工具,使开发人员能够构建、观察数据管道并对其做出反应

Pythonworkflowdata-engineering数据科学workflow-engineprefectinfrastructureml-opsDataOps自动化orchestrationdataobservabilitypipeline
Python 19.52 k
1 天前
https://static.github-zh.com/github_avatars/airbytehq?size=40
airbytehq / airbyte

Airbyte 开源 EL(T) 平台,帮助用户将数据从应用程序,API 和数据库中同步到数据仓库

datapipeline数据分析data-engineeringJavaPythonetlchange-data-capturedata-collectiondata-integrationeltBigQueryredshiftsnowflakedata-pipelinesql-serverMySQLPostgreSQLs3自托管
Python 18.43 k
10 小时前
Avaiga/taipy
https://static.github-zh.com/github_avatars/Avaiga?size=40
Avaiga / taipy

Turns Data and AI algorithms into production-ready web applications in no time.

自动化data-engineeringDataOps数据可视化datasciencedeveloper-toolsmlopsorchestrationpipelinepipelinesPythontaipy-guiworkflowtaipy-coreHacktoberfesthacktoberfest2023data-integrationjob-schedulerscenarioscenario-analysis
Python 18.12 k
2 天前
https://static.github-zh.com/github_avatars/argoproj?size=40
argoproj / argo-workflows

#计算机科学#Kubernetes 工作流引擎

workflowKubernetesArgodagknativeairflow机器学习argo-workflowsworkflow-engineHacktoberfestcloud-nativecncfGitOpsmlopsbatch-processingdata-engineeringpipelines
Go 15.72 k
2 天前
https://static.github-zh.com/github_avatars/andkret?size=40
andkret / Cookbook

The Data Engineering Cookbook

data-engineerdata-engineeringbig-databest-practicescookbook
Python 14.33 k
4 天前
https://static.github-zh.com/github_avatars/dagster-io?size=40
dagster-io / dagster

An orchestration platform for the development, production, and observation of data assets.

data-pipelinesdagsterworkflow数据科学workflow-automationPythonschedulerdata-orchestratoretlanalyticsdata-engineeringmlopsorchestrationdata-integrationmetadata
Python 13.38 k
2 天前
datastacktv/data-engineer-roadmap
https://static.github-zh.com/github_avatars/datastacktv?size=40
datastacktv / data-engineer-roadmap

#新手入门#Roadmap to becoming a data engineer in 2021

data-engineer-roadmapdata-engineeringcloud路线图
12.65 k
3 年前
https://static.github-zh.com/github_avatars/great-expectations?size=40
great-expectations / great_expectations

Always know what to expect from your data.

pipeline-testsdataqualitydatacleaningdatacleaner数据科学data-profilingpipelinepipeline-testingcleandatadataunittestedaexploratory-data-analysisdata-qualitydata-engineeringmlops
Python 10.48 k
2 天前
xonsh/xonsh
https://static.github-zh.com/github_avatars/xonsh?size=40
xonsh / xonsh

🐚 Python-powered shell. Full-featured and cross-platform.

PythonXonshShell命令行界面consoleBashDevOpsfriendly interactive shellZshiterm2数据科学data-engineeringsecurity-automation树莓派人工智能
Python 8.81 k
3 天前
redpanda-data/connect
https://static.github-zh.com/github_avatars/redpanda-data?size=40
redpanda-data / connect

Fancy stream processing made operationally mundane

message-queuestream-processingstreaming-datamessage-buslogsstream-processorcqrsevent-sourcingGokafkaamqprabbitmqnatsetldata-engineeringDataOps
Go 8.38 k
2 天前
https://static.github-zh.com/github_avatars/mage-ai?size=40
mage-ai / mage-ai

#计算机科学#🧙 Build, run, and manage data pipelines for integrating and transforming data.

机器学习人工智能datadata-engineering数据科学Pythoneltetlpipelinesdata-pipelinesorchestrationdata-integrationSQLApache Sparkdbtpipelinereverse-etltransformation
Python 8.37 k
2 天前
risingwavelabs/risingwave
https://static.github-zh.com/github_avatars/risingwavelabs?size=40
risingwavelabs / risingwave

下一代云原生流数据库

数据库stream-processingRustPostgreSQLkafkamaterialized-viewdata-engineeringapache-iceberg
Rust 7.88 k
2 小时前
growthbook/growthbook
https://static.github-zh.com/github_avatars/growthbook?size=40
growthbook / growthbook

Open Source Feature Flagging and A/B Testing Platform

abtesting统计abtestexperimentationsplit-testingmixpanelsnowflakeBigQueryredshiftclickhouseanalyticsab-testingfeature-flagsfeature-flaggingremote-configContinuous Delivery (CD)数据分析数据科学data-engineering
TypeScript 6.64 k
2 天前
https://static.github-zh.com/github_avatars/feast-dev?size=40
feast-dev / feast

#计算机科学#The Open Source Feature Store for AI/ML

机器学习featuresbig-datafeature-storePythonmlopsdata-engineering数据科学data-quality
Python 6.14 k
4 天前
https://static.github-zh.com/github_avatars/cloudquery?size=40
cloudquery / cloudquery

一个高性能ELT 框架,powered by Apache Arrow

Amazon Web ServicesGoogle 云AzureSQLdata-integrationeltetletl-frameworkBigQuerydata-collectiondata-engineeringKubernetesdataairbyteGitHub API数据分析GoogleGocspmattack-surface-management
Go 6.12 k
2 天前
loading...