GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

elt

Website
Wikipedia
apache/airflow
https://static.github-zh.com/github_avatars/apache?size=40
apache / airflow

#计算机科学#Apache Airflow 是一个workflow工作流调度、编排、监控平台

airflowapacheapache-airflowPythonschedulerworkflow自动化dagdata-engineeringdata-integrationdata-orchestratordata-pipelines数据科学eltetl机器学习mlopsorchestrationworkflow-engineworkflow-orchestration
Python 40.56 k
3 小时前
https://static.github-zh.com/github_avatars/airbytehq?size=40
airbytehq / airbyte

Airbyte 开源 EL(T) 平台,帮助用户将数据从应用程序,API 和数据库中同步到数据仓库

datapipeline数据分析data-engineeringJavaPythonetlchange-data-capturedata-collectiondata-integrationeltBigQueryredshiftsnowflakedata-pipelinesql-serverMySQLPostgreSQLs3自托管
Python 18.43 k
15 小时前
https://static.github-zh.com/github_avatars/apache?size=40
apache / doris

Doris 是百度开源的支持对海量大数据进行快速分析的MPP数据库。

olap数据库hadoophivehudiicebergreal-timeSQLBigQuerydbtdelta-lakeeltetllakehousequery-engineredshiftsnowflakeApache Spark
Java 13.81 k
2 天前
dbt-labs/dbt-core
https://static.github-zh.com/github_avatars/dbt-labs?size=40
dbt-labs / dbt-core

dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.

dbt-viewpointSlackpypadata-modelingbusiness-intelligenceanalyticselt
Python 10.97 k
2 天前
https://static.github-zh.com/github_avatars/apache?size=40
apache / seatunnel

SeaTunnel (原名为 waterdrop)是一个易用的支持海量数据实时同步的高性能分布式数据集成平台,每天可以稳定同步数百亿数据

data-integrationhigh-performanceofflinereal-timeapachebatchcdcchange-data-capturedata-ingestioneltstreaming
Java 8.57 k
3 天前
https://static.github-zh.com/github_avatars/mage-ai?size=40
mage-ai / mage-ai

#计算机科学#🧙 Build, run, and manage data pipelines for integrating and transforming data.

机器学习人工智能datadata-engineering数据科学Pythoneltetlpipelinesdata-pipelinesorchestrationdata-integrationSQLApache Sparkdbtpipelinereverse-etltransformation
Python 8.37 k
2 天前
https://static.github-zh.com/github_avatars/cloudquery?size=40
cloudquery / cloudquery

一个高性能ELT 框架,powered by Apache Arrow

Amazon Web ServicesGoogle 云AzureSQLdata-integrationeltetletl-frameworkBigQuerydata-collectiondata-engineeringKubernetesdataairbyteGitHub API数据分析GoogleGocspmattack-surface-management
Go 6.12 k
2 天前
https://static.github-zh.com/github_avatars/apache?size=40
apache / flink-cdc

Flink CDC Connector 是ApacheFlink的一组数据源连接器

change-data-capturecdcbatchdata-integrationdata-pipelinedistributedeltetlflinkkafkaMySQLpaimonPostgreSQLreal-timeschema-evolution
Java 6.1 k
2 天前
rudderlabs/rudder-server
https://static.github-zh.com/github_avatars/rudderlabs?size=40
rudderlabs / rudder-server

Privacy and Security focused Segment-alternative, in Golang and React

隐私warehouse-managementdata-warehousecustomer-data-platformdata-integrationdata-synchronizationetlBigQueryredshiftsnowflakedata-pipelineeltdata-engineeringcdpevent-streaming
Go 4.21 k
3 天前
https://static.github-zh.com/github_avatars/dlt-hub?size=40
dlt-hub / dlt

data load tool (dlt) is an open source Python library that makes data loading easy 🛠️

dataPythondata-engineeringdata-lakedata-loadingdata-warehouseeltextractloadtransform
Python 3.72 k
4 天前
https://static.github-zh.com/github_avatars/Netflix?size=40
Netflix / maestro

#计算机科学#Maestro: Netflix’s Workflow Orchestrator

analytics自动化batch-processingdagdata-engineeringDataOpsdata-orchestratordata-pipelines数据科学eltetlJava机器学习mlopsorchestrationschedulerworkflowworkflow-engineworkflow-orchestration
Java 3.48 k
20 小时前
https://static.github-zh.com/github_avatars/TobikoData?size=40
TobikoData / sqlmesh

Scalable and efficient data transformation framework - backwards compatible with dbt.

DataOpseltetlSQLPythondataengineeringtransformationdbt
Python 2.4 k
1 天前
quarylabs/quary
https://static.github-zh.com/github_avatars/quarylabs?size=40
quarylabs / quary

Open-source BI for engineers

analyticsbusiness-intelligencedata-modelingeltbig-data
Rust 2.31 k
4 个月前
https://static.github-zh.com/github_avatars/ucbepic?size=40
ucbepic / docetl

#大语言模型#A system for agentic LLM-powered data processing and ETL

dataetl大语言模型Pythondata-pipelineseltworkflowagentssemantic-datallm-datadocument-processing
Python 2.14 k
4 天前
https://static.github-zh.com/github_avatars/meltano?size=40
meltano / meltano

Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to writing, maintaining, and scaling your own API integrations.

DataOpseltOpen Sourcedatapipelinesextract-dataconnectorsintegrationtaploadersdata-pipelinesdata-engineering
Python 2.1 k
5 天前
https://static.github-zh.com/github_avatars/dataform-co?size=40
dataform-co / dataform

Dataform is a framework for managing SQL based data operations in BigQuery

data-pipelineseltdata-engineeringbusiness-intelligenceanalyticsetlHacktoberfest
TypeScript 907
4 天前
https://static.github-zh.com/github_avatars/datazip-inc?size=40
datazip-inc / olake

Fastest open-source tool for replicating Databases to Data Lake in Open Table Formats like Apache Iceberg. ⚡ Efficient, quick and scalable data ingestion for real-time analytics. Supporting Postgres,...

cdcchange-data-capturedata-pipeline数据库eltlakehousereplicationapache-icebergparquets3
Go 889
3 天前
https://static.github-zh.com/github_avatars/kuwala-io?size=40
kuwala-io / kuwala

#网络爬虫#Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such as ...

datadata-integration数据科学Open Dataspatial-analysiseltOpen SourcescrapingdbtPostgreSQLpysparkPythonJupyter NotebookpopulationReact无代码react-flow
JavaScript 795
3 年前
https://static.github-zh.com/github_avatars/raystack?size=40
raystack / optimus

Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.

airflowetlworkflows自动化GoBigQuerydata-warehouseanalyticsdata-modellinganalytics-engineeringdata-transformationdata-pipelineseltbusiness-intelligenceDataOps
Go 749
1 年前
https://static.github-zh.com/github_avatars/artie-labs?size=40
artie-labs / transfer

Database replication platform that leverages change data capture. Stream production data from databases to your data warehouse (Snowflake, BigQuery, Redshift, Databricks) in real-time.

snowflakecdcchange-data-captureGoBigQuerykafkaapache-kafkadata-integrationdata-pipelines数据库debeziumeltredshift
Go 655
9 天前
loading...