GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

data-integration

Website
Wikipedia
apache/airflow
https://static.github-zh.com/github_avatars/apache?size=40
apache / airflow

#计算机科学#Apache Airflow 是一个workflow工作流调度、编排、监控平台

airflowapacheapache-airflowPythonschedulerworkflow自动化dagdata-engineeringdata-integrationdata-orchestratordata-pipelines数据科学eltetl机器学习mlopsorchestrationworkflow-engineworkflow-orchestration
Python 40.56 k
2 小时前
https://static.github-zh.com/github_avatars/airbytehq?size=40
airbytehq / airbyte

Airbyte 开源 EL(T) 平台,帮助用户将数据从应用程序,API 和数据库中同步到数据仓库

datapipeline数据分析data-engineeringJavaPythonetlchange-data-capturedata-collectiondata-integrationeltBigQueryredshiftsnowflakedata-pipelinesql-serverMySQLPostgreSQLs3自托管
Python 18.43 k
14 小时前
Avaiga/taipy
https://static.github-zh.com/github_avatars/Avaiga?size=40
Avaiga / taipy

Turns Data and AI algorithms into production-ready web applications in no time.

自动化data-engineeringDataOps数据可视化datasciencedeveloper-toolsmlopsorchestrationpipelinepipelinesPythontaipy-guiworkflowtaipy-coreHacktoberfesthacktoberfest2023data-integrationjob-schedulerscenarioscenario-analysis
Python 18.12 k
2 天前
https://static.github-zh.com/github_avatars/dagster-io?size=40
dagster-io / dagster

An orchestration platform for the development, production, and observation of data assets.

data-pipelinesdagsterworkflow数据科学workflow-automationPythonschedulerdata-orchestratoretlanalyticsdata-engineeringmlopsorchestrationdata-integrationmetadata
Python 13.38 k
2 天前
https://static.github-zh.com/github_avatars/apache?size=40
apache / seatunnel

SeaTunnel (原名为 waterdrop)是一个易用的支持海量数据实时同步的高性能分布式数据集成平台,每天可以稳定同步数百亿数据

data-integrationhigh-performanceofflinereal-timeapachebatchcdcchange-data-capturedata-ingestioneltstreaming
Java 8.57 k
3 天前
https://static.github-zh.com/github_avatars/mage-ai?size=40
mage-ai / mage-ai

#计算机科学#🧙 Build, run, and manage data pipelines for integrating and transforming data.

机器学习人工智能datadata-engineering数据科学Pythoneltetlpipelinesdata-pipelinesorchestrationdata-integrationSQLApache Sparkdbtpipelinereverse-etltransformation
Python 8.37 k
2 天前
https://static.github-zh.com/github_avatars/cloudquery?size=40
cloudquery / cloudquery

一个高性能ELT 框架,powered by Apache Arrow

Amazon Web ServicesGoogle 云AzureSQLdata-integrationeltetletl-frameworkBigQuerydata-collectiondata-engineeringKubernetesdataairbyteGitHub API数据分析GoogleGocspmattack-surface-management
Go 6.12 k
2 天前
https://static.github-zh.com/github_avatars/apache?size=40
apache / flink-cdc

Flink CDC Connector 是ApacheFlink的一组数据源连接器

change-data-capturecdcbatchdata-integrationdata-pipelinedistributedeltetlflinkkafkaMySQLpaimonPostgreSQLreal-timeschema-evolution
Java 6.1 k
2 天前
https://static.github-zh.com/github_avatars/apache?size=40
apache / hudi

Upserts, Deletes And Incremental Processing on Big Data.

hudiapachehudidatalakebigdataapachesparkincremental-processingstream-processingdata-integrationapacheflink
Java 5.84 k
1 天前
infinyon/fluvio
https://static.github-zh.com/github_avatars/infinyon?size=40
infinyon / fluvio

🦀 event stream processing for developers to collect and transform data in motion to power responsive data intensive applications.

cloud-nativestreamingRustreal-timeServerlessstatefulstream-processingWebAssemblydata-integrationdata-flowdistributed-systemsevent-driven-architecturestream-processing-enginedata-pipelinesstreaming-datastreaming-data-pipelinesstreaming-data-processingdata-analyticsstreaming-analytics
Rust 4.94 k
6 天前
jitsucom/jitsu
https://static.github-zh.com/github_avatars/jitsucom?size=40
jitsucom / jitsu

Jitsu is an open-source Segment alternative. Fully-scriptable data ingestion engine for modern data teams. Set-up a real-time data pipeline in minutes, not days

data-integrationclickhouseGoBigQuerydata-collectionredshiftsnowflakePostgreSQL
TypeScript 4.31 k
6 天前
rudderlabs/rudder-server
https://static.github-zh.com/github_avatars/rudderlabs?size=40
rudderlabs / rudder-server

Privacy and Security focused Segment-alternative, in Golang and React

隐私warehouse-managementdata-warehousecustomer-data-platformdata-integrationdata-synchronizationetlBigQueryredshiftsnowflakedata-pipelineeltdata-engineeringcdpevent-streaming
Go 4.21 k
3 天前
https://static.github-zh.com/github_avatars/DTStack?size=40
DTStack / chunjun

Chunjun 纯钧,是一款稳定、易用、高效、批流一体的数据集成框架,目前基于实时计算引擎Flink实现多种异构数据源之间的数据同步与计算,已在上千家公司部署且稳定运行。

flinkbigdatadata-integration框架Java
Java 4.06 k
3 个月前
https://static.github-zh.com/github_avatars/seandavi?size=40
seandavi / awesome-single-cell

#Awesome#Community-curated list of software packages and data resources for single-cell, including RNA-seq, ATAC-seq, etc.

BioinformaticsAwesome Listsdimensionality-reductionPythonclusteringdata-integration数据可视化analysis
3.4 k
4 天前
https://static.github-zh.com/github_avatars/bruin-data?size=40
bruin-data / ingestr

ingestr is a CLI tool to copy data between any databases with a single command seamlessly.

BigQuerycopy-databasedata-ingestiondata-integrationdata-pipelineduckdbingestion-pipelinesql-serverPostgreSQLsnowflake
Python 2.97 k
3 天前
https://static.github-zh.com/github_avatars/apache?size=40
apache / incubator-devlake

Apache DevLake is an open-source dev data platform to ingest, analyze, and visualize the fragmented data from DevOps tools, extracting insights for engineering excellence, developer experience, and co...

data数据分析data-engineeringdata-integrationdata-transfersDevOpsdomain-layeretlGointegrationjiraOpen Sourceuser-friendlydoraHacktoberfest
Go 2.75 k
5 天前
https://static.github-zh.com/github_avatars/mara?size=40
mara / mara-pipelines

A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow

etldata-integrationPythonPostgreSQLpipelinedata
Python 2.08 k
2 年前
https://static.github-zh.com/github_avatars/bytedance?size=40
bytedance / bitsail

BitSail is a distributed high-performance data integration engine which supports batch, streaming and incremental scenarios. BitSail is widely used to synchronize hundreds of trillions of data every d...

flinkbig-datadata-integrationdata-lakedata-pipelinedata-synchronizationhigh-performancereal-time
Java 1.67 k
1 年前
https://static.github-zh.com/github_avatars/apache?size=40
apache / hop

Hop Orchestration Platform

Javastreaminghopapachedata-integrationetlorchestrationpipelineworkflow
Java 1.16 k
3 天前
https://static.github-zh.com/github_avatars/kuwala-io?size=40
kuwala-io / kuwala

#网络爬虫#Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such as ...

datadata-integration数据科学Open Dataspatial-analysiseltOpen SourcescrapingdbtPostgreSQLpysparkPythonJupyter NotebookpopulationReact无代码react-flow
JavaScript 795
3 年前
loading...