GitHub Chinese Community

#etl-framework

pathwaycom / pathway

Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.

batch-processing, kafka, pathway, Python, streaming, machine learning, real-time, data-analytics, data-pipelines, data-processing, dataflow, etl, etl-framework, iot-analytics, Rust, stream-processing, time-series-analysis
Python 26.78 k
3 days ago
elastic / logstash

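Logstash is an open-source, real-time log collection engine built around pipelines. It can dynamically unify data from disparate sources and store the normalized data in the destination of your choice. It cleanses and democratizes all your data for analytics and visualization.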

etl-framework, streaming, Logging, Java, jruby, real-time-processing
Java 14.52 k
6 hours ago
cloudquery / cloudquery

A high-performance ELT framework, powered by Apache Arrow

Amazon Web Services, Google Cloud, Azure, SQL, data-integration, elt, etl, etl-framework, BigQuery, data-collection, data-engineering, Kubernetes, data, airbyte, GitHub API, data analysis, Google, Go, cspm, attack-surface-management
Go 6.12 k
2 days ago
noflo / noflo

Flow-based programming for JavaScript

noflo, fbp, etl-framework, flow-based-programming, no-code
JavaScript 3.53 k
1 year ago
apache / hamilton

#Computer Science# Apache Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows that encode lineage/tracing and metadata. Runs and scales everywhere Python does.

data science, Python, dag, data-engineering, dataframe, etl, etl-framework, etl-pipeline, feature-engineering, machine learning, pandas, software engineering, data analysis, lineage, llmops, mlops, orchestration, Hacktoberfest, rag
Jupyter Notebook 2.15 k
6 days ago
san089 / goodreads_etl_pipeline

An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.

etl-pipeline, etl-framework, Apache Spark, apache-airflow, airflow, redshift, emr-cluster, livy, s3, data-lake, scheduler, data-migration, data-engineering, data-engineering-pipeline, Python, etl-job
Python 1.39 k
5 years ago
singer-io / getting-started

This repository is a getting started guide to Singer.

etl, etl-framework, Python, data analysis
Makefile 1.31 k
9 months ago
marsupialtail / quokka

Making data lake work for time series

data-lake-analytics, distributed, etl-framework, mlops, SQL
Python 1.17 k
10 months ago
stitchfix / hamilton

#Computer Science# A scalable general-purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamilton

Python, pandas, dag, data science, data-engineering, NumPy, software engineering, etl-framework, etl-pipeline, etl, feature-engineering, dataframe, data-platform, machine learning
Python 860
2 years ago
Cinchoo / ChoETL

ETL framework for .NET (Parser / Writer for CSV, Flat, Xml, JSON, Key-Value, Parquet, Yaml, Avro formatted files)

CSV, Parser, writer, reader, flat, XML, JSON, keyvalue, etl, etl-framework, C#, .NET, parquet, YAML, avro
C# 825
9 days ago
apache / seatunnel-web

SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).

apache, data-integration, data-pipeline, etl-framework, high-performance, offline, real-time, seatunnel, sql-engine
Java 694
5 days ago
YotpoLtd / metorikku

A simplified, lightweight ETL Framework based on Apache Spark

big-data, Apache Spark, Scala, etl-framework, distributed-computing, SQL, etl, etl-pipeline
Scala 586
1 year ago
seanharr11 / etlalchemy

Extract, Transform, Load: Any SQL Database in 4 lines of Code.

etl-framework, etl, Python, database, migrations, sqlalchemy
Python 557
6 years ago
usc-isi-i2 / kgtk

Knowledge Graph Toolkit

graphs, RDF (Resource Description Framework), etl-framework, embeddings, wikidata, toolkit
Jupyter Notebook 386
2 years ago
quintoandar / butterfree

A tool for building feature stores.

Python, package, data-engineering, etl-framework, etl, feature-store, data science, pyspark
Python 303
9 days ago
data-dot-all / dataall

A modern data marketplace that makes collaboration among diverse users (like business, analysts and engineers) easier, increasing efficiency and agility in data projects on AWS.

aws-s3, datalakehouse, redshift, data science, etl-framework, Amazon Web Services
Python 242
5 days ago
Nextdoor / bender

Bender - Serverless ETL Framework

aws-lambda, etl-framework, aws-s3, etl, Java
Java 185
1 year ago
velocitybolt / open-extract

#Large Language Models# Structured Data Extractor for AI Agents. Search your documents or the web for specific data and get it back in JSON or Markdown in a single tool call.

artificial intelligence, etl, etl-framework, large language models, autogen, crewai, langchain, langgraph, openai, rag, unstructured-data, Python
Python 171
3 months ago
ceumicrodata / mETL

mito ETL tool

Python, data-integration, etl, etl-framework, pipeline
Python 163
4 years ago
dalenewman / Transformalize

Configurable Extract, Transform, and Load

etl, etl-framework, elasticsearch, solr, sql-server, MySQL, PostgreSQL, SQLite, files, excel, data-warehouse
C# 161
1 month ago