GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

hudi

Website
Wikipedia
https://static.github-zh.com/github_avatars/apache?size=40
apache / doris

Doris 是百度开源的支持对海量大数据进行快速分析的MPP数据库。

olap数据库hadoophivehudiicebergreal-timeSQLBigQuerydbtdelta-lakeeltetllakehousequery-engineredshiftsnowflakeApache Spark
Java 13.81 k
1 天前
https://static.github-zh.com/github_avatars/StarRocks?size=40
StarRocks / starrocks

StarRocks 是新一代极速全场景 MPP (Massively Parallel Processing) 数据库。StarRocks 的愿景是能够让用户的数据分析变得更加简单和敏捷。用户无需经过复杂的预处理,就可以用 StarRocks 来支持多种数据分析场景的极速分析。

数据库olapSQLanalyticsbig-datarealtime-databasevectorizeddistributed-databasereal-time-analyticsmppjoinstar-schemareal-time-updatesdelta-lakehudiiceberglakehousedatalakelakehouse-platformcloudnative
Java 10.13 k
1 天前
https://static.github-zh.com/github_avatars/apache?size=40
apache / hudi

Upserts, Deletes And Incremental Processing on Big Data.

hudiapachehudidatalakebigdataapachesparkincremental-processingstream-processingdata-integrationapacheflink
Java 5.84 k
2 天前
alldatacenter/alldata
https://static.github-zh.com/github_avatars/alldatacenter?size=40
alldatacenter / alldata

🔥🔥 AllData可定义数据中台,以数据平台为底座,以数据中台为桥梁,以机器学习平台为工厂,以大模型应用为上游产品,提供全链路数字化解决方案。采购商业版、加入技术社区:https://docs.qq.com/doc/DVHlkSEtvVXVCdEFo

griffinhudiicebergpaimondatartdinkystreamparkcube-studiodatasophoncloudeondolphinschedulerdatahubdatavinesopenmetadataalldata
Java 2.76 k
25 天前
https://static.github-zh.com/github_avatars/collabH?size=40
collabH / bigdata-growth

大数据知识仓库涉及到数据仓库建模、实时计算、大数据、数据中台、系统设计、Java、算法等。

flinkkafkahivemapreduceApache Sparkolaphadoophbasedebeziumhdfsbigdatahudi
Shell 1.63 k
1 个月前
https://static.github-zh.com/github_avatars/Mrkuhuo?size=40
Mrkuhuo / data-warehouse-learning

【2025最新版】 大数据 数据分析 电商系统 实时数仓 离线数仓 数据湖 建设方案及实战代码,涉及组件 #flink #paimon #doris #seatunnel #dolphinscheduler #datart #dinky #hudi #iceberg。

datartdinkydolphinschedulerdorisflinkhudiicebergpaimonseatunnel
Java 864
8 天前
https://static.github-zh.com/github_avatars/leesf?size=40
leesf / hudi-resources

汇总Apache Hudi相关资料

hudiapachehudiapachedatalakebigdatastream-processingincremental-processingdata-integration
553
7 天前
https://static.github-zh.com/github_avatars/fancyChuan?size=40
fancyChuan / bigdata-hub

数据建设与大数据技术知识体系,包含hadoop、hive、spark、flink主流框架和系列框架,数据中台、数据湖、数据治理、数仓建设、数据化转型等

bigdatahadoopApache Sparkflinkhivehudikafkaclickhouseseatunnel
Java 401
3 个月前
https://static.github-zh.com/github_avatars/apache?size=40
apache / hudi-rs

The native Rust implementation for Apache Hudi, with C++ & Python API bindings.

apachehudiPythonRustC++
Rust 223
5 天前
https://static.github-zh.com/github_avatars/izhangzhihao?size=40
izhangzhihao / Real-time-Data-Warehouse

Real-time Data Warehouse with Apache Flink & Apache Kafka & Apache Hudi

flinkdata-warehousedata-warehousingflink-sqldebeziumkafkaelasticsearchdelta-lakecdcchange-data-capturehudiicebergSQLdatalakedeltadeltalakeApache Sparkspark-sql
Dockerfile 113
2 年前
https://static.github-zh.com/github_avatars/WeBankFinTech?size=40
WeBankFinTech / Streamis

Streaming application development and management system, based on Linkis and DSS, planning to provide the workflow-like graphical drag-and-drop development capability.

flinklinkisstreaminghudiicebergdatalakekafkadeltalake
Java 107
2 个月前
https://static.github-zh.com/github_avatars/apache?size=40
apache / doris-website

Apache Doris Website

dorisanalyticsapachebig-datadata-warehousing数据库datalakedbmsdistributed-systemhadoophivehudiicebergmppolapssbvectorized
TypeScript 99
4 天前
https://static.github-zh.com/github_avatars/leesf?size=40
leesf / hudi-demos

汇总Apache Hudi中的一些Demo,便于快速上手Apache Hudi(Apache Hudi Demos to help beginners know about Hudi)

hudiapachehudi
Java 74
5 年前
https://static.github-zh.com/github_avatars/1ambda?size=40
1ambda / lakehouse

Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC)

flinkhudiicebergApache SparktrinodbtDockerairflowcdcdebeziumkafka
Kotlin 58
2 年前
https://static.github-zh.com/github_avatars/Mrkuhuo?size=40
Mrkuhuo / bigdata_learning

大数据组件学习代码

flinkhadoopJavaPythonApache Sparkclickhousedataxdolphinschedulerdoriselasticsearchhbasehivehudiicebergsqoop
Java 57
1 年前
https://static.github-zh.com/github_avatars/dacort?size=40
dacort / modern-data-lake-storage-layers

Jupyter notebooks and AWS CloudFormation template to show how Hudi, Iceberg, and Delta Lake work

Amazon Web Serviceshudiicebergapache-hudiapache-icebergdelta-lake
Jupyter Notebook 47
3 年前
https://static.github-zh.com/github_avatars/apache?size=40
apache / doris-thirdparty

Self-managed thirdparty dependencies for Apache Doris

analyticsbig-datadata-warehousing数据库datalakedbmsdistributed-databasehadoophivehudiicebergmppolapreal-timeSQLssbvectorized
41
18 天前
https://static.github-zh.com/github_avatars/jaehyeon-kim?size=40
jaehyeon-kim / dbt-on-aws

dbt (data build tool) projects targeting AWS analytics services (redshift, glue, emr, athena) and open table formats

athenadbtdelta-lakeemrgluehudiicebergredshift
HCL 29
2 年前
https://static.github-zh.com/github_avatars/apache?size=40
apache / doris-streamloader

Stream Loader for Apache Doris

BigQuery数据库dbtdelta-lakeeltetlhadoophivehudiiceberglakehouseolapquery-enginereal-timeredshiftsnowflakeApache SparkSQL
Go 24
2 个月前
https://static.github-zh.com/github_avatars/shangyuantech?size=40
shangyuantech / hudi-multistream

Consumption and writing to Hudi based on multiple topic

hudi
Java 8
5 年前
loading...