GitHub Chinese Community (GitHub 中文社区)

apache-iceberg

risingwavelabs / risingwave

Next-generation cloud-native streaming database

database, stream-processing, Rust, PostgreSQL, kafka, materialized-view, data-engineering, apache-iceberg
Rust 7.88 k
12 hours ago

matanolabs / matano

Open source security data lake for threat hunting, detection & response, and cybersecurity analytics at petabyte scale on AWS

Amazon Web Services, cloud, security, big-data, Serverless, apache-iceberg, log-analytics, log-management, threat-hunting, Rust, alerting, cloud-native, aws-security, cloud-security, Cybersecurity, secops, dfir, detection-engineering, siem
Rust 1.57 k
5 months ago

apache / incubator-xtable

Apache XTable (incubating) is a cross-table converter for lakehouse table formats that facilitates interoperability across data processing systems and query engines.

apache-hudi, apache-iceberg, delta-lake
Java 1.07 k
9 days ago

datazip-inc / olake

Fastest open-source tool for replicating Databases to Data Lake in Open Table Formats like Apache Iceberg. ⚡ Efficient, quick and scalable data ingestion for real-time analytics. Supporting Postgres,...

cdc, change-data-capture, data-pipeline, database, elt, lakehouse, replication, apache-iceberg, parquet, s3
Go 889
3 days ago

tansu-io / tansu

Apache Kafka® compatible broker with S3, PostgreSQL, Apache Iceberg and Delta Lake

built-with-rust, PostgreSQL, s3, apache-iceberg, apache-kafka, apache-arrow, datafusion, parquet, delta-lake, datalake
Rust 387
5 days ago

cuebook / cuelake

Use SQL to build ELT pipelines on a data lakehouse.

apache-iceberg, delta, lakehouse, datalake, data-lake, elt, etl, data-engineering, data-integration, data-ingestion, Apache Spark, spark-sql, data-transfer, pipelines, data-pipeline, zeppelin-notebook, SQL
JavaScript 287
3 years ago

lhbench / lhbench

Lakehouse storage system benchmark

apache-hudi, apache-iceberg, lakehouse, benchmark, cidr, database, databricks, delta-lake
Scala 75
2 years ago

dominikhei / Local-Data-LakeHouse

Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testing.

apache-iceberg, data-lake, lakehouse, minio, trino
Dockerfile 71
2 years ago

nimtable / nimtable

The Control Plane for Apache Iceberg

apache-iceberg, datalake, lakehouse, iceberg, polaris
TypeScript 62
4 days ago

dacort / modern-data-lake-storage-layers

Jupyter notebooks and AWS CloudFormation template to show how Hudi, Iceberg, and Delta Lake work

Amazon Web Services, hudi, iceberg, apache-hudi, apache-iceberg, delta-lake
Jupyter Notebook 47
3 years ago

abeltavares / real-time-data-pipeline

📡 Real-time data pipeline with Kafka, Flink, Iceberg, Trino, MinIO, and Superset. Ideal for learning data systems.

apache-flink, apache-iceberg, apache-kafka, apache-superset, big-data, data-engineering, data-pipeline, Docker, lakehouse, minio, real-time-data, streaming-analytics, trino, data-visualization, Open Source, etl, Amazon Web Services, s3
Python 44
5 months ago

hyparam / icebird

Icebird: JavaScript Iceberg Client

iceberg, JavaScript, parquet, apache-iceberg, data-lake, datalake, data-engineering
JavaScript 36
1 month ago

aws-samples / transactional-datalake-using-apache-iceberg-on-aws-glue

Stream CDC into an Amazon S3 data lake in Apache Iceberg table format with AWS Glue Streaming and DMS

apache-iceberg, Apache Spark
Python 32
4 months ago

aws-samples / sample-pace-data-analytics-ml-ai

DAIVI is a reference solution with IAC modules to accelerate development of Data, Analytics, AI and Visualization applications on AWS using the next generation Amazon SageMaker Unified Studio. The goa...

apache-iceberg, sagemaker, Terraform
HCL 28
6 days ago

guidok91 / spark-movies-etl

Spark data pipeline that processes movie ratings data.

Apache Spark, pyspark, etl, data-engineering, elt, apache-airflow, data-pipeline, uv, apache-iceberg
Python 28
10 days ago

bodo-ai / denali

An open-source, community-driven REST catalog for Apache Iceberg!

apache-iceberg, catalog, Go, iceberg
Go 28
1 year ago

aws-samples / monitoring-apache-iceberg-table-metadata-layer

Sample code to collect Apache Iceberg metrics for table monitoring

apache-iceberg, Amazon Web Services, aws-lambda, data-quality, monitoring, Apache Spark, pyiceberg
Python 28
10 months ago

aws-samples / iceberg-streaming-examples

This repo contains examples of high throughput ingestion using Apache Spark and Apache Iceberg. These examples cover IoT and CDC scenarios using best practices. The code can be deployed into any Spark...

apache-iceberg, Apache Spark
Java 26
7 months ago

aws-samples / aws-glue-streaming-etl-with-apache-iceberg

Streaming ETL job cases in AWS Glue to integrate Iceberg and creating an in-place updatable data lake on Amazon S3

apache-iceberg, Apache Spark
Python 23
9 months ago

tj--- / iceberg-demo

A sample implementation of stream writes to an Iceberg table on GCS using Flink and reading it using Trino

apache-flink, apache-iceberg, apache-kafka, flink, iceberg, kafka, trino, gcs, Java
Java 20
3 years ago