Doris 是百度开源的支持对海量大数据进行快速分析的MPP数据库。
trino 是一个分布式大数据 SQL 查询引擎(前身 PrestoSQL)
StarRocks 是新一代极速全场景 MPP (Massively Parallel Processing) 数据库。StarRocks 的愿景是能够让用户的数据分析变得更加简单和敏捷。用户无需经过复杂的预处理,就可以用 StarRocks 来支持多种数据分析场景的极速分析。
Iceberg 是用于庞大分析数据集的开放表格式。 Iceberg 为大数据带来了 SQL 表的可靠性和简单性,同时让 Spark、Trino、Flink、Presto、Hive 和 Impala 等引擎能够同时安全地使用相同的表。
#编辑器#:antarctica: Bluish color scheme for Vim and Neovim
A high-performance SQL engine written in C++, designed for real-time data processing. It can read millions of rows per second from ClickHouse, Kafka, Pulsar, or REST, and write results back with low l...
Apache Polaris, the interoperable, open source catalog for Apache Iceberg
Real-time analytics on Postgres tables
Open-source Snowflake and Fivetran alternative bundled together
Provide JSON file template that demonstrate how to create customize Well-Architected reviews using Custom lenses.
Nessie: Transactional Catalog for Data Lakes with Git-like semantics
【2025最新版】 大数据 数据分析 电商系统 实时数仓 离线数仓 数据湖 建设方案及实战代码,涉及组件 #flink #paimon #doris #seatunnel #dolphinscheduler #datart #dinky #hudi #iceberg。
ClickBench: a Benchmark For Analytical Databases
DuckDB-powered data lake analytics from Postgres
Open Control Plane for Tables in Data Lakehouse
Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg