#

parquet

questdb/questdb
https://static.github-zh.com/github_avatars/questdb?size=40
Java 16.1 k
2 小时前
https://static.github-zh.com/github_avatars/apache?size=40

Apache Arrow 是用于内存分析的开发平台,支持多语言。包含一个标准化的物件栏内存格式,且能够表示平面及层级化数据,以便在现代CPU和GPU硬体上进行高效率的分析操作。

C++ 15.95 k
16 小时前
https://static.github-zh.com/github_avatars/multiprocessio?size=40

Commandline tool for running SQL queries against JSON, CSV, Excel, Parquet, and more.

Go 3.86 k
2 年前
https://static.github-zh.com/github_avatars/apache?size=40

Official Rust implementation of Apache Arrow

Rust 3.13 k
1 天前
https://static.github-zh.com/github_avatars/apache?size=40
Java 2.94 k
5 天前
rilldata/rill
https://static.github-zh.com/github_avatars/rilldata?size=40

Rill is a tool for effortlessly transforming data sets into powerful, opinionated dashboards using SQL. BI-as-code.

Go 2.33 k
12 小时前
https://static.github-zh.com/github_avatars/apache?size=40
Thrift 2.04 k
1 天前
https://static.github-zh.com/github_avatars/apache?size=40

Apache Drill is a distributed MPP query layer for self describing data

Java 1.99 k
4 天前
https://static.github-zh.com/github_avatars/uber?size=40

#计算机科学#Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, a...

Python 1.86 k
1 个月前
https://static.github-zh.com/github_avatars/gchq?size=40

A large-scale entity and relation database supporting aggregation of properties

Java 1.79 k
3 个月前
https://static.github-zh.com/github_avatars/BemiHQ?size=40

Open-source Snowflake and Fivetran alternative bundled together

Go 1.46 k
10 天前
https://static.github-zh.com/github_avatars/paradigmxyz?size=40

cryo is the easiest way to extract blockchain data to parquet, csv, json, or python dataframes

Rust 1.44 k
8 个月前
https://static.github-zh.com/github_avatars/quiltdata?size=40
TypeScript 1.35 k
14 小时前
https://static.github-zh.com/github_avatars/tonbo-io?size=40
Rust 1.17 k
5 天前
https://static.github-zh.com/github_avatars/datazip-inc?size=40

Fastest open-source tool for replicating Databases to Data Lake in Open Table Formats like Apache Iceberg. ⚡ Efficient, quick and scalable data ingestion for real-time analytics. Supporting Postgres,...

Go 1.05 k
1 小时前
https://static.github-zh.com/github_avatars/bigdatagenomics?size=40

ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark, and Apache Parquet. Apache 2 licensed.

Scala 1.04 k
2 个月前
loading...
Website
Wikipedia