GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

parquet

Website
Wikipedia
https://static.github-zh.com/github_avatars/apache?size=40
apache / arrow

Apache Arrow 是用于内存分析的开发平台,支持多语言。包含一个标准化的物件栏内存格式,且能够表示平面及层级化数据,以便在现代CPU和GPU硬体上进行高效率的分析操作。

arrowparquet
C++ 15.56 k
15 小时前
https://static.github-zh.com/github_avatars/multiprocessio?size=40
multiprocessio / dsq

Commandline tool for running SQL queries against JSON, CSV, Excel, Parquet, and more.

GoCSVJSONtsvexcelopenoffice-calcparquetSQL命令行界面
Go 3.84 k
2 年前
https://static.github-zh.com/github_avatars/roapi?size=40
roapi / roapi

#数据仓库#Create full-fledged APIs for slowly moving datasets without writing a single line of code.

SQLGraphQLarrowREST APIanalyticsQuery (disambiguation)columnarRustin-memory-databasedatafusionblob-storagecloud-nativeparquet数据集s3delta-lake
Rust 3.31 k
1 个月前
https://static.github-zh.com/github_avatars/apache?size=40
apache / arrow-rs

Official Rust implementation of Apache Arrow

arrowparquetRust
Rust 2.98 k
2 天前
dathere/qsv
https://static.github-zh.com/github_avatars/dathere?size=40
dathere / qsv

#数据仓库#Blazing-fast Data-Wrangling toolkit

CSVdata-wrangling命令行界面Open Datadata-engineeringckanexcelluauparquetPostgreSQLSQLitepolarsSQLgeocodetimeseriesdcatmetadata统计samplinglibreoffice
Rust 2.88 k
1 天前
https://static.github-zh.com/github_avatars/apache?size=40
apache / parquet-java

Apache Parquet Java

parquetapacheparquet-java
Java 2.85 k
8 天前
rilldata/rill
https://static.github-zh.com/github_avatars/rilldata?size=40
rilldata / rill

Rill is a tool for effortlessly transforming data sets into powerful, opinionated dashboards using SQL. BI-as-code.

duckdbSveltesveltekit数据可视化CSVparquetGos3gcs数据分析SQLbidatabusiness-analyticssql-editor
Go 2.08 k
4 天前
https://static.github-zh.com/github_avatars/apache?size=40
apache / drill

Apache Drill is a distributed MPP query layer for self describing data

Javabig-dataSQLhivehadoopjdbcparquet
Java 1.98 k
18 天前
https://static.github-zh.com/github_avatars/apache?size=40
apache / parquet-format

Apache Parquet Format

parquetapache
Thrift 1.97 k
15 天前
https://static.github-zh.com/github_avatars/uber?size=40
uber / petastorm

#计算机科学#Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, a...

TensorflowPyTorch深度学习机器学习pysparkparquet
Python 1.84 k
2 年前
https://static.github-zh.com/github_avatars/gchq?size=40
gchq / Gaffer

A large-scale entity and relation database supporting aggregation of properties

accumulographgraph-databasehadoopbig-dataaggregationhbaseparquetApache Spark
Java 1.79 k
10 天前
Mooncake-Labs/pg_mooncake
https://static.github-zh.com/github_avatars/Mooncake-Labs?size=40
Mooncake-Labs / pg_mooncake

Real-time analytics on Postgres tables

analyticscolumnstoredelta-lakeiceberglakehouseparquetPostgreSQL
Rust 1.48 k
4 天前
https://static.github-zh.com/github_avatars/BemiHQ?size=40
BemiHQ / BemiDB

Single-binary Postgres read replica optimized for analytics

analyticsdata-warehouseduckdbicebergolapparquetPostgreSQLreplication
Go 1.39 k
1 个月前
https://static.github-zh.com/github_avatars/paradigmxyz?size=40
paradigmxyz / cryo

cryo is the easiest way to extract blockchain data to parquet, csv, json, or python dataframes

crypto以太坊evmparquetRust
Rust 1.39 k
5 个月前
https://static.github-zh.com/github_avatars/quiltdata?size=40
quiltdata / quilt

Quilt is a data mesh for connecting people with actionable data

datadata-engineeringdata-version-controldata-versioningPythonserializationparquet
TypeScript 1.34 k
4 天前
https://static.github-zh.com/github_avatars/tonbo-io?size=40
tonbo-io / tonbo

A portable embedded database using Arrow.

arrowbig-dataRustembedded-database数据库htaplsm-treeoffline-firstparquet
Rust 1.06 k
4 天前
https://static.github-zh.com/github_avatars/bigdatagenomics?size=40
bigdatagenomics / adam

ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark, and Apache Parquet. Apache 2 licensed.

Apache Sparkbig-dataBioinformaticsgenomicsparquetavroScalaJavaPythonR
Scala 1.03 k
1 个月前
https://static.github-zh.com/github_avatars/julien040?size=40
julien040 / anyquery

#大语言模型#Query anything (GitHub, Notion, +40 more) with SQL and let LLMs (ChatGPT, Claude) connect to using MCP

APIbusiness-intelligenceCSV数据可视化数据库JSONMySQLparquetSQLSQLiteanalyticsGoGitHubNotionsalesforce人工智能ChatGPTclaude大语言模型mcp
Go 928
5 天前
https://static.github-zh.com/github_avatars/mukunku?size=40
mukunku / ParquetViewer

Simple Windows desktop application for viewing & querying Apache Parquet files

parquetwindows-desktop.NETbig-dataapache-parquet
C# 903
9 天前
https://static.github-zh.com/github_avatars/datazip-inc?size=40
datazip-inc / olake

Fastest open-source tool for replicating Databases to Data Lake in Open Table Formats like Apache Iceberg. ⚡ Efficient, quick and scalable data ingestion for real-time analytics. Supporting Postgres,...

cdcchange-data-capturedata-pipeline数据库eltlakehousereplicationapache-icebergparquets3
Go 889
3 天前
loading...