#Awesome#A curated list of awesome big data frameworks, ressources and other awesomeness.
Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
Fancy stream processing made operationally mundane
Real-time Data Integration and Transformation: use SQL to transform, deliver, and act on fast-changing data.
#计算机科学#🌊 Online machine learning in Python
Readyset is a MySQL and Postgres wire-compatible caching layer that sits in front of existing databases to speed up queries and horizontally scale read throughput. Under the hood, ReadySet caches the ...
🦀 event stream processing for developers to collect and transform data in motion to power responsive data intensive applications.
Utils for streaming large files (S3, HDFS, gzip, bz2...)
Open-source graph database, tuned for dynamic analytics environments. Easy to adopt, scale and own.
A lightweight stream processing library for Go
Pravega - Streaming as a new software defined storage primitive
#计算机科学#Python Stream Processing
#计算机科学#Python Streaming DataFrames for Kafka
Open-Source Web UI for managing Apache Kafka clusters
Trill is a single-node query processor for temporal or streaming data.
📐 Pushing the boundaries of simplicity
Superdiff provides a complete and readable diff for both arrays and objects. Plus, it supports stream and file inputs for handling large datasets efficiently, is battle-tested, has zero dependencies, ...