A very fast DataFrame library with support for Rust, Python, and Node.js
A light-weight, flexible, and expressive statistical data testing library
The Universal Storage Engine
#Data warehouse# In-memory tabular data in Julia
#Computer science# Machine learning with dataframes
#Computer science# DataFrames for Go: For statistics, machine-learning, and data manipulation/exploration
PySpark-Tutorial provides basic algorithms using PySpark
Series (one-dimensional) and dataframes (two-dimensional) for fast and elegant data exploration in Elixir
GraphFrames is a package for Apache Spark which provides DataFrame-based Graphs
Easy pipelines for pandas DataFrames.
#Computer science# Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch
Pandas, Polars, Spark, and Snowpark DataFrame comparison for humans and more!
Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.
#Data warehouse# Metaprogramming tools for DataFrames
Immutable and statically-typeable DataFrames with runtime type and data validation
A package to easily open an instance of a Google spreadsheet and interact with worksheets through Pandas DataFrames.
An introductory workshop on pandas with notebooks and exercises for following along. Slides contain all solutions.
64-bit multithreaded Python data analytics tools for NumPy arrays and datasets
#Computer science# ⛈️ RumbleDB 1.23.0 "Mountain Ash" 🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to downl...
#Algorithm practice# O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian