STUMPY is a powerful and scalable Python library for modern time series analysis
#计算机科学#Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.
A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner
#计算机科学#A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.
A distributed task scheduler for Dask
#计算机科学#🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
Eliot: the logging system that tells you *why* it happened
Python package for earth-observing satellite data processing
#计算机科学#Scalable machine 🤖 learning for time series forecasting.
Pandas, Polars, Spark, and Snowpark DataFrame comparison for humans and more!
Fast data store for Pandas time-series data
Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.
Distributed SQL Engine in Python using Dask
Geospatial image resampling in Python
Library of derived climate variables, ie climate indicators, based on xarray.
A full pipeline AutoML tool for tabular data