STUMPY is a powerful and scalable Python library for modern time series analysis
Koalas: pandas API on Apache Spark
Extract data from a wide range of Internet sources into a pandas DataFrame.
A distributed task scheduler for Dask
Clean APIs for data cleaning. Python implementation of R package Janitor
A clean, three-column Sphinx theme with Bootstrap for the PyData community
PyData, The Complete Works of
#计算机科学#High-Performance Python Compute Engine for Data and AI
RFC document, tooling and other content related to the array API standard
#自然语言处理#Notebooks for the Seattle PyData 2017 talk on Scattertext
Social network analysis code examples for PyCon 2019 talk
Python library for GraphBLAS: high-performance sparse linear algebra for scalable graph analytics
#计算机科学#Machine learning with scikit-learn tutorial at PyData Chicago 2016
#计算机科学#Introduction to Machine Learning with Time Series at PyData Festival Amsterdam 2020