#自然语言处理#Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to...
#自然语言处理#中文 NLP 预处理、解析工具包,准确、高效、易用 A Chinese NLP Preprocessing & Parsing Package www.jionlp.com
#计算机科学#a delightful machine learning tool that allows you to train, test, and use models without writing code
An ultra-fast all-in-one FASTQ preprocessor (QC/adapters/trimming/filtering/splitting/merging...)
#计算机科学#MLBox is a powerful Automated Machine Learning python library.
#计算机科学#Automated Time Series Forecasting
#计算机科学#A Deep Learning Python Toolkit for Healthcare Applications.
#计算机科学#NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale datasets used to train deep learning based recommender systems.
Audio processing by using pytorch 1D convolution network
#学习与技能提升#Collection of various algorithms implemented in R.
#数据仓库#Fast Semantic Text Deduplication & Filtering
High performance model preprocessing library on PyTorch
A curated list of awesome CAE frameworks, libraries and software.
#自然语言处理#✔️Contextual word checker for better suggestions (not actively maintained)
#计算机科学#Deal with bad samples in your dataset dynamically, use Transforms as Filters, and more!
#计算机科学#🎯 Personal data science and machine learning toolbox
A full pipeline AutoML tool for tabular data
Pure-Python Japanese character interconverter for Hiragana, Katakana, Hankaku, and Zenkaku
#时序数据库#Introduction to time series preprocessing and forecasting in Python using AR, MA, ARMA, ARIMA, SARIMA and Prophet model with forecast evaluation.
#计算机科学#Cylon is a fast, scalable, distributed memory, parallel runtime with a Pandas like DataFrame.