GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

preprocessing

Website
Wikipedia
https://static.github-zh.com/github_avatars/Unstructured-IO?size=40
Unstructured-IO / unstructured

#自然语言处理#Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to...

深度学习document-parsing机器学习自然语言处理OCRinformation-retrievaldata-pipelinespreprocessingpdf-to-textpdfpdf-to-jsondocument-image-analysisdonutdocument-image-processingdocument-parserdocxlangchain大语言模型
HTML 11.49 k
2 天前
https://static.github-zh.com/github_avatars/dongrixinyu?size=40
dongrixinyu / JioNLP

#自然语言处理#中文 NLP 预处理、解析工具包,准确、高效、易用 A Chinese NLP Preprocessing & Parsing Package www.jionlp.com

自然语言处理Pythonapache2ner中文time-parsenlp-parsepreprocessingtime-parsing
Python 3.64 k
1 个月前
nidhaloff/igel
https://static.github-zh.com/github_avatars/nidhaloff?size=40
nidhaloff / igel

#计算机科学#a delightful machine learning tool that allows you to train, test, and use models without writing code

机器学习人工智能神经网络neural-networksscikit-learnscikitlearn-machine-learning数据科学数据分析preprocessing自动化automlHacktoberfesthacktoberfest2021
Python 3.12 k
2 年前
https://static.github-zh.com/github_avatars/OpenGene?size=40
OpenGene / fastp

An ultra-fast all-in-one FASTQ preprocessor (QC/adapters/trimming/filtering/splitting/merging...)

fastqqcpreprocessingfilteringadapteroverlapCode qualitytrimmingsplittingquality-controlfilterngsBioinformaticsumisequencing
C++ 2.1 k
9 天前
AxeldeRomblay/MLBox
https://static.github-zh.com/github_avatars/AxeldeRomblay?size=40
AxeldeRomblay / MLBox

#计算机科学#MLBox is a powerful Automated Machine Learning python library.

机器学习auto-mlkaggle深度学习stackingpipelineoptimizationpreprocessingencodingpredictiondistributedxgboostdriftclassificationregressionlightgbmKerasautomated-machine-learningautoml数据科学
Python 1.52 k
2 年前
winedarksea/AutoTS
https://static.github-zh.com/github_avatars/winedarksea?size=40
winedarksea / AutoTS

#计算机科学#Automated Time Series Forecasting

time-series机器学习automlforecasting深度学习preprocessingfeature-engineering
Python 1.29 k
2 个月前
sunlabuiuc/PyHealth
https://static.github-zh.com/github_avatars/sunlabuiuc?size=40
sunlabuiuc / PyHealth

#计算机科学#A Deep Learning Python Toolkit for Healthcare Applications.

healthcaredata-mining深度学习preprocessingclinical-dataclinical-researchelectronic-medical-recordelectronic-health-record
Python 1.17 k
8 天前
https://static.github-zh.com/github_avatars/NVIDIA-Merlin?size=40
NVIDIA-Merlin / NVTabular

#计算机科学#NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale datasets used to train deep learning based recommender systems.

深度学习feature-engineeringfeature-selectiongpu机器学习Nvidiapreprocessingrecommendation-systemrecommender-system
Python 1.09 k
9 个月前
KinWaiCheuk/nnAudio
https://static.github-zh.com/github_avatars/KinWaiCheuk?size=40
KinWaiCheuk / nnAudio

Audio processing by using pytorch 1D convolution network

PyTorchaudio-processingpreprocessingspectrogram神经网络
Python 1.07 k
1 个月前
https://static.github-zh.com/github_avatars/TheAlgorithms?size=40
TheAlgorithms / R

#学习与技能提升#Collection of various algorithms implemented in R.

算法R教学机器学习practicelearningpreprocessingregressiondata-miningclusteringclassificationHacktoberfest
R 964
2 个月前
https://static.github-zh.com/github_avatars/MinishLab?size=40
MinishLab / semhash

#数据仓库#Fast Semantic Text Deduplication & Filtering

数据集Entity resolutionpreprocessing
Python 727
19 天前
https://static.github-zh.com/github_avatars/pytorch?size=40
pytorch / torcharrow

High performance model preprocessing library on PyTorch

PythonpreprocessingPyTorch
Python 650
1 年前
https://static.github-zh.com/github_avatars/qd-cae?size=40
qd-cae / awesome-CAE

A curated list of awesome CAE frameworks, libraries and software.

caeLibrarycollectionpreprocessingscripting工具有限元法 (FEM)Computational Fluid Dynamics (CFD)
416
10 个月前
https://static.github-zh.com/github_avatars/R1j1t?size=40
R1j1t / contextualSpellCheck

#自然语言处理#✔️Contextual word checker for better suggestions (not actively maintained)

spaCyspacy-extension自然语言处理spellcheckpreprocessingberthelp-wanted聊天机器人spellcheckerspelling-correctionPython
Python 414
4 个月前
https://static.github-zh.com/github_avatars/msamogh?size=40
msamogh / nonechucks

#计算机科学#Deal with bad samples in your dataset dynamically, use Transforms as Filters, and more!

PyTorchdata-processingdata-preprocessingdata-pipelinedata-cleaningpreprocessing机器学习torch
Python 377
3 年前
https://static.github-zh.com/github_avatars/MaxHalford?size=40
MaxHalford / xam

#计算机科学#🎯 Personal data science and machine learning toolbox

Python机器学习数据科学preprocessingstacking
Python 365
5 年前
https://static.github-zh.com/github_avatars/DataCanvasIO?size=40
DataCanvasIO / HyperGBM

A full pipeline AutoML tool for tabular data

automlgbmxgboostlightgbmcatboostsemi-supervised-learningdatacleaningpreprocessingensemble-learningtabular-datadistributed-trainingdaskgpu-accelerationrapidsaiscikit-learn
Python 350
2 个月前
https://static.github-zh.com/github_avatars/ikegami-yukino?size=40
ikegami-yukino / jaconv

Pure-Python Japanese character interconverter for Hiragana, Katakana, Hankaku, and Zenkaku

japanese-languagepreprocessingtext-processing
Python 331
5 个月前
https://static.github-zh.com/github_avatars/advaitsave?size=40
advaitsave / Introduction-to-Time-Series-forecasting-Python

#时序数据库#Introduction to time series preprocessing and forecasting in Python using AR, MA, ARMA, ARIMA, SARIMA and Prophet model with forecast evaluation.

time-seriesforecastingPythonarimaarmatime-series-forecastingpreprocessingseasonality
Jupyter Notebook 324
7 年前
https://static.github-zh.com/github_avatars/cylondata?size=40
cylondata / cylon

#计算机科学#Cylon is a fast, scalable, distributed memory, parallel runtime with a Pandas like DataFrame.

datajoinshuffledataframempi深度学习preprocessing
C++ 301
1 年前
loading...