GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

preprocessing

Website
Wikipedia
https://static.github-zh.com/github_avatars/Unstructured-IO?size=40
Unstructured-IO / unstructured

#自然语言处理#Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to...

深度学习document-parsing机器学习自然语言处理OCRinformation-retrievaldata-pipelinespreprocessingpdf-to-textpdfpdf-to-jsondocument-image-analysisdonutdocument-image-processingdocument-parserdocxlangchain大语言模型
HTML 12.14 k
3 天前
https://static.github-zh.com/github_avatars/dongrixinyu?size=40
dongrixinyu / JioNLP

#自然语言处理#中文 NLP 预处理、解析工具包,准确、高效、易用 A Chinese NLP Preprocessing & Parsing Package www.jionlp.com

自然语言处理Pythonapache2ner中文time-parsenlp-parsepreprocessingtime-parsing
Python 3.67 k
12 天前
nidhaloff/igel
https://static.github-zh.com/github_avatars/nidhaloff?size=40
nidhaloff / igel

#计算机科学#a delightful machine learning tool that allows you to train, test, and use models without writing code

机器学习人工智能神经网络neural-networksscikit-learnscikitlearn-machine-learning数据科学数据分析preprocessing自动化automlHacktoberfesthacktoberfest2021
Python 3.12 k
2 年前
https://static.github-zh.com/github_avatars/OpenGene?size=40
OpenGene / fastp

An ultra-fast all-in-one FASTQ preprocessor (QC/adapters/trimming/filtering/splitting/merging...)

fastqqcpreprocessingfilteringadapteroverlapCode qualitytrimmingsplittingquality-controlfilterngsBioinformaticsumisequencing
C++ 2.14 k
25 天前
AxeldeRomblay/MLBox
https://static.github-zh.com/github_avatars/AxeldeRomblay?size=40
AxeldeRomblay / MLBox

#计算机科学#MLBox is a powerful Automated Machine Learning python library.

机器学习auto-mlkaggle深度学习stackingpipelineoptimizationpreprocessingencodingpredictiondistributedxgboostdriftclassificationregressionlightgbmKerasautomated-machine-learningautoml数据科学
Python 1.52 k
2 年前
winedarksea/AutoTS
https://static.github-zh.com/github_avatars/winedarksea?size=40
winedarksea / AutoTS

#计算机科学#Automated Time Series Forecasting

time-series机器学习automlforecasting深度学习preprocessingfeature-engineering
Python 1.31 k
20 天前
sunlabuiuc/PyHealth
https://static.github-zh.com/github_avatars/sunlabuiuc?size=40
sunlabuiuc / PyHealth

#计算机科学#A Deep Learning Python Toolkit for Healthcare Applications.

healthcaredata-mining深度学习preprocessingclinical-dataclinical-researchelectronic-medical-recordelectronic-health-record
Python 1.2 k
4 天前
https://static.github-zh.com/github_avatars/NVIDIA-Merlin?size=40
NVIDIA-Merlin / NVTabular

#计算机科学#NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale datasets used to train deep learning based recommender systems.

深度学习feature-engineeringfeature-selectiongpu机器学习Nvidiapreprocessingrecommendation-systemrecommender-system
Python 1.1 k
1 年前
KinWaiCheuk/nnAudio
https://static.github-zh.com/github_avatars/KinWaiCheuk?size=40
KinWaiCheuk / nnAudio

Audio processing by using pytorch 1D convolution network

PyTorchaudio-processingpreprocessingspectrogram神经网络
Python 1.08 k
2 个月前
https://static.github-zh.com/github_avatars/TheAlgorithms?size=40
TheAlgorithms / R

#学习与技能提升#Collection of various algorithms implemented in R.

算法R教学机器学习practicelearningpreprocessingregressiondata-miningclusteringclassificationHacktoberfest
R 987
3 个月前
https://static.github-zh.com/github_avatars/MinishLab?size=40
MinishLab / semhash

#数据仓库#Fast Semantic Text Deduplication & Filtering

数据集Entity resolutionpreprocessing
Python 773
2 个月前
https://static.github-zh.com/github_avatars/pytorch?size=40
pytorch / torcharrow

High performance model preprocessing library on PyTorch

PythonpreprocessingPyTorch
Python 649
1 年前
https://static.github-zh.com/github_avatars/qd-cae?size=40
qd-cae / awesome-CAE

A curated list of awesome CAE frameworks, libraries and software.

caeLibrarycollectionpreprocessingscripting工具有限元法 (FEM)Computational Fluid Dynamics (CFD)
427
1 年前
https://static.github-zh.com/github_avatars/R1j1t?size=40
R1j1t / contextualSpellCheck

#自然语言处理#✔️Contextual word checker for better suggestions (not actively maintained)

spaCyspacy-extension自然语言处理spellcheckpreprocessingberthelp-wanted聊天机器人spellcheckerspelling-correctionPython
Python 417
6 个月前
https://static.github-zh.com/github_avatars/msamogh?size=40
msamogh / nonechucks

#计算机科学#Deal with bad samples in your dataset dynamically, use Transforms as Filters, and more!

PyTorchdata-processingdata-preprocessingdata-pipelinedata-cleaningpreprocessing机器学习torch
Python 377
3 年前
https://static.github-zh.com/github_avatars/MaxHalford?size=40
MaxHalford / xam

#计算机科学#🎯 Personal data science and machine learning toolbox

Python机器学习数据科学preprocessingstacking
Python 365
5 年前
https://static.github-zh.com/github_avatars/DataCanvasIO?size=40
DataCanvasIO / HyperGBM

A full pipeline AutoML tool for tabular data

automlgbmxgboostlightgbmcatboostsemi-supervised-learningdatacleaningpreprocessingensemble-learningtabular-datadistributed-trainingdaskgpu-accelerationrapidsaiscikit-learn
Python 353
3 个月前
https://static.github-zh.com/github_avatars/ikegami-yukino?size=40
ikegami-yukino / jaconv

Pure-Python Japanese character interconverter for Hiragana, Katakana, Hankaku, and Zenkaku

japanese-languagepreprocessingtext-processing
Python 331
7 个月前
https://static.github-zh.com/github_avatars/advaitsave?size=40
advaitsave / Introduction-to-Time-Series-forecasting-Python

#时序数据库#Introduction to time series preprocessing and forecasting in Python using AR, MA, ARMA, ARIMA, SARIMA and Prophet model with forecast evaluation.

time-seriesforecastingPythonarimaarmatime-series-forecastingpreprocessingseasonality
Jupyter Notebook 327
7 年前
https://static.github-zh.com/github_avatars/Razor12911?size=40
Razor12911 / xtool

#安全#Just some tool repackers like to use...

archivingcompressionpreprocessingrepackingencryption
Pascal 311
2 年前
loading...