GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

data-preprocessing

Website
Wikipedia
https://static.github-zh.com/github_avatars/zzw922cn?size=40
zzw922cn / Automatic_Speech_Recognition

#计算机科学#End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow

automatic-speech-recognitionTensorflowtimit-datasetfeature-vectorphonemesdata-preprocessingrnnaudio深度学习lstmend-to-endcnnevaluationBukkitspeech-recognitionchinese-speech-recognition
Python 2.84 k
2 年前
https://static.github-zh.com/github_avatars/skrub-data?size=40
skrub-data / skrub

#计算机科学#Machine learning with dataframes

机器学习数据科学data-cleaningdatadata-preparationdata-preprocessing数据分析data-wranglingdataframedataframes
Python 1.41 k
3 天前
https://static.github-zh.com/github_avatars/data-prep-kit?size=40
data-prep-kit / data-prep-kit

#大语言模型#Open source project for data preparation for GenAI applications

data-preparationfinetuning大语言模型llmappsdatadata-prepdata-preprocessingdata-preprocessing-pipelinesdatacurationlarge-language-modelslarge-scale-data-processingPythonrayApache SparkdatarecipesCode qualityEntity resolutionMalware
HTML 698
4 天前
https://static.github-zh.com/github_avatars/Western-OC2-Lab?size=40
Western-OC2-Lab / AutoML-Implementation-for-Static-and-Dynamic-Data-Analytics

#计算机科学#Implementation/Tutorial of using Automated Machine Learning (AutoML) methods for static/batch and online/continual learning

automated-machine-learningautomlconcept-driftdata-preprocessingdata-stream-processingdata-streams深度学习feature-engineeringhyperparameter-tuningintrusion-detection-systemInternet of things机器学习model-selection
Jupyter Notebook 628
1 年前
https://static.github-zh.com/github_avatars/machinelearnjs?size=40
machinelearnjs / machinelearnjs

#计算机科学#Machine Learning library for the web and Node.

机器学习easy-to-useminimalisticWebNode.jsstatistical-learningrandom-forestsvmfeature-extractiondata-preprocessingprobabilistic-modelsstructured-data
TypeScript 541
9 天前
https://static.github-zh.com/github_avatars/akanz1?size=40
akanz1 / klib

Easy to use Python library of customized functions for cleaning and analyzing data.

数据科学数据分析数据可视化Pythonfeature-selectiondata-cleaningdata-preprocessing
Python 514
1 个月前
https://static.github-zh.com/github_avatars/Desbordante?size=40
Desbordante / desbordante-core

Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algor...

data-analyticsdata-cleaningdata-cleansingdata-engineeringdata-explorationdata-miningdata-profiling数据科学data-wranglingdata-preprocessingfeature-selectionfeature-engineeringfeature-extractionSpreadsheettabular-dataanomaly-detectionexploratory-data-analysisknowledge-discovery
C++ 405
11 天前
https://static.github-zh.com/github_avatars/shamspias?size=40
shamspias / customizable-gpt-chatbot

#自然语言处理#A dynamic, scalable AI chatbot built with Django REST framework, supporting custom training from PDFs, documents, websites, and YouTube videos. Leveraging OpenAI's GPT-3.5, Pinecone, FAISS, and Celery...

人工智能聊天机器人data-preprocessingDjangodjango-rest-frameworkgpt-3机器学习自然语言处理Pythonconversational-aivoice-chatvoice-recognitionlangchainlangchain-pythonautogpt
Python 391
1 年前
https://static.github-zh.com/github_avatars/msamogh?size=40
msamogh / nonechucks

#计算机科学#Deal with bad samples in your dataset dynamically, use Transforms as Filters, and more!

PyTorchdata-processingdata-preprocessingdata-pipelinedata-cleaningpreprocessing机器学习torch
Python 377
3 年前
https://static.github-zh.com/github_avatars/harunurrashid97?size=40
harunurrashid97 / 100-Days-Of-ML-Code

#计算机科学#A day to day plan for this challenge. Covers both theoritical and practical aspects

机器学习Pythonedadatascience教程siraj-raval-challengeinfographicsimplementationexploratory-data-analysis数据科学data-preprocessingdecision-treelinear-regressionarticle
Jupyter Notebook 227
2 年前
https://static.github-zh.com/github_avatars/TirendazAcademy?size=40
TirendazAcademy / PANDAS-TUTORIAL

#计算机科学#Jupyter Notebooks and Data Sets for Pandas Library

Pythonpandas-tutorialpandaspandas-dataframedata数据分析data-preprocessing数据科学机器学习
Jupyter Notebook 221
1 年前
https://static.github-zh.com/github_avatars/HasnainRaz?size=40
HasnainRaz / SemSegPipeline

#计算机科学#A simpler way of reading and augmenting image segmentation data into TensorFlow

Tensorflow深度学习data-augmentationsemantic-segmentationPythondata-preprocessingaugmentationpipelineimage-augmentationimage-preprocessing
Python 144
5 年前
https://static.github-zh.com/github_avatars/thepanacealab?size=40
thepanacealab / SMMT

Social Media Mining Toolkit (SMMT) main repository

annotationtwitter-apidata-annotationdata-preprocessingspaCygatheringtweets
Python 137
3 年前
https://static.github-zh.com/github_avatars/triton-inference-server?size=40
triton-inference-server / dali_backend

#计算机科学#The Triton backend that allows running GPU-accelerated data pre-processing pipelines implemented in DALI's python API.

深度学习gpudata-preprocessingPythonfast-data-pipeline图像处理
C++ 135
12 天前
https://static.github-zh.com/github_avatars/dansuh17?size=40
dansuh17 / segan-pytorch

SEGAN pytorch implementation https://arxiv.org/abs/1703.09452

PyTorchdata-preprocessingaudiospeech-enhancementsource-separation
Python 110
6 年前
https://static.github-zh.com/github_avatars/TensorMSA?size=40
TensorMSA / tensormsa

#计算机科学#Deep learning GUI frame work for enterprise

深度学习机器学习TensorflowDockergpumicroservices-architecturedata-preprocessingDocker Compose
Python 109
7 年前
https://static.github-zh.com/github_avatars/Mohan-Zhang-u?size=40
Mohan-Zhang-u / mzutils

#自然语言处理#

深度学习机器学习question-answeringTensorflowtensorflow2torchdata-preprocessing数据可视化自然语言处理reinforcement-learningtoolkitreadthedocs
Python 107
2 年前
https://static.github-zh.com/github_avatars/Mohan-Zhang-u?size=40
Mohan-Zhang-u / mzutils

#自然语言处理#

深度学习机器学习question-answeringTensorflowtensorflow2torchdata-preprocessing数据可视化自然语言处理reinforcement-learningtoolkitreadthedocs
Python 104
2 年前
https://static.github-zh.com/github_avatars/asavinov?size=40
asavinov / prosto

Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby

workflowdata-processingmap-reduceApache SparkpandasPythonfeature-engineering数据科学data-wranglingdata-preprocessingdata-preparationbusiness-intelligenceolap
Python 91
4 年前
https://static.github-zh.com/github_avatars/wangxb96?size=40
wangxb96 / Awesome-EdgeAI

#Awesome#Resources of our survey paper "Optimizing Edge AI: A Comprehensive Survey on Data, Model, and System Strategies"

data-preprocessingedge-aiedge-computing机器学习model-compressionmodel-inferenceAwesome Lists深度学习model-deployment
87
5 个月前
loading...