GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

data-preprocessing-pipelines

Website
Wikipedia
https://static.github-zh.com/github_avatars/data-prep-kit?size=40
data-prep-kit / data-prep-kit

#大语言模型#Open source project for data preparation for GenAI applications

data-preparationfinetuning大语言模型llmappsdatadata-prepdata-preprocessingdata-preprocessing-pipelinesdatacurationlarge-language-modelslarge-scale-data-processingPythonrayApache SparkdatarecipesCode qualityEntity resolutionMalware
HTML 698
4 天前
https://static.github-zh.com/github_avatars/preprocessy?size=40
preprocessy / preprocessy

#计算机科学#Python package for Customizable Data Preprocessing Pipelines

pipelinespreprocessing机器学习python-librarydata-engineering数据科学data-preprocessing-pipelinesHacktoberfesthacktoberfest2022
Jupyter Notebook 42
2 个月前
https://static.github-zh.com/github_avatars/shamspias?size=40
shamspias / gpt3-data-preprocessing

#计算机科学#This repository containing code for preprocessing text data from PDF and DOCX files for use with GPT-3. It includes steps such as tokenization, removal of stop words and punctuation, and formatting fo...

人工智能data-preprocessingdata-preprocessing-pipelines数据科学gpt-3机器学习
Python 6
2 年前
https://static.github-zh.com/github_avatars/firefly-cpp?size=40
firefly-cpp / succulent

#计算机科学#Collect POST requests

data-collectiondata-preprocessing-pipelines数据科学ESP32机器学习树莓派
Python 3
1 个月前
https://static.github-zh.com/github_avatars/vuanhngo14?size=40
vuanhngo14 / Decision-Tree-from-Scratch

Understand and Implement decision tree

data-preprocessingdata-preprocessing-pipelines数据可视化decision-tree
Jupyter Notebook 1
1 年前
https://static.github-zh.com/github_avatars/kolhesamiksha?size=40
kolhesamiksha / Nemo_Curator

This repository contains a sample text data-preparation code using Nemo Curator for pre-training or synthetic data generation

curatordata-preprocessing-pipelinesgenerative-ainemoNvidiasynthetic-dataset-generation
Jupyter Notebook 1
6 个月前
https://static.github-zh.com/github_avatars/PrasunDatta?size=40
PrasunDatta / adorsho-praniSheba_Preprocessing-Pipeline-of-Muzzle-Data-of-Cow

This work highlights my contribution as a "ML Engineer" at "adorsho praniSheb"(an ML based agro farming company of Bangladesh) where I was assigned the task of designing the preprocessing pipeline.

data-preprocessing-pipelinesimage-preprocessingJupyter Notebookpython-script
Jupyter Notebook 0
3 年前
https://static.github-zh.com/github_avatars/SaraLittleSquirrel?size=40
SaraLittleSquirrel / Obesity-estimator

#计算机科学#Project for Machine Learning Data Mining course

adaboostdata-miningdata-preprocessing-pipelinesdecision-tree机器学习NumPypandasrandom-forestscikit-learnsupport-vector-machines
Jupyter Notebook 0
2 年前
https://static.github-zh.com/github_avatars/DigitalLifeYZQiu?size=40
DigitalLifeYZQiu / Data-Process-Library

The data process library to help better industrial data understanding.

data-preprocessing-pipelines
Jupyter Notebook 0
1 个月前
https://static.github-zh.com/github_avatars/MustofAhmed41?size=40
MustofAhmed41 / Data-Preprocessing-using-Distributed-Database

#计算机科学#Machine learning models cannot be directly applied to raw data. This desktop application consists of a central server and two client servers. The main servers send raw data to clients, where the data ...

数据库机器学习plsqldata-preprocessing-pipelinesdistributed-database
0
3 年前