GitHub Chinese Community

#data-prep

NVIDIA-NeMo / Curator

#LLM#Scalable data preprocessing and curation toolkit for LLMs

data-curation, LLM, data, data-prep, data-preparation, data-processing, data-quality, datacuration, datarecipes, Entity resolution, fine-tuning, large-language-models, large-scale-data-processing, llmapps, Python
Python 1.05k
8 hours ago
data-prep-kit / data-prep-kit

#LLM#Open source project for data preparation for GenAI applications

data-preparation, finetuning, LLM, llmapps, data, data-prep, data-preprocessing, data-preprocessing-pipelines, datacuration, large-language-models, large-scale-data-processing, Python, ray, Apache Spark, datarecipes, Code quality, Entity resolution, Malware
HTML 754
9 hours ago
data-integrations / wrangler

Wrangler Transform: A DMD system for transforming Big Data

wrangle, data-transformation, data-transform, data science, transform-data, manipulate-data, cdap, big-data, cdap-plugin, transform, Project, preparation, data-cleansing, data-prep, Parsing, avro
Java 105
8 days ago
Kukuster / SumStatsRehab

GWAS summary statistics files QC tool

summary-statistics, data-preparation, data-preprocessing, data-prep, Bioinformatics, computational-biology
Python 40
7 months ago
sminerport / SequencePredictionANN

#Computer science#Predict the next number in a sequence using a simple ANN. Modularized code with classes for data preparation, neural network architecture, and training.

artificial-neural-networks, data-prep, deep learning, machine learning, model-training, neural network, NumPy, Python, scikit-learn, supervised-learning, time-series-forecasting
Python 8
7 months ago
DSE-capstone-sharknado / AdvancedBPR

#Computer science#Amazon Recommendation System built on a BPR TensorFlow implementation

Jupyter Notebook, exploratory-analysis, data-prep, machine learning, recommender-system, data science
Jupyter Notebook 7
8 years ago
data-integrations / example-directive

An example for writing custom directives

dataprep, wrangler, data-prep, cdap, example-code
Java 4
2 years ago
SapanaKolambe / Data-Science-with-Python

This Data Science with Python repository gives you an overview of Python’s data analytics tools and techniques. You can learn Python for data science along with concepts like data preprocessing, panda...

data analysis, data-prep, data visualization, google-colab, pandas
Jupyter Notebook 1
3 years ago
JyotiVGupta / Preppn-Challenge-2023-Week-4

Solving Tableau Prep challenge 2023 Week 4 using SQL/Snowflake

data-prep, SQL, pivot-tables
0
1 year ago
data-integrations / image-directives

A set of directives for working with images

cdap, data-prep
Java 0
6 months ago
AMPATH-Capstone / DataPrep

This repository contains the original data and code to prepare it for analysis

data-prep
HTML 0
5 years ago
datacorner / dataprep-handbook

#Computer science#Time to get your data sorted! The Data Preparation Handbook, published by Manning within the MEAP release, is the go-to guide for handling messy data. All the book's code and resources can be found he...

data, data-prep, data-preparation, generative-ai, machine learning
HTML 0
2 months ago
sandy-sp / gittxt

#Natural language processing#Gittxt: Get text from Git repositories in AI-ready formats. Extract docs, code, and assets from Git repositories for LLMs, AI datasets, and NLP pipelines.

artificial intelligence, cli-tool, Git, LLM, natural language processing, text-extraction, data-prep, machine learning
Python 0
4 months ago
enso-org / sample-projects

Open source Enso Analytics examples and documentation explicitly permitted for AI training and educational use.

data-prep
0
2 months ago