GitHub Chinese Community (GitHub 中文社区)
training-data

snorkel-team / snorkel

#Computer Science# A system for quickly generating training data with weak supervision

machine learning, artificial intelligence, weak-supervision, labeling, data science, Python, snorkel, training-data, data-augmentation, data-slicing
Python 5.87k
1 year ago

diffgram / diffgram

#Data Warehouse# The AI Datastore for Schemas, BLOBs, and Predictions. Use with your apps or integrate built-in Human Supervision, Data Workflow, and UI Catalog to get the most value out of your AI Data.

annotation, annotation-tool, training-data, video-annotation, data-annotation, Kubernetes, data science, data-analytics, image-annotation, machine learning, deep learning, data, annotations, dataset, labeling, datastore
Python 1.87k
7 months ago

ydataai / ydata-synthetic

#Computer Science# Synthetic data generators for tabular and time-series data

Generative Adversarial Network, deep learning, synthetic-data, tensorflow2, machine learning, training-data, Python, timeseries, gans, PyTorch, time-series
Jupyter Notebook 1.56k
3 months ago

NorskRegnesentral / skweak

#Natural Language Processing# skweak: A software toolkit for weak supervision applied to NLP tasks

weak-supervision, natural language processing, distant-supervision, nlp-library, spaCy, Python, data science, training-data
Python 925
9 months ago

OvidijusParsiunas / myvision

#Computer Science# Computer vision based ML training data generation tool 🚀

machine learning, computer vision, object-detection, training-data, annotation, labelling, annotation-tool, coco, vgg, Tensorflow, yolo, model, vision, image-annotation, labeling-tool, tagging, Image, artificial intelligence
JavaScript 600
4 months ago

alteryx / compose

#Computer Science# A machine learning tool for automated prediction engineering. It allows you to easily structure prediction problems and generate labels for supervised learning.

machine learning, automl, data science, labeling-tool, labeling, artificial intelligence, training-data, data-labeling
Python 507
3 months ago

a-maliarov / amazoncaptcha

Pure Python, lightweight, Pillow-based solver for Amazon's text captcha.

captcha, captcha-solver, amazon, Python, pillow, training-data, data-extraction
Python 481
6 days ago

sparkfish / augraphy

#Computer Science# Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes

data-augmentation, deep neural networks, training-data, machine learning, data-pipeline, image processing, synthetic-data, synthetic-dataset-generation, computer vision
Python 425
3 months ago

Slava / label-tool

#Computer Science# Web application for image labeling and segmentation

image-labeling, image-labeling-tool, computer vision, machine learning, training-data, segmentation, computer-vision-tools, image-annotation, boundingbox, data-labeling
JavaScript 352
3 years ago

d5555 / TagEditor

#Natural Language Processing# 🏖 TagEditor - Annotation tool for spaCy

annotation-tool, spaCy, coreference-resolution, text-annotation, labeling-tool, natural language processing, annotation, machine learning, data science, neural-networks, training-data, named-entity-recognition
192
3 years ago

Geocene / trainset

#Computer Science# A lightweight web application for brushing labels onto time series data; useful for building training sets.

labeling-tool, machine learning, training-data, labeling, painting, time-series-classification
JavaScript 174
2 years ago

KennethEnevoldsen / augmenty

#Natural Language Processing# Augmenty is an augmentation library based on spaCy for augmenting texts.

augmentation, spacy-extension, spaCy, natural language processing, nlproc, Python, text-classification, training-data
Python 155
1 year ago

avinashsen707 / AUBOi5-D435-ROS-DOPE

#Computer Science# Aubo i5 Dual Arm Collaborative Robot - RealSense D435 - 3D Object Pose Estimation - ROS

pose-estimation, object-detection, dataset, deep learning, Ubuntu, ros, blender, training-data
C++ 119
3 years ago

tzano / fountain

Natural Language Data Augmentation Tool for Conversational Systems

nlu, data-generator, chatbot, training-data, natural-language, conversational-ai
Python 115
2 years ago

enginBozkurt / carla-training-data

#Computer Science# Generating training data from the Carla driving simulator in the KITTI dataset format

carla-simulator, training-data, deep learning, artificial intelligence, kitti-dataset, autonomous-driving, self-driving-car, autonomous-vehicles
Python 109
6 years ago

rahul051296 / small-talk-rasa-stack

Collection of casual conversations that can be used with the Rasa Stack

smalltalk, training-data, conversational-ai, dialogflow
Python 85
5 years ago

google-research-datasets / swim-ir

#Natural Language Processing# SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 languages, generated using PaLM 2 and summarize-then-ask promptin...

cross-lingual, dataset, deep learning, information-retrieval, machine learning, multilingual, natural language processing, training-data
48
2 years ago

ableinc / git2txt

#Large Language Models# Convert all files in git repository to .txt files. Useful for training LLMs on your codebase.

Git, large language models, machine learning, Python, training-data, txt
Python 42
6 months ago

hernanmd / COVID-19-train-audio

COVID-19 Coughs files for training AI models

COVID-19, coronavirus, wavelet-analysis, audio-analysis, training-data
Python 41
5 years ago

milangritta / Pragmatic-Guide-to-Geoparsing-Evaluation

#Computer Science# Full resources supporting the publication "A Pragmatic Guide to Geoparsing Evaluation."

data, geocoding, geocoder, evaluation, taxonomy, location, places, geography, analysis, machine learning, training-data, named-entity-recognition, Google Cloud
Python 40
6 years ago