GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

corpus

Website
Wikipedia
https://static.github-zh.com/github_avatars/brightmart?size=40
brightmart / nlp_chinese_corpus

#自然语言处理#大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP

chinese-datasetchinese-corpuspretrainword2vec自然语言处理bertlanguage-modelWikinewsquestion-answering中文corpuschinese-nlpdatasettext-classification
9.73 k
1 年前
https://static.github-zh.com/github_avatars/dariusk?size=40
dariusk / corpora

A collection of small corpuses of interesting data for the creation of bots and similar stuff.

languagewordsBotcorpus
JavaScript 5.01 k
1 年前
https://static.github-zh.com/github_avatars/CLUEbenchmark?size=40
CLUEbenchmark / CLUEDatasetSearch

#自然语言处理#搜索所有中文NLP数据集,附常用英文NLP数据集

自然语言处理数据集中文nerqamatchtext-classificationmachine-translationknowledge-graphcorpussentiment-analysistext-similarity
Python 4.33 k
3 年前
https://static.github-zh.com/github_avatars/CLUEbenchmark?size=40
CLUEbenchmark / CLUE

中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

nlubenchmark中文corpusdatasetbertalbertchinesegluegluerobertalanguage-modelpretrained-modelstransformersTensorflowPyTorch
Python 4.14 k
1 年前
https://static.github-zh.com/github_avatars/wainshine?size=40
wainshine / Chinese-Names-Corpus

中文人名语料库。人名生成器。中文姓名,姓氏,名字,称呼,日本人名,翻译人名,英文人名。可用于中文分词、人名实体识别。

namescorpusdictnerdataset
4.14 k
1 年前
https://static.github-zh.com/github_avatars/endymecy?size=40
endymecy / awesome-deeplearning-resources

#自然语言处理#Deep Learning and deep reinforcement learning research papers and some codes

深度学习reinforcement-learning神经网络BukkitCodeVideocorpus自然语言处理
2.92 k
1 年前
https://static.github-zh.com/github_avatars/lucasjinreal?size=40
lucasjinreal / weibo_terminater

#网络爬虫#Final Weibo Crawler Scrap Anything From Weibo, comments, weibo contents, followers, anything. The Terminator

scraperweibosinacorpus中文聊天机器人
Python 2.32 k
6 年前
https://static.github-zh.com/github_avatars/fendouai?size=40
fendouai / Awesome-Chatbot

Awesome Chatbot Projects,Corpus,Papers,Tutorials.Chinese Chatbot =>:

聊天机器人TensorflowAwesome Listsseq2seqseq2seq-chatbotcorpus教程
Python 2.1 k
1 年前
https://static.github-zh.com/github_avatars/candlewill?size=40
candlewill / Dialog_Corpus

用于训练中英文对话系统的语料库 Datasets for Training Chatbot System

datasetdialogsystemcorpus聊天机器人
Python 2.05 k
5 年前
https://static.github-zh.com/github_avatars/gunthercox?size=40
gunthercox / chatterbot-corpus

A multilingual dialog corpus

chatterbotcorpusdialoglanguageYAML
Python 1.4 k
1 个月前
https://static.github-zh.com/github_avatars/NiuTrans?size=40
NiuTrans / Classical-Modern

非常全的文言文(古文)-现代文平行语料

corpustraditional-chinese
Python 1.34 k
1 年前
https://static.github-zh.com/github_avatars/wainshine?size=40
wainshine / Company-Names-Corpus

公司名语料库。机构名语料库。公司简称,缩写,品牌词,企业名。可用于中文分词、机构名实体识别。

companycorpusdictnerdataset
1.27 k
1 年前
https://static.github-zh.com/github_avatars/chatopera?size=40
chatopera / insuranceqa-corpus-zh

#自然语言处理#🚁 保险行业语料库,聊天机器人

corpus聊天机器人自然语言处理natural-language-understanding机器学习datasetquestion-answeringinsurance
Python 1.03 k
20 天前
https://static.github-zh.com/github_avatars/CLUEbenchmark?size=40
CLUEbenchmark / CLUECorpus2020

#自然语言处理#Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料

中文chinese-corpus数据集pretraincorpus自然语言处理bertrobertaalbert
966
3 年前
https://static.github-zh.com/github_avatars/PlexPt?size=40
PlexPt / chatgpt-corpus

ChatGPT 中文语料库 对话语料 小说语料 客服语料 用于训练大模型

corpusAwesome Listscorpus-dataquestion-answering
904
1 年前
https://static.github-zh.com/github_avatars/OYE93?size=40
OYE93 / Chinese-NLP-Corpus

#数据仓库#Collections of Chinese NLP corpus

chinese-nlp数据集corpus
Python 902
4 年前
https://static.github-zh.com/github_avatars/quanteda?size=40
quanteda / quanteda

#自然语言处理#An R package for the Quantitative Analysis of Textual Data

R自然语言处理corpus
R 859
25 天前
https://static.github-zh.com/github_avatars/tensorlayer?size=40
tensorlayer / seq2seq-chatbot

#自然语言处理#Chatbot in 200 lines of code using TensorLayer

tensorlayerTensorflow聊天机器人rnnlstmBot自然语言处理chatcorpusPython
Python 839
4 年前
https://static.github-zh.com/github_avatars/soskek?size=40
soskek / bookcorpus

#网络爬虫#Crawl BookCorpus

corpus爬虫scraper自然语言处理bookcorpus
Python 834
2 年前
https://static.github-zh.com/github_avatars/CLUEbenchmark?size=40
CLUEbenchmark / CLUEPretrainedModels

高质量中文预训练模型集合:最先进大模型、最快小模型、相似度专门模型

pretrained-models中文bertrobertaalbertsentence-classificationsemantic-similaritytext-classificationdistillationcorpusdataset
Python 818
5 年前
loading...