GitHub Chinese Community
text-data

asyml / texar

#Natural Language Processing# Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://casl-project.ai/

Machine Learning, Natural Language Processing, Tensorflow, Deep Learning, text-generation, Python, machine-translation, dialog-systems, texar, bert, gpt-2, xlnet, text-data, data-processing
Python 2.39 k
4 years ago
microsoft / DialoGPT

#Computer Science# Large-scale pretraining for dialogue

dialogue, Machine Learning, PyTorch, transformer, text-generation, dialogpt, gpt-2, text-data, data-processing
Python 2.39 k
3 years ago
microsoft / GODEL

#Computer Science# Large-scale pretrained models for goal-directed dialog

data-processing, dialogue, dialogue-systems, Machine Learning, text-data, text-generation, transformers, conversational-ai, language-grounding, grounded-generation, dialogpt, language-model, pretrained-model, PyTorch, transformer
Python 870
2 years ago
asyml / texar-pytorch

#Natural Language Processing# Integrating the Best of TF into PyTorch, for Machine Learning, Natural Language Processing, and Text Generation. This is part of the CASL project: http://casl-project.ai/

Machine Learning, Natural Language Processing, PyTorch, Deep Learning, text-generation, Python, machine-translation, dialog-systems, texar, bert, gpt-2, xlnet, roberta, text-data, data-processing
Python 745
3 years ago
asyml / forte

#Natural Language Processing# Forte is a flexible and powerful ML workflow builder. This is part of the CASL project: http://casl-project.ai/

Machine Learning, Natural Language Processing, Deep Learning, Python, text-data, data-processing, information-retrieval, natural-language, pipeline
Python 245
1 year ago
thu-coai / cotk

#Natural Language Processing# Conversational Toolkit. An Open-Source Toolkit for Fast Development and Fair Evaluation of Text Generation

Machine Learning, Natural Language Processing, natural-language-generation, Deep Learning, Python, data-processing, text-data, Monitoring
Python 127
5 years ago
LoLei / redditcleaner

#Natural Language Processing# Cleans Reddit Text Data 📜 🧹

Reddit, data-cleaning, text-data, Natural Language Processing, Python, praw, Hacktoberfest
Python 82
5 years ago
trinker / textreadr

Tools to uniformly read in text data including semi-structured transcripts

R, docx, text-data, text-mining, doc
R 75
2 years ago
trinker / textshape

Tools for reshaping text data

text-data, manipulation, R
R 52
1 year ago
PratikBarhate / question-classification

#Natural Language Processing# Question Classification for the CogComp QC Dataset - [ http://cogcomp.org/Data/QA/QC/ ].

Python, Natural Language Processing, Machine Learning, spaCy, experimental, text-data, PyTorch, Neural Networks
Python 29
5 years ago
BALaka-18 / rake_new2

#Natural Language Processing# A Python library that enables smooth keyword extraction from any text using the RAKE (Rapid Automatic Keyword Extraction) algorithm.

text, text-data, keyword-extraction, keywords, Natural Language Processing, python-library
Python 29
1 year ago
YaleDHLab / wordmap

#Natural Language Processing# Visualize large text collections with WebGL

webgl, Data Visualization, word2vec, text-data, Natural Language Processing
JavaScript 25
9 months ago
carted / processing-text-data

Presents an optimized Apache Beam pipeline for generating sentence embeddings (runnable on Cloud Dataflow).

Tensorflow, apache-beam, dataflow, text-data, bert, tfhub
Python 20
3 years ago
tylerjthomas9 / ScrapeSEC.jl

#Web Crawler# Scrape EDGAR filings from https://www.sec.gov/

scraper, financial-data, sec, Julia, text-data, finance
Julia 14
3 months ago
PedroBarcha / old-books-dataset

Old book pages (with groundtruth), formerly used for OCR studies. There are several versions of the set (concerning resolution and binarization). Noised and denoised sets (done by several methods) are...

text, ground-truth, dataset, text-data
HTML 12
8 years ago
tayebiarasteh / retweet

#Natural Language Processing# How Will Your Tweet Be Received? Predicting the Sentiment Polarity of Tweet Replies

Natural Language Processing, natural-language, lstm, sentiment-analysis, PyTorch, Deep Learning, Deep Neural Networks, tweet, text-classification, text-data, unsupervised-learning
Python 11
4 years ago
Hsankesara / The-Tweets-of-Wisdom

#Natural Language Processing# A dataset containing 30k+ so-called "self-help" tweets from 100+ authors.

Natural Language Processing, tweets, text-data
Jupyter Notebook 9
6 years ago
mrchypark / gomSubtitleData

Code for collecting GOM TV subtitle data

R, data, subtitles, korean, text, text-data, movies, drama
R 6
8 years ago
Ankit152 / StackOverflow-Tag-Prediction

#Natural Language Processing# A machine learning model that predicts tags for a given question and body.

Stack Overflow, Machine Learning, Natural Language Processing, text-mining, text-data, tags
Jupyter Notebook 3
4 years ago
saghiles / dcc

Directional Co-clustering with a Conscience (DCC)

co-clustering, clustering, topic-modeling, text-clustering, text-data
R 3
6 years ago