GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

corpus-processing

Website
Wikipedia
https://static.github-zh.com/github_avatars/BLKSerene?size=40
BLKSerene / Wordless

An Integrated Corpus Tool With Multilingual Support for the Study of Language, Literature, and Translation

corpuscorpus-linguisticscorpus-toolscorpus-processingliteraturetranslationParsingtaggerlemmatizerdependency-parser
Python 733
7 天前
https://static.github-zh.com/github_avatars/bitextor?size=40
bitextor / bitextor

#网络爬虫#Bitextor generates translation memories from multilingual websites

dictionaries爬虫wgetParsingwarccorpus-toolscorpus-processingmachine-translationneural-machine-translationstatistical-machine-translation
Python 295
10 个月前
https://static.github-zh.com/github_avatars/hankcs?size=40
hankcs / TreebankPreprocessing

#自然语言处理# Python scripts preprocessing Penn Treebank and Chinese Treebank

自然语言处理corpus-processing
Python 162
5 年前
https://static.github-zh.com/github_avatars/Helsinki-NLP?size=40
Helsinki-NLP / OpusFilter

#自然语言处理#OpusFilter - Parallel corpus processing toolkit

corpus-toolscorpus-processing自然语言处理machine-translation
Python 109
1 个月前
https://static.github-zh.com/github_avatars/NathanDuran?size=40
NathanDuran / Switchboard-Corpus

Utilities for Processing the Switchboard Dialogue Act Corpus

corpuscorpus-processingcorpus-datacorpus-toolsdialogue
Python 70
5 年前
https://static.github-zh.com/github_avatars/OHNLP?size=40
OHNLP / MedTator

#自然语言处理#A Serverless Text Annotation Tool for Corpus Development

corpus-processing自然语言处理Serverless
JavaScript 57
7 个月前
https://static.github-zh.com/github_avatars/johentsch?size=40
johentsch / ms3

A parser for annotated MuseScore 3 files.

corpuscorpus-datacorpus-processingcorpus-toolsmusescoreParsersheet-musictsvtsv-filesxml-parserxml-parser-libraryxml-parsing
Python 49
6 个月前
https://static.github-zh.com/github_avatars/uma-pi1?size=40
uma-pi1 / OPIEC

#自然语言处理#Reading the data from OPIEC - an Open Information Extraction corpus

information-extractioncorpuscorpus-datacorpus-tools自然语言处理natural-language-understandingwikipediaWikicorpus-processingdataset
Java 38
6 年前
https://static.github-zh.com/github_avatars/NathanDuran?size=40
NathanDuran / MRDA-Corpus

Utilities for Processing the Meeting Recorder Dialogue Act Corpus

corpuscorpus-datacorpus-processingcorpus-toolsdialogue
Python 33
5 年前
https://static.github-zh.com/github_avatars/versotym?size=40
versotym / rhymetagger

A simple collocation-driven recognition of rhymes. Contains pre-trained models for Czech, Dutch, English, French, German, Russian, and Spanish poetry

corpus-processinglanguage-processing
Python 30
3 个月前
https://static.github-zh.com/github_avatars/notesjor?size=40
notesjor / corpusexplorer2.0

#自然语言处理#Korpuslinguistik war noch nie so einfach...

corpus-linguistics数据科学text-miningtext-processingtext-analysis自然语言处理data-miningSDKcorpus-processingnatural-language-understandingbig-datatagger可视化journalismdatajournalism
C# 24
3 个月前
https://static.github-zh.com/github_avatars/jaytimm?size=40
jaytimm / corpuslingr

A library of functions enabling complex corpus search in context (KWIC), search aggregation, bag-of-words building & keyphrase extraction.

corpus-toolscorpus-processing
R 20
7 年前
https://static.github-zh.com/github_avatars/Bibliome?size=40
Bibliome / alvisnlp

#自然语言处理#ALvisNLP corpus processing engine

自然语言处理pipelinecorpus-processingworkflowJavaworkflow-engine机器学习
Java 18
10 个月前
https://static.github-zh.com/github_avatars/zgornel?size=40
zgornel / StringAnalysis.jl

Hard-Forked from JuliaText/TextAnalysis.jl

corpus-processingtext-processingtext-analysis
Julia 17
2 年前
https://static.github-zh.com/github_avatars/uma-pi1?size=40
uma-pi1 / OPIEC-pipeline

#自然语言处理#

text-processingcorpus-datacorpus-toolscorpus-linguisticscorpus-processingwikipediaWikiinformation-extractionbig-databigdata自然语言处理natural-language-understanding
Java 14
4 年前
https://static.github-zh.com/github_avatars/jonathandunn?size=40
jonathandunn / corpus_similarity

#自然语言处理#Measure the similarity of text corpora for 74 languages

corpustextcorpus-linguisticscorpus-processingcorpus-toolslanguage自然语言处理
Python 13
2 年前
https://static.github-zh.com/github_avatars/kennedyCzar?size=40
kennedyCzar / NLP-PROJECT-BOOK-INSIGHTS-WITH-PLOTLY

#自然语言处理#Plotly-Dash NLP project. Document similarity measure using Latent Dirichlet Allocation, principal component analysis and finally follow with KMeans clustering. Project is completed with dynamic visual...

pcaplotly-dash自然语言处理corpus-processingdashplotlyplotly-pythoncallbacks
Python 13
3 年前
https://static.github-zh.com/github_avatars/jonathandunn?size=40
jonathandunn / common_crawl_corpus

Scripts for building a geo-located web corpus using Common Crawl data

corpus-linguisticscorpus-processingcorpus-toolsweb-crawling
Python 11
5 天前
https://static.github-zh.com/github_avatars/felipetovarhenao?size=40
felipetovarhenao / exquisitecorpus

A set of corpus-based sampling & analysis M4L devices

maxforlivecorpus-processingsampling
Max 11
4 年前
https://static.github-zh.com/github_avatars/Linguista?size=40
Linguista / CQPweb-Instabox

Script that sets up and configures an entire CQPweb server installation

corpus-linguisticscorpus-toolscorpus-processingcqp
Shell 11
6 年前
loading...