GitHub Chinese Community

©2025 GitHub Chinese Community

#word-segmentation

google / sentencepiece

#natural-language-processing# Unsupervised text tokenizer for Neural Network-based text generation.

neural-machine-translation, natural-language-processing, word-segmentation
C++ 10.99 k
2 months ago

baidu / lac

Baidu NLP: word segmentation, part-of-speech tagging, named entity recognition, word importance

word-segmentation, part-of-speech-tagger, named-entity-recognition, chinese-word-segmentation, chinese-nlp, Parsing, Python, Java
C++ 3.94 k
4 years ago

wolfgarbe / SymSpell

SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm

levenshtein, fuzzy-search, approximate-string-matching, edit-distance, spellcheck, spell-check, levenshtein-distance, damerau-levenshtein, spelling, fuzzy-matching, word-segmentation, chinese-text-segmentation, chinese-word-segmentation, spelling-correction
C# 3.25 k
3 months ago

PyThaiNLP / pythainlp

#natural-language-processing# Thai natural language processing in Python

Python, nlp-library, natural-language-processing, word-segmentation, thai, Hacktoberfest, computational-linguistics, text-processing
Python 1.04 k
13 days ago

VKCOM / YouTokenToMe

#natural-language-processing# Unsupervised text tokenizer focused on computational efficiency

natural-language-processing, word-segmentation, bpe, tokenization
C++ 968
1 year ago

mammothb / symspellpy

Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm

Python, spellcheck, spell-check, fuzzy-matching, fuzzy-search, spelling-correction, damerau-levenshtein, approximate-string-matching, levenshtein, edit-distance, levenshtein-distance, spelling, word-segmentation, chinese-text-segmentation, chinese-word-segmentation
Python 828
2 months ago

ckiplab / ckip-transformers

CKIP Transformers

transformers, language-model, word-segmentation, part-of-speech-tagging, named-entity-recognition
Python 731
2 years ago

cbaziotis / ekphrasis

#natural-language-processing# Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashta...

natural-language-processing, text-processing, nlp-library, spelling-correction, Parsing, tokenization, word-segmentation
Python 670
14 days ago

vncorenlp / VnCoreNLP

#natural-language-processing# A Vietnamese natural language processing toolkit (NAACL 2018)

dependency-parsing, named-entity-recognition, pos-tagging, word-segmentation, vietnamese-nlp, natural-language-processing, pos-tagger, ner, Parsing, vietnamese, Java, Python
Java 622
2 years ago

bab2min / Kiwi

#natural-language-processing# Kiwi (intelligent Korean morphological analyzer)

natural-language-processing, korean, morphological-analysis, word-segmentation, C++
C++ 592
8 days ago

JayYip / m3tl

#natural-language-processing# BERT for Multitask Learning

bert, named-entity-recognition, natural-language-processing, word-segmentation, multitask-learning, cws, pretrained-models, ner, text-classification, multi-task-learning, transformer, encoder-decoder
Jupyter Notebook 548
2 years ago

modelscope / AdaSeq

#natural-language-processing# AdaSeq: An All-in-One Library for Developing State-of-the-Art Sequence Understanding Models

named-entity-recognition, natural-language-processing, natural-language-understanding, PyTorch, sequence-labeling, word-segmentation, ner, relation-extraction, bert, chinese-nlp, crf, information-extraction
Python 440
2 years ago

taishi-i / nagisa

#natural-language-processing# A Japanese tokenizer based on recurrent neural networks

dynet, word-segmentation, pos-tagging, japanese, nlp-library, sequence-labeling, natural-language-processing, Parsing
Python 400
1 month ago

ku-nlp / jumanpp

#natural-language-processing# Juman++ (a Morphological Analyzer Toolkit)

natural-language-processing, japanese, morphological-analysis, pos-tagging, pos-tagger, part-of-speech-tagger, word-segmentation, cjk, Parsing
C++ 389
2 years ago

jacksonllee / pycantonese

#natural-language-processing# Cantonese Linguistics and NLP

cantonese, computational-linguistics, natural-language-processing, Python, word-segmentation, part-of-speech-tagging
Python 381
1 year ago

yongzhuo / Pytorch-NLU

A Chinese text classification and sequence labeling toolkit (PyTorch). It supports multi-class and multi-label classification of long and short Chinese texts, and sequence labeling tasks such as Chinese named entity recognition, part-of-speech tagging, word segmentation, and extractive text summarization. Chinese text classification and sequence labeling toolkit, supports multi class and multi label classification, text s...

Python, PyTorch, text-classification, sequence-labeling, named-entity-recognition, word-segmentation, pos-tagging, chinese-text-segmentation, transformers, bert, pretrained-models
Python 346
1 year ago

bab2min / kiwipiepy

#natural-language-processing# Python API for Kiwi

natural-language-processing, korean, morphological-analysis, word-segmentation, python-library
Python 317
1 month ago

jidasheng / bi-lstm-crf

#natural-language-processing# A PyTorch implementation of the BI-LSTM-CRF model.

crf, PyTorch, natural-language-processing, ner, sequence-labeling, word-segmentation, pos-tagging
Python 252
8 months ago

monpa-team / monpa

#natural-language-processing# MONPA (罔拍) is a multi-task model that provides Traditional Chinese word segmentation, part-of-speech tagging, and named entity recognition

natural-language-processing, ner, pos, named-entity-recognition, word-segmentation, chinese-word-segmentation, pos-tagging, bert, albert
Python 246
4 months ago

fastcws / fastcws

A lightweight, high-performance Chinese word segmentation project

chinese, word-segmentation
C++ 199
2 years ago