GitHub Chinese Community

#ngram

zhezhaoa / ngram2vec

Four word embedding models implemented in Python, supporting arbitrary context features.

ngram, word2vec, embedding, Chinese, glove, svd, word, word-embedding
Python 849
6 years ago
lonePatient / albert_pytorch

#Natural Language Processing# A Lite BERT for Self-Supervised Learning of Language Representations

albert, bert, PyTorch, ngram, mask, natural language processing, language-model
Python 717
5 years ago
wintermute-cell / ngrrram

A TUI tool to help you type faster and learn new layouts. Includes a free cat.

cat, command-line interface, colemak, dvorak, layout, ngram, Rust, touchtyping, tui, typing
Rust 663
8 months ago
ranelpadon / ngram-type

Touch typing trainer that uses n-grams as its data source, with options to customize the auto-generated lessons and set the minimum typing performance required. Includes sound and color effects.

ngram, colemak, dvorak, Vue.js, monkeytype
JavaScript 239
1 year ago
lonePatient / daguan_2019_rank9

DataGrand 2019 information extraction competition, rank 9

bert, ie, information-extraction, ner, lstm, crf, span, dropout, lookahead, PyTorch, ngram
Python 130
6 years ago
proycon / colibri-core

#Natural Language Processing# Colibri Core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e. patterns with one or more gaps, either of fixed or dy...

C++, Python, natural language processing, ngrams, skipgram, ngram, corpus, Library, text-processing, computational-linguistics, pattern-recognition
C++ 127
7 months ago
ChrisMuir / refinr

Cluster and merge similar string values: an R implementation of OpenRefine clustering algorithms

openrefine, fuzzy-matching, ngram, approximate-string-matching, data-cleaning, clustering, R, rstats
C++ 104
1 year ago
joshualoehr / ngram-language-model

#Natural Language Processing# Python implementation of an N-gram language model with Laplace smoothing and sentence generation.

ngram, perplexity, natural language processing, language-model, Python, ngrams, language-models
Python 86
7 years ago
words / n-gram

Get n-grams from text

ngram, unigram
JavaScript 83
3 years ago
vickumar1981 / stringdistance

A fuzzy matching string distance library for Scala and Java that includes Levenshtein distance, Jaro distance, Jaro-Winkler distance, Dice coefficient, N-Gram similarity, Cosine similarity, Jaccard si...

levenshtein-distance, levenshtein, ngram, jaro, jaro-winkler, dice-coefficient, hamming-distance, string-similarity, cosine-similarity, fuzzy-matching, Hacktoberfest
Scala 80
3 years ago
wrathematics / ngram

Fast n-Gram Tokenization

R, ngram, text, text-mining
C 71
2 years ago
suggest-go / suggest

#Search# Top-k Approximate String Matching.

golang-library, ngram, fuzzy-search, search engine, language-model, spellchecker, autocomplete
Go 68
4 years ago
jiangnanboy / llm_corpus_quality

#Large Language Models# Cleaning and quality evaluation of Chinese corpora for large-model pre-training

Java, large language models, ngram
Java 67
1 year ago
BitSpeech / SRILM

Mirror of SRILM

language-model, ngram
Roff 57
5 years ago
myazi / NLP

natural language processing

ngram, crf
C++ 36
7 years ago
JackHCC / Chinese-Tokenization

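#Natural Language Processing# Chinese word segmentation implemented with traditional methods (N-gram, HMM, etc.), neural network methods (CNN, LSTM, etc.), and pre-training methods (BERT, etc.) 【The word segmentation task is realized by using traditional methods (n-gram, HMM, etc.), neural network methods (CNN, LSTM, etc.) and pre tr...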

hmm-viterbi-algorithm, ngram, natural language processing, tokenization
Python 35
3 years ago
0xVavaldi / gramify

Create n-grams of wordlists based on words, characters, or charsets to use in offline password attacks and data analysis

hashcat, jtr, ngram, password, password-cracking
Python 33
1 year ago
AsadiAhmad / Ngram-Spark-Wikipedia

#Natural Language Processing# Calculating n-grams with PySpark for Wikipedia text

big-data, ngram, natural language processing, pyspark, Apache Spark
Jupyter Notebook 29
1 year ago
cyclone-github / spider

#Web Crawler# Spider - web crawler and local wordlist processor that generates frequency-sorted wordlists / n-grams

crawler, Generator, ngram, scraper, spider, url, Web, wordlist, web-crawler, web-scraping, wordlist-generator
Go 24
1 month ago
yiyepiaoling0715 / unsupervised_extract_detect_words

multiprocess unsupervised chinese_detect_words ngram_combination

pmi, entropy, multiprocessing, ngram, detect, unsupervised-learning, segment, recursive
Python 24
7 years ago