GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

text-processing

Website
Wikipedia
https://static.github-zh.com/github_avatars/learnbyexample?size=40
learnbyexample / Command-line-text-processing

⚡ From finding text to search and replace, from sorting to beautifying text and more 🎨

命令行界面Linuxtext-processingebooksedgrepPerlawkRubyRegular expression
Shell 10.19 k
1 年前
https://static.github-zh.com/github_avatars/google?size=40
google / diff-match-patch

Diff Match Patch is a high-performance library in multiple languages that manipulates plain text.

differencediffmatchpatchtext-processing
Python 7.78 k
1 年前
pymupdf/PyMuPDF
https://static.github-zh.com/github_avatars/pymupdf?size=40
pymupdf / PyMuPDF

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

mupdfxpspdf-documentsepubOCRpdf字体Python数据科学extract-datatable-extractiontesseracttext-processingtext-shaping
Python 7.38 k
2 天前
https://static.github-zh.com/github_avatars/chmln?size=40
chmln / sd

Intuitive find & replace CLI (sed alternative)

命令行界面Rust终端text-processingRegular expression
Rust 6.33 k
2 个月前
https://static.github-zh.com/github_avatars/fastnlp?size=40
fastnlp / fastNLP

#自然语言处理#fastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.

自然语言处理深度学习nlp-librarynlp-parsingchinese-nlptext-classificationtext-processing
Python 3.13 k
2 年前
chonkie-ai/chonkie
https://static.github-zh.com/github_avatars/chonkie-ai?size=40
chonkie-ai / chonkie

#自然语言处理#🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library

人工智能chunkingragtext-processing自然语言处理Pythonsemantic-segmentationvector-searchetlretrieval
Python 2.87 k
3 个月前
https://static.github-zh.com/github_avatars/pyparsing?size=40
pyparsing / pyparsing

Python library for creating PEG parsers

Pythonparser-combinatorsParsingparsing-librarytext-processing
Python 2.34 k
8 天前
https://static.github-zh.com/github_avatars/kk7nc?size=40
kk7nc / Text_Classification

#计算机科学#Text Classification Algorithms: A Survey

text-classification自然语言处理document-classificationtext-processingdimensionality-reductionrocchio-algorithmboosting-algorithmslogistic-regressionnaive-bayes-classifierk-nearest-neighbourssupport-vector-machinesdecision-treesrandom-forest深度学习深度神经网络recurrent-neural-networksconvolutional-neural-networks
Python 1.81 k
2 个月前
https://static.github-zh.com/github_avatars/roshan-research?size=40
roshan-research / hazm

#自然语言处理#Persian NLP Toolkit

自然语言处理Pythonpersianpersian-nlpdependency-parserembeddingstext-processingParsingfarsiCSS Resetspos-tagging
Python 1.29 k
1 年前
pemistahl/lingua-go
https://static.github-zh.com/github_avatars/pemistahl?size=40
pemistahl / lingua-go

#自然语言处理#The most accurate natural language detection library for Go, suitable for short text and mixed-language text

自然语言处理language-detectionlanguage-recognitionlanguage-classificationlanguage-identificationlanguage-processinggolang-libraryGolanguage-modelingtext-processing
Go 1.25 k
4 个月前
https://static.github-zh.com/github_avatars/birchb1024?size=40
birchb1024 / frangipanni

Program to convert lines of text into a tree structure.

text-processingGotree-structure
Go 1.2 k
2 年前
https://static.github-zh.com/github_avatars/BurntSushi?size=40
BurntSushi / aho-corasick

A fast implementation of Aho-Corasick in Rust.

Finite-state machinetext-processingsearch
Rust 1.11 k
9 个月前
https://static.github-zh.com/github_avatars/helix-editor?size=40
helix-editor / nucleo

A fast and convenient fuzzy matcher library for rust

fuzzy-matchingfuzzy-searchperformanceRusttext-processing
Rust 1.11 k
23 天前
https://static.github-zh.com/github_avatars/PyThaiNLP?size=40
PyThaiNLP / pythainlp

#自然语言处理#Thai natural language processing in Python

Pythonnlp-library自然语言处理word-segmentationthaiHacktoberfestcomputational-linguisticstext-processing
Python 1.04 k
12 天前
https://static.github-zh.com/github_avatars/sstadick?size=40
sstadick / hck

A sharp cut(1) clone.

Rust命令行界面text-processing
Rust 712
3 个月前
https://static.github-zh.com/github_avatars/ChenghaoMou?size=40
ChenghaoMou / text-dedup

#自然语言处理#All-in-one text de-duplication

text-processing自然语言处理data-processing
Python 684
21 天前
https://static.github-zh.com/github_avatars/derek73?size=40
derek73 / python-nameparser

A simple Python module for parsing human names into their individual components

Pythontext-processingpython-module
Python 674
1 年前
https://static.github-zh.com/github_avatars/cbaziotis?size=40
cbaziotis / ekphrasis

#自然语言处理#Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashta...

自然语言处理text-processingnlp-libraryspelling-correctionParsingtokenizationword-segmentation
Python 670
13 天前
https://static.github-zh.com/github_avatars/abadojack?size=40
abadojack / whatlanggo

#自然语言处理#Natural language detection library for Go

languageGo自然语言处理text-processing
Go 657
2 年前
https://static.github-zh.com/github_avatars/open-korean-text?size=40
open-korean-text / open-korean-text

#自然语言处理#Open Korean Text Processor - An Open-source Korean Text Processor

korean自然语言处理text-processingParsing
Scala 630
1 年前
loading...