GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

chunking

Website
Wikipedia
chonkie-ai/chonkie
https://static.github-zh.com/github_avatars/chonkie-ai?size=40
chonkie-ai / chonkie

#自然语言处理#🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library

人工智能chunkingragtext-processing自然语言处理Pythonsemantic-segmentationvector-searchetlretrieval
Python 2.87 k
3 个月前
https://static.github-zh.com/github_avatars/jiesutd?size=40
jiesutd / NCRFpp

#自然语言处理#NCRF++, a Neural Sequence Labeling Toolkit. Easy use to any sequence labeling tasks (e.g. NER, POS, Segmentation). It includes character LSTM/CNN, word LSTM/CNN and softmax/CRF components.

PyTorchnersequence-labelingcrfnamed-entity-recognitionpart-of-speech-taggerchunkingneural-networkslstmcnn自然语言处理人工智能
Python 1.9 k
3 年前
https://static.github-zh.com/github_avatars/systemd?size=40
systemd / casync

#下载器#Content-Addressable Data Synchronization Tool

archivetarfile-systemHTTPchunkingsynchronizationdownloaduploaddelivery
C 1.52 k
1 年前
https://static.github-zh.com/github_avatars/smooks?size=40
smooks / smooks

An extensible Java framework for building event-driven applications that break up XML and non-XML data into chunks for data integration

Javaetlsaxpipelinesevent-drivenXMLstream-processinganalyticschunking
Java 403
14 天前
https://static.github-zh.com/github_avatars/mirth?size=40
mirth / chonky

Fully neural approach for text chunking

人工智能chunking大语言模型机器学习rag
Python 353
2 个月前
https://static.github-zh.com/github_avatars/folbricht?size=40
folbricht / desync

Alternative casync implementation

Gochunkingarchivesynchronization
Go 350
2 个月前
https://static.github-zh.com/github_avatars/isaacus-dev?size=40
isaacus-dev / semchunk

#自然语言处理#A fast, lightweight and easy-to-use Python library for splitting text into semantically meaningful chunks.

chunking自然语言处理Pythonsplittingtext
Python 318
3 个月前
https://static.github-zh.com/github_avatars/lazyFrogLOL?size=40
lazyFrogLOL / llmdocparser

#自然语言处理#A package for parsing PDFs and analyzing their content using LLMs.

大语言模型自然语言处理OCRragchunkingdocument-analysispdf-parser
Python 271
10 个月前
https://static.github-zh.com/github_avatars/microsoft?size=40
microsoft / rag-experiment-accelerator

#大语言模型#The RAG Experiment Accelerator is a versatile tool designed to expedite and facilitate the process of conducting experiments and evaluations using Azure Cognitive Search and RAG pattern.

chunkingembeddingragevaluationexperimentinformation-retrievalopenaiAzuregenai大语言模型indexingsparsevectors
Python 255
2 个月前
https://static.github-zh.com/github_avatars/26hzhang?size=40
26hzhang / neural_sequence_labeling

A TensorFlow implementation of Neural Sequence Labeling model, which is able to tackle sequence labeling tasks such as POS Tagging, Chunking, NER, Punctuation Restoration and etc.

TensorflowPythonsequence-labelingpos-taggerchunkingnamed-entity-recognitionpunctuation
Python 234
7 年前
https://static.github-zh.com/github_avatars/zeroentropy-ai?size=40
zeroentropy-ai / zchunk

#大语言模型#A new chunking strategy developed by ZeroEntropy for general semantic chunking using Llama-70B.

chunking大语言模型retrieval
Python 183
5 个月前
https://static.github-zh.com/github_avatars/jparkerweb?size=40
jparkerweb / semantic-chunking

#大语言模型#🍱 semantic-chunking ⇢ semantically create chunks from large document for passing to LLM workflows

chunkingembeddings大语言模型vector
JavaScript 96
4 个月前
https://static.github-zh.com/github_avatars/jordicenzano?size=40
jordicenzano / go-ts-segmenter

Live TS segmenter and HLS manifest creation in Go

hlsVideochunkingGochunked
Go 94
4 年前
https://static.github-zh.com/github_avatars/safakan?size=40
safakan / TalkWithYourFiles

An LLM GUI application; enables you to interact with your files, offering dynamic parameters that can modify response behavior during runtime.

faisslangchainopenaiopenai-chatgptPythonStreamlitfactory-patternquestion-answeringstrategy-patternembeddingssimilarity-searchvectorstorechunkingtext-processingDocker
Python 92
2 年前
https://static.github-zh.com/github_avatars/swarmauri?size=40
swarmauri / swarmauri-sdk

#自然语言处理#a modular multimodal framework for ai applications

人工智能modularmonorepoorchestrationorchestration-frameworkagentschunkingllm-framework监控自然语言处理Parsingtooling工具factoriesvectors
Python 91
6 天前
https://static.github-zh.com/github_avatars/xtabbas?size=40
xtabbas / The-Ultimate-Boilerplate

webpack 2, react hotloader 3, react router v4, code splitting and more

ReactReduxhot-reloadingWebpack模板chunking服务端渲染react-router-v4
JavaScript 85
8 年前
https://static.github-zh.com/github_avatars/Sammyjo20?size=40
Sammyjo20 / laravel-chunkable-jobs

📑 Split Laravel jobs into multiple separate job chunks

chunkingjobsLaravelPHPHacktoberfest
PHP 83
1 年前
https://static.github-zh.com/github_avatars/esastack?size=40
esastack / esa-restclient

An asynchronous event-driven HTTP client based on netty.

httpclientNettyasynchronousHTTPhaproxyfilterinterceptorretrychunking
Java 83
3 年前
https://static.github-zh.com/github_avatars/neondatabase-labs?size=40
neondatabase-labs / pgrag

Postgres extensions to support end-to-end Retrieval-Augmented Generation (RAG) pipelines

chunkingembeddingspgrxPostgreSQLrag
Rust 81
1 个月前
https://static.github-zh.com/github_avatars/ronomon?size=40
ronomon / deduplication

Fast multi-threaded content-dependent chunking deduplication for Buffers in C++ with a reference implementation in Javascript. Ships with extensive tests, a fuzz test and a benchmark.

Entity resolutionchunkingNode.js
JavaScript 76
5 年前
loading...