# chunking

chonkie-ai/chonkie
Python 2.87 k
6 months ago
jiesutd

#Natural Language Processing# NCRF++, a neural sequence labeling toolkit. Easy to apply to any sequence labeling task (e.g. NER, POS, segmentation). It includes character LSTM/CNN, word LSTM/CNN and softmax/CRF components.

Python 1.9 k
3 years ago
smooks

An extensible Java framework for building event-driven applications that break up XML and non-XML data into chunks for data integration

Java 412
21 days ago
mirth
Python 371
5 months ago
isaacus-dev

#Natural Language Processing# A fast, lightweight and easy-to-use Python library for splitting text into semantically meaningful chunks.

Python 364
1 month ago
folbricht
Go 358
2 months ago
microsoft

#Large Language Models# The RAG Experiment Accelerator is a versatile tool designed to expedite and facilitate experiments and evaluations using Azure Cognitive Search and the RAG pattern.

Python 269
5 months ago
lazyFrogLOL
Python 267
1 year ago
26hzhang

A TensorFlow implementation of a neural sequence labeling model that can tackle sequence labeling tasks such as POS tagging, chunking, NER, and punctuation restoration.

Python 234
7 years ago
zeroentropy-ai

#Large Language Models# A new chunking strategy developed by ZeroEntropy for general semantic chunking using Llama-70B.

Python 208
8 months ago
jparkerweb

#Large Language Models# 🍱 semantic-chunking ⇢ semantically create chunks from large documents for passing to LLM workflows

JavaScript 111
2 months ago
safakatakancelik

An LLM GUI application; enables you to interact with your files, offering dynamic parameters that can modify response behavior during runtime.

Python 94
2 years ago
jordicenzano

Live TS segmenter and HLS manifest creation in Go

Go 94
4 years ago
xtabbas
JavaScript 85
8 years ago
neondatabase-labs

Postgres extensions to support end-to-end Retrieval-Augmented Generation (RAG) pipelines

Rust 85
24 days ago
esastack
Java 84
3 years ago
Sammyjo20

📑 Split Laravel jobs into multiple separate job chunks

PHP 84
1 year ago
ronomon

Fast multi-threaded content-dependent chunking deduplication for Buffers in C++ with a reference implementation in JavaScript. Ships with extensive tests, a fuzz test and a benchmark.

JavaScript 75
6 years ago