#自然语言处理#🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library
#自然语言处理#NCRF++, a Neural Sequence Labeling Toolkit. Easy use to any sequence labeling tasks (e.g. NER, POS, Segmentation). It includes character LSTM/CNN, word LSTM/CNN and softmax/CRF components.
#下载器#Content-Addressable Data Synchronization Tool
An extensible Java framework for building event-driven applications that break up XML and non-XML data into chunks for data integration
#大语言模型#The RAG Experiment Accelerator is a versatile tool designed to expedite and facilitate the process of conducting experiments and evaluations using Azure Cognitive Search and RAG pattern.
#自然语言处理#A package for parsing PDFs and analyzing their content using LLMs.
A TensorFlow implementation of Neural Sequence Labeling model, which is able to tackle sequence labeling tasks such as POS Tagging, Chunking, NER, Punctuation Restoration and etc.
#大语言模型#A new chunking strategy developed by ZeroEntropy for general semantic chunking using Llama-70B.
#大语言模型#🍱 semantic-chunking ⇢ semantically create chunks from large document for passing to LLM workflows
#自然语言处理#a modular multimodal framework for ai applications
An LLM GUI application; enables you to interact with your files, offering dynamic parameters that can modify response behavior during runtime.
webpack 2, react hotloader 3, react router v4, code splitting and more
Postgres extensions to support end-to-end Retrieval-Augmented Generation (RAG) pipelines
An asynchronous event-driven HTTP client based on netty.
📑 Split Laravel jobs into multiple separate job chunks
Fast multi-threaded content-dependent chunking deduplication for Buffers in C++ with a reference implementation in Javascript. Ships with extensive tests, a fuzz test and a benchmark.