#大语言模型#:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, tr...
AI模型接口管理与分发系统,支持将多种大模型转为统一格式调用,支持OpenAI、Claude等格式,可供个人或者企业内部管理与分发渠道使用,本项目基于One API二次开发。🍥 The next-generation LLM gateway and AI asset management system supports multiple languages.
#大语言模型#gpt_server是一个用于生产级部署LLMs、Embedding、Reranker、ASR和TTS的开源框架。
rerank library for easy reranking of results
#大语言模型#SearchAugmentedLLM empowers LLMs with information from the web
A comprehensive RAG FastAPI service that handles document uploads and retrievals, built with Python. Uses PyMuPDF for document processing, turbopuffer for vector storage, OpenAI for models, and cohere...
#大语言模型#A Python project that deploys a Local RAG chatbot using Ollama API and vLLM API. Refines answers with internal RAG knowledge base, using both Embedding and Rerank models to improve accuracy of context...
go client for text-embedding-inference (https://github.com/huggingface/text-embeddings-inference)
#大语言模型#A Python project that deploys a Local RAG chatbot using Ollama API and vLLM API. Refines answers with internal RAG knowledge base, using both Embedding and Rerank models to improve accuracy of context...
The code features a Rerank RAG developed in the Python programming language.