llm-routing · GitHub Topics

The smart edge and AI gateway for agents. Arch is a high-performance proxy server that handles the low-level work in building agents, such as applying guardrails, routing prompts to the right agent, and ...

gateway generative-ai llm-inference llm prompt proxy proxy-server llmops openai ai-gateway llm-gateway llm-routing envoy envoyproxy ai-gateway-support llm-proxy

Rust 3.67k

3 days ago

junchenzhi / Awesome-LLM-Ensemble

A curated list of Awesome-LLM-Ensemble papers for the survey "Harnessing Multiple Large Language Models: A Survey on LLM Ensemble"

ensemble ensemble-learning llm-agents llm multi-agent-systems llm-ensemble large-language-models llm-routing moe routing-algorithm

119

11 days ago

thushan / olla

Lightweight & fast AI inference proxy for self-hosted LLM backends like Ollama, LM Studio and others. Designed for speed, simplicity and local-first deployments.

artificial-intelligence llm-inference lmstudio ollama proxy vllm Go llamacpp llm-proxy llm-routing self-hosted

Go 86

8 days ago