# sglang

kvcache-ai/Mooncake
https://static.github-zh.com/github_avatars/kvcache-ai?size=40

#Large Language Models# Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 4.09 k
15 hours ago
https://static.github-zh.com/github_avatars/OpenMOSS?size=40

MOSS-TTSD is a spoken dialogue generation model that enables expressive dialogue speech synthesis in both Chinese and English, supporting zero-shot multi-speaker voice cloning, and long-form speech ge...

Python 978
18 days ago
https://static.github-zh.com/github_avatars/ModelCloud?size=40

LLM model quantization (compression) toolkit with hw acceleration support for Nvidia CUDA, AMD ROCm, Intel XPU and Intel/AMD/Apple CPU via HF, vLLM, and SGLang.

Python 828
2 days ago
https://static.github-zh.com/github_avatars/HuiResearch?size=40

Provides high-quality Chinese speech synthesis and voice cloning services based on models such as SparkTTS and OrpheusTTS.

Python 538
5 months ago
https://static.github-zh.com/github_avatars/sgl-project?size=40

#Large Language Models# Train speculative decoding models effortlessly and port them smoothly to SGLang serving.

Python 420
6 days ago
https://static.github-zh.com/github_avatars/sgl-project?size=40

#Large Language Models# OME is a Kubernetes operator for enterprise-grade management and serving of Large Language Models (LLMs).

Go 289
5 days ago
https://static.github-zh.com/github_avatars/InftyAI?size=40

#Large Language Models# ☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!

Go 260
3 days ago
https://static.github-zh.com/github_avatars/shell-nlp?size=40

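#Large Language Models# gpt_server is an open-source framework for production-grade deployment of LLM, Embedding, Reranker, ASR, TTS, text-to-image, image-editing, and text-to-video models.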

Python 213
19 hours ago
https://static.github-zh.com/github_avatars/sgl-project?size=40

#Large Language Models# A workload for deploying LLM inference services on Kubernetes.

Go 77
5 days ago
https://static.github-zh.com/github_avatars/scitix?size=40
Go 43
16 days ago
https://static.github-zh.com/github_avatars/blackbird-io?size=40

A high-performance RDMA distributed storage system for fast LLM Inference and GPU Training

C++ 34
8 days ago
https://static.github-zh.com/github_avatars/dzhsurf?size=40

DeepSeek-V3, R1 671B on 8xH100 Throughput Benchmarks

Python 16
7 months ago
https://static.github-zh.com/github_avatars/zejia-lin?size=40

#Large Language Models# Boosting GPU utilization for LLM serving via dynamic spatial-temporal prefill & decode orchestration.

Python 13
22 days ago
https://static.github-zh.com/github_avatars/sgl-project?size=40

Kernel Library Wheel for SGLang

HTML 12
4 days ago
https://static.github-zh.com/github_avatars/lucasavila00?size=40
TypeScript 9
1 year ago