GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

sglang

Website
Wikipedia
kvcache-ai/Mooncake
https://static.github-zh.com/github_avatars/kvcache-ai?size=40
kvcache-ai / Mooncake

#大语言模型#Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

inferencekvcache大语言模型rdmasglangvllmdisaggregation
C++ 3.85 k
6 小时前
https://static.github-zh.com/github_avatars/ModelCloud?size=40
ModelCloud / GPTQModel

Production ready LLM model compression/quantization toolkit with hw accelerated inference support for both cpu/gpu via HF, vLLM, and SGLang.

gptqpeftquantizationsglangtransformersvllm
Python 758
10 小时前
https://static.github-zh.com/github_avatars/HuiResearch?size=40
HuiResearch / FlashTTS

基于SparkTTS、OrpheusTTS等模型,提供高质量中文语音合成与声音克隆服务。

sglangvllm
Python 513
3 个月前
https://static.github-zh.com/github_avatars/sgl-project?size=40
sgl-project / SpecForge

#大语言模型#Train speculative decoding models effortlessly and port them smoothly to SGLang serving.

PyTorchsglangtraining大语言模型
Python 340
7 小时前
https://static.github-zh.com/github_avatars/sgl-project?size=40
sgl-project / ome

#大语言模型#OME is a Kubernetes operator for enterprise-grade management and serving of Large Language Models (LLMs)

Kubernetesllm-inferencemodel-servingoracle-cloudsglang大语言模型deepseekllama
Go 250
1 天前
https://static.github-zh.com/github_avatars/InftyAI?size=40
InftyAI / llmaz

#大语言模型#☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!

Kubernetes大语言模型llamacppsglangvllmhuggingfacemodelscopeollamainferenceinference-platform
Go 242
3 天前
https://static.github-zh.com/github_avatars/shell-nlp?size=40
shell-nlp / gpt_server

#大语言模型#gpt_server是一个用于生产级部署LLMs、Embedding、Reranker、ASR和TTS的开源框架。

embeddinggptllama大语言模型openaiprompt-injectionrerankvllmttsfastchatfunction-callingasrsglang
Python 206
4 天前
https://static.github-zh.com/github_avatars/ovg-project?size=40
ovg-project / kvcached

#大语言模型#kvcached: Elastic KV cache for dynamic GPU sharing and efficient multi-LLM inference.

kvcache大语言模型sglangvllminference-engine
Python 66
4 小时前
https://static.github-zh.com/github_avatars/scitix?size=40
scitix / arks

#大语言模型#Arks is a cloud-native inference framework running on Kubernetes

dynamoKubernetessglangvllminferencereasoning人工智能大语言模型
Go 43
19 天前
https://static.github-zh.com/github_avatars/modal-labs?size=40
modal-labs / stopwatch

#计算机科学#A tool for benchmarking LLMs on Modal

大语言模型机器学习sglangtensorrt-llmvllm
Python 43
6 小时前
https://static.github-zh.com/github_avatars/sgl-project?size=40
sgl-project / awesome-sglang

#大语言模型#Make SGLang go brrr

deepseekdeepseek-r1deepseek-v3gpt-ossllama3llama3-1llama4qwen2qwen2-5qwen3omesglangKubernetes大语言模型
25
8 小时前
https://static.github-zh.com/github_avatars/sgl-project?size=40
sgl-project / rbg

#大语言模型#

Kubernetes大语言模型lwspdsglangcanary
Go 13
11 小时前
https://static.github-zh.com/github_avatars/dzhsurf?size=40
dzhsurf / deepseek-v3-r1-deploy-and-benchmarks

DeepSeek-V3, R1 671B on 8xH100 Throughput Benchmarks

deepseek-r1deepseek-v3sglangvllm
Python 12
6 个月前
https://static.github-zh.com/github_avatars/AidanCooper?size=40
AidanCooper / constrained-decoding

#自然语言处理#A guide to structured generation using constrained decoding

generative-modellarge-language-models自然语言处理sglangstructured-generation
Jupyter Notebook 11
1 年前
https://static.github-zh.com/github_avatars/sgl-project?size=40
sgl-project / whl

Kernel Library Wheel for SGLang

CUDAcutlasssglang
HTML 11
2 天前
https://static.github-zh.com/github_avatars/lucasavila00?size=40
lucasavila00 / LmScript

#大语言模型#Controllable Language Model Interactions in TypeScript

人工智能guidance大语言模型TypeScriptsglang
TypeScript 9
1 年前
https://static.github-zh.com/github_avatars/didier-durand?size=40
didier-durand / llms-in-clouds

#大语言模型#Experiments with LLMs in clouds (powered by SGLang)

Amazon Web ServicesDockerhuggingface大语言模型qwensglangllamamistral
Python 7
2 天前
https://static.github-zh.com/github_avatars/ugo-emekauwa?size=40
ugo-emekauwa / private-ai-setup-dream-guide

#大语言模型#The Private AI Setup Dream Guide for Demos automates the installation of the software needed for a local private AI setup, utilizing AI models (LLMs and diffusion models) for use cases such as general...

人工智能deepseekfluxNvidiaqwensdxlsglangstable-diffusionstable-diffusion-webuivllminference大语言模型Nimlocal-aigenerative-aiopen-webui
Shell 4
1 个月前
https://static.github-zh.com/github_avatars/llmd-io?size=40
llmd-io / llmd

llmd is a LLMs daemonset, it provide model manager and get up and running large language models, it can use llama.cpp or vllm or sglang to running large language models.

llm-inferencesglangvllminference
Makefile 3
7 个月前
https://static.github-zh.com/github_avatars/slinusc?size=40
slinusc / bench360

#大语言模型#Bench360 is a modular benchmarking suite for local LLM deployments. It offers a full-stack, extensible pipeline to evaluate the latency, throughput, quality, and cost of LLM inference on consumer and ...

大语言模型llm-inferenceoptimizationquantizationbenchmarkengine框架inferencevllm部署sglangenergy-consumptionperformanceenergylocal
Python 3
1 个月前
loading...