GitHub Chinese Community

kvcache

kvcache-ai / Mooncake

#Large Language Model# Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

inference, kvcache, large language model, rdma, sglang, vllm, disaggregation
C++ 3.85 k
7 hours ago
Zefan-Cai / R-KV

#Large Language Model# R-KV: Redundancy-aware KV Cache Compression for Reasoning Models

kvcache, large language model
Python 1.11 k
18 hours ago
uccl-project / uccl

#Large Language Model# Ultra and Unified CCL

artificial intelligence, amd, broadcom, CUDA, gpu, hpc, large language model, Network, Nvidia, rdma, kvcache, P2P
C++ 512
4 hours ago
ovg-project / kvcached

#Large Language Model# kvcached: Elastic KV cache for dynamic GPU sharing and efficient multi-LLM inference.

kvcache, large language model, sglang, vllm, inference-engine
Python 66
6 hours ago
NoakLiu / PiKV

PiKV: KV Cache Management System for Mixture of Experts [Efficient ML System]

kvcache, moe, parallel-computing, kv-cache, management-system, mixture-of-experts
Python 34
12 days ago
Linking-ai / SCOPE

(ACL 2025 oral) SCOPE: Optimizing KV Cache Compression in Long-context Generation

kvcache, long-context
Jupyter Notebook 32
3 months ago
IBM / spnl

Span Queries: What if we had a way to plan and optimize GenAI like we do for SQL?

generative-ai, kvcache, locality, optimization, SQL
Rust 6
5 hours ago
RohitMurali18 / Music-Generation-Emotion-Adaptive

This project implements an Emotion-Aware Music Generator (EAMG) that turns natural-language prompts into emotion-aligned music in real time. It uses a LoRA-tuned DistilBERT to classify emotions, maps ...

FastAPI, kvcache, large language model, lora, transformers
Jupyter Notebook 0
2 months ago