GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

distributed-inference

Website
Wikipedia
https://static.github-zh.com/github_avatars/flashinfer-ai?size=40
flashinfer-ai / flashinfer

FlashInfer: Kernel Library for LLM Serving

gpuCUDAPyTorchllm-inferencejitattentionNvidiadistributed-inferencemoe
Cuda 3.44 k
1 天前
https://static.github-zh.com/github_avatars/gpustack?size=40
gpustack / gpustack

#大语言模型#Simple, scalable AI model deployment on GPU clusters

ascendCUDAdeepseekdistributed-inferencegenaiinferencellamallamacpp大语言模型maasmetalopenaiqwenrocmvllmmindiellm-inferencellm-servinglocal-aiheterogeneous-cluster
Python 3.17 k
1 天前
https://static.github-zh.com/github_avatars/Lizonghang?size=40
Lizonghang / prima.cpp

prima.cpp: Speeding up 70B-scale LLM inference on low-resource everyday home clusters

distributed-aillm-inferenceon-device-llmsllama-cppdistributed-inference
C++ 992
11 天前
https://static.github-zh.com/github_avatars/mzbac?size=40
mzbac / mlx_sharding

Distributed Inference for mlx LLm

MLXdistributed-inference
Python 94
1 年前
https://static.github-zh.com/github_avatars/ADT109119?size=40
ADT109119 / llamacpp-distributed-inference

#大语言模型#一個基於 llama.cpp 的分佈式 LLM 推理程式,讓您能夠利用區域網路內的多台電腦協同進行大型語言模型的分佈式推理,使用 Electron 的製作跨平台桌面應用程式操作 UI。

distributed-inferenceggufllamacpp大语言模型llm-inference远程过程调用 (RPC)distributed-llm
JavaScript 14
4 天前
https://static.github-zh.com/github_avatars/ipc-lab?size=40
ipc-lab / collaborative-inference-oac

#计算机科学#Source code of the paper "Private Collaborative Edge Inference via Over-the-Air Computation".

differential-privacydistributed-inferenceensemble-learning机器学习
Python 4
7 个月前
https://static.github-zh.com/github_avatars/JiangkaiWu?size=40
JiangkaiWu / Attribute_Reid

Official impl. of ACM MM paper "Identity-Aware Attribute Recognition via Real-Time Distributed Inference in Mobile Edge Clouds". A distributed inference model for pedestrian attribute recognition with...

distributed-inferenceedge-computingre-identification
Python 2
5 年前