GitHub Chinese Community
model-serving

vllm-project / vllm

#Large Language Models# A high-throughput and memory-efficient inference and serving engine for LLMs

gpt, large language models, PyTorch, llmops, mlops, model-serving, transformer, llm-serving, inference, llama, amd, rocm, CUDA, inferentia, trainium, tpu, xpu, hpu, deepseek, qwen
Python 49.62 k
20 hours ago

bentoml / BentoML

#Large Language Models# The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!

model-serving, mlops, llmops, generative-ai, llm-inference, deep learning, llm-serving, machine learning, Python, multimodal, ml-engineering, large language models
Python 7.78 k
3 days ago

ahkarami / Deep-Learning-in-Production

#Computer Science# In this repository, I will share some useful notes and references about deploying deep learning-based models in production.

deep learning, deep neural networks, Python, PyTorch, tesnorflow, Keras, mxnet, caffe2, production, serving, C++, model-serving, tutorial, Flask, REST API, React, Angular, Tensorflow
4.35 k
7 months ago

kserve / kserve

#Computer Science# Standardized Serverless ML Inference Platform on Kubernetes

knative, machine learning, model-interpretability, model-serving, istio, kubeflow, artificial intelligence, Tensorflow, PyTorch, scikit-learn, xgboost, Kubernetes, service-mesh, kserve, Hacktoberfest, mlops, genai, llm-inference
Python 4.24 k
4 days ago

FedML-AI / FedML

#Computer Science# FEDML - The unified and scalable ML library for large-scale distributed training, model serving, and federated learning. FEDML Launch, a cross-cloud scheduler, further enables running any AI jobs on a...

federated-learning, deep learning, distributed-training, edge-ai, machine learning, on-device-training, inference-engine, mlops, model-deployment, model-serving, ai-agent
Python 3.87 k
1 month ago

ModelTC / lightllm

#Natural Language Processing# LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

deep learning, gpt, llama, large language models, model-serving, natural language processing, openai-triton
Python 3.3 k
4 days ago

predibase / lorax

#Large Language Models# Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

fine-tuning, gpt, llama, large language models, llm-inference, llm-serving, llmops, lora, model-serving, PyTorch, transformers
Python 3.01 k
25 days ago

HuaizhengZhang / AI-Infra-from-Zero-to-Hero

🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSys...

large-language-models, ai-infra, genai, mlsys, model-serving, model-training
2.98 k
22 days ago

tensorchord / envd

🏕️ Reproducible development environment

developer-tools, development-environment, Docker, buildkit, Hacktoberfest, llmops, mlops, model-serving
Go 2.12 k
9 days ago

beclab / Olares

Olares: An Open-Source Personal Cloud to Reclaim Your Data

Kubernetes, self-hosted, home-automation, homelab, edge-ai, homeserver, local-ai, ai-agents, model-serving, mcp, home-cloud, home-server
Go 2.07 k
4 days ago

microsoft / aici

#Large Language Models# AICI: Prompts as (Wasm) Programs

artificial intelligence, Rust, WebAssembly, wasmtime, inference, language-model, large language models, llm-framework, llm-inference, llm-serving, llmops, model-serving, transformer
Rust 2.03 k
5 months ago

mlrun / mlrun

#Computer Science# MLRun is an open source MLOps platform for quickly building and managing continuous ML applications across their lifecycle. MLRun integrates into your development and CI/CD environment and automates t...

mlops, Python, data science, machine learning, data-engineering, experiment-tracking, model-serving, workflow, Kubernetes
Python 1.55 k
2 hours ago

logicalclocks / hopsworks

#Computer Science# Hopsworks - Data-Intensive AI platform with a Feature Store

feature-store, Amazon Web Services, Azure, data science, feature-engineering, feature-management, Google Cloud, governance, kserve, machine learning, mlops, model-serving, pyspark, Python, Serverless
Java 1.23 k
4 months ago

thu-pacman / chitu

#Large Language Models# High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.

deepseek, gpu, large language models, PyTorch, llm-serving, model-serving
Python 1.14 k
6 days ago

basetenlabs / truss

#Computer Science# The simplest way to serve AI/ML models in production

machine learning, artificial intelligence, easy-to-use, inference-api, inference-server, model-serving, Open Source, packaging, falcon, stable-diffusion, Whisper, wizardlm
Python 1 k
5 days ago

zhihu / ZhiLight

#Large Language Models# A highly optimized LLM inference acceleration engine for Llama and its variants.

inference-engine, large language models, CUDA, gpt, llama, llm-serving, PyTorch, llm-inference, model-serving, deepseek-r1
C++ 891
1 month ago

kitops-ml / kitops

#Data Warehouse# An open source DevOps tool for packaging and versioning AI/ML models, datasets, code, and configuration into an OCI artifact.

artificial intelligence, Code, datasets, DevOps, devops-tools, machine learning, mlops, models, mlops-tools, gguf, Kubernetes, kubernetes-deployment, PyTorch, scikit-learn, Tensorflow, model-interpretability, model-serving, Open Source, Hacktoberfest, platform-engineering
Go 872
5 days ago

mosecorg / mosec

#Large Language Models# A high-performance ML model serving framework, offers dynamic batching and CPU/GPU pipelines to fully exploit your compute machine

model-serving, deep learning, machine learning, nerual-network, mlops, Hacktoberfest, gpu, Python, PyTorch, Tensorflow, large language models, jax, llm-serving, Rust, cv, mxnet, tts
Python 842
5 days ago

efeslab / Nanoflow

#Large Language Models# A throughput-oriented high-performance serving framework for LLMs

CUDA, inference, llama2, large language models, llm-serving, model-serving
Jupyter Notebook 820
11 days ago

bentoml / Yatai

#Computer Science# Model Deployment at Scale on Kubernetes 🦄️

bentoml, Kubernetes, mlops, model-deployment, model-serving, machine learning
TypeScript 812
1 year ago