#model-inference-service

bentoml / BentoML

#Large language models# The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!

model-serving, mlops, llmops, generative-ai, llm-inference, model-inference-service, inference-platform, deep learning, llm-serving, machine learning, Python, multimodal, ml-engineering, large language models, ai-inference
Python 7.87k
2 days ago
bentoml / CLIP-API-service

CLIP as a service - Embed image and sentences, object recognition, visual reasoning, image classification and reverse image search

ai-applications, clip, cloud-native, mlops, model-inference, model-inference-service, model-serving
Jupyter Notebook 65
1 year ago
bentoml / transformers-nlp-service

#Natural language processing# Online Inference API for NLP Transformer models - summarization, text classification, sentiment analysis and more

large language models, mlops, model-deployment, model-inference-service, model-serving, natural language processing, transformer, llmops
Python 44
1 year ago
ksm26 / Efficiently-Serving-LLMs

Learn the ins and outs of efficiently serving Large Language Models (LLMs). Dive into optimization techniques, including KV caching and Low Rank Adapters (LoRA), and gain hands-on experience with Pred...

batch-processing, inference-optimization, machine-learning-operations, model-inference-service, model-serving, text-generation
Jupyter Notebook 16
1 year ago