GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

llm-inference

Website
Wikipedia
https://static.github-zh.com/github_avatars/nomic-ai?size=40
nomic-ai / gpt4all

GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.

llm-inferenceai-chat
C++ 73.61 k
19 天前
https://static.github-zh.com/github_avatars/ray-project?size=40
ray-project / ray

#大语言模型#Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

raydistributedparallel机器学习reinforcement-learning深度学习Pythonrllibhyperparameter-searchoptimization数据科学hyperparameter-optimizationserving部署PyTorchTensorflowllm-servinglarge-language-models大语言模型llm-inference
Python 37.52 k
10 小时前
https://static.github-zh.com/github_avatars/gitleaks?size=40
gitleaks / gitleaks

#大语言模型#Gitleaks 是一个开源SAST(静态应用安全测试)命令行工具,用于检测Git 仓库以防止把密码、API 密钥和访问令牌等机密信息硬编码到代码中

安全GitGosecretgitleaksdevsecopsHacktoberfestCI/CD命令行界面data-loss-preventiondlpOpen Sourceai-powered大语言模型llm-inferencellm-training
Go 20.21 k
7 天前
https://static.github-zh.com/github_avatars/liguodongiot?size=40
liguodongiot / llm-action

#大语言模型#本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

大语言模型llm-inferencellm-servingllm-trainingllmops
HTML 18.54 k
12 小时前
https://static.github-zh.com/github_avatars/Lightning-AI?size=40
Lightning-AI / litgpt

#大语言模型#20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

人工智能深度学习large-language-models大语言模型llm-inference
Python 12.26 k
3 天前
https://static.github-zh.com/github_avatars/bentoml?size=40
bentoml / OpenLLM

#大语言模型#Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.

大语言模型llmopsmodel-inferencefine-tuningllm-servingllamavicunabentomlllama2llm-inferencellm-opsmistralmlopsllama3-1
Python 11.35 k
6 天前
https://static.github-zh.com/github_avatars/mistralai?size=40
mistralai / mistral-inference

#大语言模型#Official inference library for Mistral models

大语言模型llm-inferencemistralai
Jupyter Notebook 10.29 k
3 个月前
https://static.github-zh.com/github_avatars/openvinotoolkit?size=40
openvinotoolkit / openvino

#自然语言处理#OpenVINO™ is an open source toolkit for optimizing and deploying AI inference

inference深度学习openvino人工智能机器视觉diffusion-modelsgenerative-aillm-inference自然语言处理performance-boostspeech-recognitionstable-diffusiondeploy-aioptimize-aitransformersyolorecommendation-systemgood-first-issue
C++ 8.43 k
16 小时前
https://static.github-zh.com/github_avatars/SJTU-IPADS?size=40
SJTU-IPADS / PowerInfer

#大语言模型#PowerInfer 是一个快速的、可运行在消费级GPU、个人电脑上的大模型服务

large-language-modelsllama大语言模型llm-inferencelocal-inference
C++ 8.22 k
4 个月前
bentoml/BentoML
https://static.github-zh.com/github_avatars/bentoml?size=40
bentoml / BentoML

#大语言模型#The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!

model-servingmlopsllmopsgenerative-aillm-inference深度学习llm-serving机器学习Pythonmultimodalml-engineering大语言模型
Python 7.78 k
2 天前
https://static.github-zh.com/github_avatars/InternLM?size=40
InternLM / lmdeploy

#大语言模型#LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

cuda-kernelsdeepspeedfastertransformerllm-inferenceturbomindinternlmllama大语言模型codellamallama2llama3
Python 6.52 k
2 天前
superduper-io/superduper
https://static.github-zh.com/github_avatars/superduper-io?size=40
superduper-io / superduper

#向量搜索引擎#Superduper: End-to-end framework for building custom AI applications and agents.

人工智能mlopstorchtransformersMongoDBPythonPyTorch机器学习数据库datainferencellm-inferencepretrained-models聊天机器人semantic-searchllm-servingllmopsvector-searchrag
Python 5.08 k
3 天前
https://static.github-zh.com/github_avatars/kserve?size=40
kserve / kserve

#计算机科学#Standardized Serverless ML Inference Platform on Kubernetes

knative机器学习model-interpretabilitymodel-servingistiokubeflow人工智能TensorflowPyTorchscikit-learnxgboostKubernetesservice-meshkserveHacktoberfestmlopsgenaillm-inference
Python 4.24 k
4 天前
xlite-dev/Awesome-LLM-Inference
https://static.github-zh.com/github_avatars/xlite-dev?size=40
xlite-dev / Awesome-LLM-Inference

📚A curated list of Awesome LLM Inference Papers with Codes.

flash-attentiontensorrt-llmvllmllm-inferencedeepseekdeepseek-v3deepseek-r1qwen3
Python 4.12 k
7 天前
https://static.github-zh.com/github_avatars/FellouAI?size=40
FellouAI / eko

Eko (Eko Keeps Operating) - Build Production-ready Agentic Workflow with Natural Language - eko.fellou.ai

agentagentic-aiagentic-frameworkagentic-workflowbrowserusecomputerusenatural-language-inferenceworkflowragagentic-ai-developmentagentschain-of-thoughtgenaillm-inferencellmapiprompt-engineeringllm-agentsai-agentsbrowser-automationcomputer-automation
TypeScript 3.99 k
1 天前
https://static.github-zh.com/github_avatars/NVIDIA?size=40
NVIDIA / GenerativeAIExamples

#大语言模型#Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

gpu-accelerationlarge-language-models大语言模型llm-inference微服务nemoragretrieval-augmented-generationtensorrttriton-inference-server
Jupyter Notebook 3.18 k
5 天前
https://static.github-zh.com/github_avatars/flashinfer-ai?size=40
flashinfer-ai / flashinfer

FlashInfer: Kernel Library for LLM Serving

gpuCUDAPyTorchllm-inferencejitattentionNvidia
Cuda 3.17 k
2 天前
neuralmagic/deepsparse
https://static.github-zh.com/github_avatars/neuralmagic?size=40
neuralmagic / deepsparse

#自然语言处理#Sparsity-aware deep learning inference runtime for CPUs

机器学习onnxinference机器视觉object-detectionpruningquantizationpretrained-models自然语言处理cpussparsificationllm-inferenceperformance
Python 3.15 k
13 天前
https://static.github-zh.com/github_avatars/predibase?size=40
predibase / lorax

#大语言模型#Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

fine-tuninggptllama大语言模型llm-inferencellm-servingllmopsloramodel-servingPyTorchtransformers
Python 3.01 k
25 天前
https://static.github-zh.com/github_avatars/gpustack?size=40
gpustack / gpustack

#大语言模型#Simple, scalable AI model deployment on GPU clusters

ascendCUDAdeepseekdistributed-inferencegenaiinferencellamallamacpp大语言模型maasmetalopenaiqwenrocmvllmmindiellm-inferencellm-servinglocal-aiheterogeneous-cluster
Python 2.92 k
3 天前
loading...