GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

vllm

Website
Wikipedia
https://static.github-zh.com/github_avatars/meta-llama?size=40
meta-llama / llama-cookbook

#大语言模型#Llama 2 微调/推理方法和示例

人工智能finetuninglangchainllamallama2大语言模型机器学习PythonPyTorchvllm
Jupyter Notebook 17.48 k
2 天前
https://static.github-zh.com/github_avatars/xorbitsai?size=40
xorbitsai / inference

#大语言模型#Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any...

ggmlPyTorchchatglm部署flan-t5大语言模型wizardlm人工智能机器学习Whisperinferenceopenai-apimistralgemmallamallamacppvllmqwenllama3glm4
Python 8.03 k
13 小时前
https://static.github-zh.com/github_avatars/OpenRLHF?size=40
OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agent RL)

transformersvllmlarge-language-modelsraylibreinforcement-learning-from-human-feedbackreinforcement-learningopenai-o1proximal-policy-optimization
Python 7.08 k
2 天前
https://static.github-zh.com/github_avatars/katanaml?size=40
katanaml / sparrow

#大语言模型#Data processing and instruction calling with ML, LLM and Vision LLM

机器学习huggingface-transformers自然语言处理机器视觉gpt大语言模型ragvllm
Python 4.57 k
3 天前
xlite-dev/Awesome-LLM-Inference
https://static.github-zh.com/github_avatars/xlite-dev?size=40
xlite-dev / Awesome-LLM-Inference

📚A curated list of Awesome LLM Inference Papers with Codes.

flash-attentiontensorrt-llmvllmllm-inferencedeepseekdeepseek-v3deepseek-r1qwen3
Python 4.12 k
7 天前
https://static.github-zh.com/github_avatars/gpustack?size=40
gpustack / gpustack

#大语言模型#Simple, scalable AI model deployment on GPU clusters

ascendCUDAdeepseekdistributed-inferencegenaiinferencellamallamacpp大语言模型maasmetalopenaiqwenrocmvllmmindiellm-inferencellm-servinglocal-aiheterogeneous-cluster
Python 2.92 k
4 天前
https://static.github-zh.com/github_avatars/containers?size=40
containers / ramalama

#大语言模型#RamaLama is an open-source developer tool that simplifies the local serving of AI models from any source and facilitates their use for inference in production, all through the familiar language of con...

人工智能containersCUDADockerhipinference-serverintelllamacpp大语言模型podmanvllm
Python 1.76 k
5 天前
https://static.github-zh.com/github_avatars/bricks-cloud?size=40
bricks-cloud / BricksLLM

#大语言模型#🔒 Enterprise-grade API gateway that helps you monitor and impose cost or rate limits per API key. Get fine-grained access control and monitoring per user, application, or environment. Supports OpenAI...

Go大语言模型openai人工智能anthropicAzuregptPostgreSQLREST APIycombinatorAPIDocker隐私安全generative-aiOpen Source自托管vllm
Go 1.06 k
5 个月前
https://static.github-zh.com/github_avatars/substratusai?size=40
substratusai / kubeai

#大语言模型#AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-text.

Kubernetes大语言模型openai-apiautoscalerollamavllmollama-operatorvllm-operator人工智能Whisperfaster-whisper
Go 987
4 天前
https://static.github-zh.com/github_avatars/prometheus-eval?size=40
prometheus-eval / prometheus-eval

#大语言模型#Evaluate your LLM's response with Prometheus and GPT4 💯

evaluation大语言模型llmopsPythonvllmgpt4llm-as-a-judge
Python 954
2 个月前
https://static.github-zh.com/github_avatars/mostlygeek?size=40
mostlygeek / llama-swap

Model swapping for llama.cpp (or any local OpenAPI compatible server)

Gollamallamacpplocalllamalocalllmopenaiopenai-apivllm
Go 876
7 天前
https://static.github-zh.com/github_avatars/harleyszhang?size=40
harleyszhang / llm_note

#大语言模型#LLM notes, including model inference, transformer model structure, and llm framework code analysis notes.

大语言模型llm-inferencevllmcuda-programmingkv-cachetransformer-models
Python 778
5 天前
https://static.github-zh.com/github_avatars/apconw?size=40
apconw / sanic-web

#大语言模型#一个轻量级、支持全链路且易于二次开发的大模型应用项目(Large Model Data Assistant) 支持DeepSeek/Qwen2.5等大模型 基于 Dify 、Ollama&Vllm、Sanic 和 Text2SQL 📊 等技术构建的一站式大模型应用开发项目,采用 Vue3、TypeScript 和 Vite 5 打造现代UI。它支持通过 ECharts 📈 实现基于大模型的数据...

人工智能bigdataChatGPTdifyollamaragvllmchat大语言模型qwenechartssanictext2sqlVue.jsPythondeepseek-r1mcp
JavaScript 773
5 天前
https://static.github-zh.com/github_avatars/vllm-project?size=40
vllm-project / vllm-ascend

#大语言模型#Community maintained hardware plugin for vLLM on Ascend

ascendinference大语言模型llm-servingllmopsmlopsmodel-servingtransformervllm
Python 758
1 天前
https://static.github-zh.com/github_avatars/ModelCloud?size=40
ModelCloud / GPTQModel

Production ready LLM model compression/quantization toolkit with hw accelerated inference support for both cpu/gpu via HF, vLLM, and SGLang.

gptqpeftquantizationtransformersvllm
Python 606
18 天前
https://static.github-zh.com/github_avatars/pgalko?size=40
pgalko / BambooAI

#大语言模型#A Python library powered by Language Models (LLMs) for conversational data discovery and analysis.

大语言模型openai-apipandasPython人工智能数据分析数据科学ai-agentsvector-databasepineconegeminigroqmistralollamaanthropicvllmDocker
Python 605
6 天前
https://static.github-zh.com/github_avatars/mustafaaljadery?size=40
mustafaaljadery / llama3v

A SOTA vision model built on top of llama3 8B.

llamallama3vllm
Python 586
1 年前
https://static.github-zh.com/github_avatars/jakobdylanc?size=40
jakobdylanc / llmcord

#前端开发#Make Discord your LLM frontend ● Supports any OpenAI compatible API (Ollama, LM Studio, vLLM, OpenRouter, xAI, Mistral, Groq and more)

gptopenaigpt-4Discord聊天机器人大语言模型Botollamallama3llamamistralgroqxaigrokvllm前端chatllama4gemini
Python 580
3 天前
https://static.github-zh.com/github_avatars/ModelTC?size=40
ModelTC / llmc

#大语言模型#[EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".

部署大语言模型pruningquantization工具benchmarkevaluationlarge-language-modelsinternlm2llama3smoothquantpost-training-quantizationmixtralvllm
Python 486
6 天前
https://static.github-zh.com/github_avatars/varunvasudeva1?size=40
varunvasudeva1 / llm-server-docs

#大语言模型#Documentation on setting up an LLM server on Debian from scratch, using Ollama/vLLM, Open WebUI, OpenedAI Speech/Kokoro FastAPI, and ComfyUI.

Linux大语言模型ollamaServeropen-webuiDebiancomfyuivllm
472
2 个月前
loading...