#

llm-inference

https://static.github-zh.com/github_avatars/nomic-ai?size=40

GPT4All: 可在任何设备(笔记本 & 台式机)上运行大型语言模型(LLM)

C++ 76.68 k
4 个月前
https://static.github-zh.com/github_avatars/gitleaks?size=40

#大语言模型#Gitleaks 是一个开源SAST(静态应用安全测试)命令行工具,用于检测Git 仓库以防止把密码、API 密钥和访问令牌等机密信息硬编码到代码中

Go 23.24 k
1 个月前
https://static.github-zh.com/github_avatars/liguodongiot?size=40

#大语言模型#本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

HTML 20.78 k
1 个月前
https://static.github-zh.com/github_avatars/Lightning-AI?size=40

#大语言模型#20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python 12.75 k
3 天前
https://static.github-zh.com/github_avatars/bentoml?size=40
Python 11.78 k
10 小时前
https://static.github-zh.com/github_avatars/mistralai?size=40
Jupyter Notebook 10.47 k
6 个月前
https://static.github-zh.com/github_avatars/SJTU-IPADS?size=40

#大语言模型#PowerInfer 是一个快速的、可运行在消费级GPU、个人电脑上的大模型服务

C++ 8.33 k
1 个月前
bentoml/BentoML
https://static.github-zh.com/github_avatars/bentoml?size=40
Python 8.07 k
18 小时前
https://static.github-zh.com/github_avatars/duixcom?size=40

🚀 全网效果最好的移动端【实时对话数字人】。 支持本地部署、多模态交互(语音、文本、表情),响应速度低于 1.5 秒,适用于直播、教学、客服、金融、政务等对隐私与实时性要求极高的场景。开箱即用,开发者友好。

C++ 7.46 k
18 天前
https://static.github-zh.com/github_avatars/kserve?size=40
Python 4.54 k
14 小时前
xlite-dev/Awesome-LLM-Inference
https://static.github-zh.com/github_avatars/xlite-dev?size=40

📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉

Python 4.52 k
1 个月前
https://static.github-zh.com/github_avatars/katanemo?size=40

The smart edge and AI gateway for agents. Arch is a high-performance proxy server that handles the low-level work in building agents: like applying guardrails, routing prompts to the right agent, and ...

Rust 3.68 k
4 小时前
loading...
Website
Wikipedia