#

on-device-llms

https://static.github-zh.com/github_avatars/Lizonghang?size=40

prima.cpp: Speeding up 70B-scale LLM inference on low-resource everyday home clusters

C++ 997
3 个月前
https://static.github-zh.com/github_avatars/dmis-lab?size=40

#计算机科学#[ACL 2025] Outlier-Safe Pre-Training for Robust 4-Bit Quantization of Large Language Models

Python 31
2 个月前
Website
Wikipedia