#

mlsys

https://static.github-zh.com/github_avatars/Infrasys-AI?size=40

AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Jupyter Notebook 15.1 k
13 天前
https://static.github-zh.com/github_avatars/HuaizhengZhang?size=40

🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSys...

3.27 k
2 个月前
https://static.github-zh.com/github_avatars/nunchaku-tech?size=40

[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models

Python 2.98 k
3 天前
https://static.github-zh.com/github_avatars/thu-ml?size=40

#大语言模型#Quantized Attention achieves speedup of 2-5x and 3-11x compared to FlashAttention and xformers, without lossing end-to-end metrics across language, image, and video models.

Cuda 2.39 k
1 个月前
https://static.github-zh.com/github_avatars/thu-ml?size=40

#大语言模型#SpargeAttention: A training-free sparse attention that can accelerate any model inference.

Cuda 713
1 个月前
https://static.github-zh.com/github_avatars/bytedance?size=40
MLIR 447
1 个月前
https://static.github-zh.com/github_avatars/SymbioticLab?size=40
Python 400
2 年前
https://static.github-zh.com/github_avatars/ml-energy?size=40

#计算机科学#Measure and optimize the energy consumption of your AI applications!

Python 291
1 个月前
https://static.github-zh.com/github_avatars/MLSys-Learner-Resources?size=40
HTML 276
8 个月前
https://static.github-zh.com/github_avatars/sbu-fsl?size=40

#计算机科学#Machine Learning Framework for Operating Systems - Brings ML to Linux kernel

C 249
4 年前
https://static.github-zh.com/github_avatars/bytedance?size=40

An acceleration library that supports arbitrary bit-width combinatorial quantization operations

C++ 232
1 年前
https://static.github-zh.com/github_avatars/HuaizhengZhang?size=40
Python 217
1 年前
https://static.github-zh.com/github_avatars/xlite-dev?size=40

🤖FFPA: Extend FlashAttention-2 with Split-D, ~O(1) SRAM complexity for large headdim, 1.8x~3x↑🎉 vs SDPA EA.

Cuda 215
1 个月前
https://static.github-zh.com/github_avatars/HPMLL?size=40

#大语言模型#A ChatGPT(GPT-3.5) & GPT-4 Workload Trace to Optimize LLM Serving Systems

Python 204
2 个月前
https://static.github-zh.com/github_avatars/jacopotagliabue?size=40

#自然语言处理#Materials for my 2021 NYU class on NLP and ML Systems (Master of Engineering).

Jupyter Notebook 96
3 年前
https://static.github-zh.com/github_avatars/tanyuqian?size=40
Python 68
9 个月前
loading...
Website
Wikipedia