🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSys...
[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
#Large Language Model#Quantized attention that achieves speedups of 2-5x and 3-11x over FlashAttention and xformers, respectively, without losing end-to-end metrics across language, image, and video models.
#Large Language Model#Distributed RL System for LLM Reasoning
ComfyUI plugin of Nunchaku
#Large Language Model#SpargeAttention: A training-free sparse attention that can accelerate any model inference.
#Large Language Model#A model compilation solution for various hardware
#Computer Science#FedScale is a scalable and extensible open-source federated learning (FL) platform.
#Computer Science#Measure and optimize the energy consumption of your AI applications!
#Large Language Model#A collection of noteworthy MLSys bloggers (algorithms/systems)
#Computer Science#Machine Learning Framework for Operating Systems - Brings ML to the Linux kernel
An acceleration library that supports arbitrary bit-width combinatorial quantization operations
#Computer Science#A scalable & efficient active learning/data selection system for everyone.
⚡️FFPA: Extend FlashAttention-2 with Split-D, achieve ~O(1) SRAM complexity for large headdim, 1.8x~3x↑ vs SDPA.🎉
#Large Language Model#A ChatGPT (GPT-3.5) & GPT-4 Workload Trace to Optimize LLM Serving Systems
#Algorithm Practice#Optimal Sparse Decision Trees
#Natural Language Processing#Materials for my 2021 NYU class on NLP and ML Systems (Master of Engineering).
#Awesome#Federated Learning Systems Paper List
NAACL '24 (Best Demo Paper Runner-Up) / MLSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference