#自然语言处理#[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization
#计算机科学#59 篇深度学习论文的实现,并带有详细注释。包括 transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, ...), gans(cyclegan, stylegan2, ...), 🎮 强化学习 (ppo, dqn), capsnet, distillation, ... 🧠
Tutorial for Harvard Medical School ML from Scratch Series: Transformer from Scratch. Demo the usage of transformer in various domains: Music sheet, audio signal, image generation & discrimination
Notes for C++ Deep Dive Course on Udemy by Abdul Bari.
All Homeworks for TinyML and Efficient Deep Learning Computing 6.5940 • Fall • 2023 • https://efficientml.ai
Educational materials on deep learning by Weights & Biases
#计算机科学#Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learnin...
4 bits quantization of LLaMA using GPTQ
Learn C++ Programming -Beginner to Advance- Deep Dive in C++
#自然语言处理#[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization
#计算机科学#Faster Whisper transcription with CTranslate2
iPG Y-Combinator Interview Simulator
Lab 5 project of MIT-6.5940, deploying LLaMA2-7B-chat on one's laptop with TinyChatEngine.
0 条讨论