#Computer Science#An open-source AutoML toolkit for automating the machine learning lifecycle, including feature engineering, neural architecture search, model compression, and hyper-parameter tuning.
Efficient AI Backbones including GhostNet, TNT and MLP, developed by Huawei Noah's Ark Lab.
#Computer Science#Awesome Knowledge Distillation
Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
#Large Language Models#[CVPR 2023] DepGraph: Towards Any Structural Pruning; LLMs, Vision Foundation Models, etc.
#Computer Science#An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications.
#Computer Science#Awesome Knowledge-Distillation. A categorized collection of knowledge distillation papers (2014-2021).
#Awesome#A curated list of neural network pruning resources.
micronet, a model compression and deployment library. Compression: 1. quantization: quantization-aware training (QAT), High-Bit (>2b) (DoReFa/Quantization and Training of Neural Networks for Efficient Integer-Ari...
#Computer Science#A list of papers, docs, and code on model quantization. This repo aims to provide information for model quantization research, and we are continuously improving the project. Welcome to PR the works (pape...
A flexible PyTorch implementation for exploring deep and shallow knowledge distillation (KD) experiments
PyTorch implementation of various Knowledge Distillation (KD) methods.
#Computer Science#A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.
#Natural Language Processing#NLP DNN Toolkit - Building Your NLP DNN Models Like Playing Lego
Efficient computing methods developed by Huawei Noah's Ark Lab
Channel Pruning for Accelerating Very Deep Neural Networks (ICCV'17)
#Computer Science#Collection of recent methods on (deep) neural network compression and acceleration.
[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free
#Awesome#A list of high-quality (newest) AutoML works and lightweight models including 1.) Neural Architecture Search, 2.) Lightweight Structures, 3.) Model Compression, Quantization and Acceleration, 4.) Hype...
#Computer Science#TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.