#

pruning

https://static.github-zh.com/github_avatars/datawhalechina?size=40

#大语言模型#《李宏毅深度学习教程》(李宏毅老师推荐👍,苹果书🍎),PDF下载地址:https://github.com/datawhalechina/leedl-tutorial/releases

Jupyter Notebook 15.72 k
3 个月前
https://static.github-zh.com/github_avatars/IntelLabs?size=40

Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research. https://intellabs.github.io/distiller

Jupyter Notebook 4.4 k
2 年前
https://static.github-zh.com/github_avatars/VainF?size=40

#大语言模型#[CVPR 2023] DepGraph: Towards Any Structural Pruning; LLMs, Vision Foundation Models, etc.

Python 3.13 k
11 天前
https://static.github-zh.com/github_avatars/intel?size=40

SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime

Python 2.49 k
1 天前
https://static.github-zh.com/github_avatars/he-y?size=40

#Awesome#A curated list of neural network pruning resources.

2.47 k
1 年前
https://static.github-zh.com/github_avatars/quic?size=40

#计算机科学#AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.

Python 2.45 k
15 小时前
https://static.github-zh.com/github_avatars/666DZY666?size=40

micronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantization and Training of Neural Networks for Efficient Integer-Ari...

Python 2.26 k
4 个月前
https://static.github-zh.com/github_avatars/PaddlePaddle?size=40
Python 1.6 k
16 天前
https://static.github-zh.com/github_avatars/tensorflow?size=40

#计算机科学#A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.

Python 1.55 k
8 天前
https://static.github-zh.com/github_avatars/huawei-noah?size=40
Jupyter Notebook 1.29 k
10 个月前
https://static.github-zh.com/github_avatars/horseee?size=40

#大语言模型#[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baichuan, TinyLlama, etc.

Python 1.06 k
1 年前
https://static.github-zh.com/github_avatars/jacobgil?size=40

#计算机科学#PyTorch Implementation of [1611.06440] Pruning Convolutional Neural Networks for Resource Efficient Inference

Python 884
6 年前
https://static.github-zh.com/github_avatars/Syencil?size=40

mobilev2-yolov5s剪枝、蒸馏,支持ncnn,tensorRT部署。ultra-light but better performence!

Jupyter Notebook 856
3 年前
loading...
Website
Wikipedia