GitHub Chinese Community
#post-training-quantization

intel / neural-compressor

SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime

low-precision, pruning, sparsity, auto-tuning, knowledge-distillation, quantization, quantization-aware-training, post-training-quantization, smoothquant, large-language-models, gptq, int8
Python 2.43 k
3 days ago
666DZY666 / micronet

micronet, a model compression and deployment library. Compression: 1. quantization: quantization-aware training (QAT), High-Bit (>2b) (DoReFa/Quantization and Training of Neural Networks for Efficient Integer-Ari...

quantization, pruning, dorefa, twn, bnn, xnor-net, PyTorch, model-compression, group-convolution, convolutional-networks, quantization-aware-training, post-training-quantization, tensorrt, onnx
Python 2.25 k
1 month ago
alibaba / TinyNeuralNetwork

#Computer Science# TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.

PyTorch, deep learning, model-compression, pruning, model-converter, quantization-aware-training, deep neural networks, post-training-quantization
Python 827
21 days ago
SqueezeAILab / SqueezeLLM

#Natural Language Processing# [ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization

efficient-inference, large-language-models, large language models, model-compression, natural language processing, post-training-quantization, quantization, text-generation, transformer, llama, localllm
Python 691
10 months ago
ModelTC / llmc

#Large Language Models# [EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".

deployment, large language models, pruning, quantization, tools, benchmark, evaluation, large-language-models, internlm2, llama3, smoothquant, post-training-quantization, mixtral, vllm
Python 486
6 days ago
Xiuyu-Li / q-diffusion

[ICCV 2023] Q-Diffusion: Quantizing Diffusion Models.

diffusion-models, quantization, PyTorch, stable-diffusion, model-compression, post-training-quantization
Python 347
1 year ago
megvii-research / FQ-ViT

[IJCAI 2022] FQ-ViT: Post-Training Quantization for Fully Quantized Vision Transformer

vision-transformer, quantization, post-training-quantization, PyTorch, imagenet
Python 343
2 years ago
megvii-research / Sparsebit

#Computer Science# A model compression and acceleration toolbox based on PyTorch.

deep learning, post-training-quantization, pruning, quantization, quantization-aware-training, sparse, tensorrt
Python 331
1 year ago
sayakpaul / Adventures-in-TensorFlow-Lite

This repository contains notebooks that show the usage of TensorFlow Lite for quantizing deep neural networks.

tensorflow-2, tensorflow-lite, on-device-ml, model-quantization, post-training-quantization, quantization-aware-training, pruning, inference
Jupyter Notebook 170
2 years ago
Hsu1023 / DuQuant

#Large Language Models# [NeurIPS 2024 Oral🔥] DuQuant: Distributing Outliers via Dual Transformation Makes Stronger Quantized LLMs.

large-language-models, large language models, post-training-quantization, quantization
Python 161
8 months ago
hkproj / quantization-notes

#Computer Science# Notes on quantization in neural networks

deep learning, neural-networks, post-training-quantization, PyTorch, quantization, quantization-aware-training
Jupyter Notebook 85
2 years ago
ModelTC / TFMQ-DM

[CVPR 2024 Highlight] This is the official PyTorch implementation of "TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Models".

diffusion-models, post-training-quantization, stable-diffusion, cvpr, cvpr2024, quantization, highlight
Jupyter Notebook 63
10 months ago
ModelTC / QLLM

#Large Language Models# [ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Models"

llama, llama2, large language models, post-training-quantization, PyTorch, quantization, transformers
Python 38
1 year ago
Sanjana7395 / static_quantization

Post-training static quantization using ResNet18 architecture

quantization, post-training-quantization, mnist-classification, PyTorch
Jupyter Notebook 37
5 years ago
zysxmu / FDDA

PyTorch implementation of our ECCV 2022 paper, "Fine-grained Data Distribution Alignment for Post-Training Quantization"

post-training-quantization, acceleration, compression
Python 15
3 years ago
KwangHoonAn / Quantizations

quantization, post-training-quantization
Python 13
4 years ago
shieldforever / NeuronQuant

[ASP-DAC 2025] "NeuronQuant: Accurate and Efficient Post-Training Quantization for Spiking Neural Networks" Official Implementation

post-training-quantization
Python 11
3 months ago
iszry / DI2N-PTQ4DM

Improved the performance of 8-bit PTQ4DM, especially on FID.

diffusion-model, post-training-quantization
Python 11
2 years ago
Rumeysakeskin / ASR-Quantization

Post-training quantization of an NVIDIA NeMo ASR model

model-deployment, post-training-quantization, PyTorch, pytorch-lightning, quantization, speech-recognition
Jupyter Notebook 7
2 years ago
GongCheng1919 / bias-compensation

[CAAI AIR'24] Minimize Quantization Output Error with Bias Compensation

post-training-quantization
Python 7
3 months ago