quantization
Caffe implementation of Incremental Network Quantization
An out-of-the-box PyTorch scaffold for neural network quantization-aware training (QAT) research. Website: https://github.com/zhutmost/neuralzip
Unofficial implementation of LSQ-Net, a neural network quantization framework
ProxQuant: Quantized Neural Networks via Proximal Operators
A list of papers, docs, and code about model quantization. This repo aims to provide information for model quantization research, and we are continuously improving it. PRs of relevant works are welcome (p…
PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.
This project is the official implementation of our accepted ICLR 2022 paper BiBERT: Accurate Fully Binarized BERT.
A collection of model quantization algorithms. For any issues, please contact Peng Chen ([email protected])
Nonuniform-to-Uniform Quantization: Towards Accurate Quantization via Generalized Straight-Through Estimation. In CVPR 2022. (A minimal straight-through-estimator sketch appears after this list.)
PyTorch implementation of our paper accepted at ECCV 2022 -- Dynamic Dual Trainable Bounds for Ultra-low Precision Super-Resolution Networks
PyTorch implementation of SSQL (accepted to ECCV 2022 as an oral presentation)
Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming
A quantization library for PyTorch. Supports low-precision and mixed-precision quantization, with hardware implementation through TVM.
The official implementation of the NeurIPS 2022 paper Q-ViT.
Code for "Adaptive Gradient Quantization for Data-Parallel SGD", published in NeurIPS 2020.
The official repository for the paper LAB: Learnable Activation Binarizer for Binary Neural Networks.
Join the High Accuracy Club on ImageNet with A Binary Neural Network Ticket
This repository contains the PyTorch scripts to train mixed-precision networks for microcontroller deployment, based on the memory constraints of the target device.
Reorder-based post-training quantization for large language models
Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".
EasyQuant (EQ) is an efficient and simple post-training quantization method that optimizes the scales of weights and activations.
PyTorch implementation of Data Free Quantization Through Weight Equalization and Bias Correction.
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
Example models using DeepSpeed
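Several of the entries above (e.g., the LSQ-Net reimplementation, the N2UQ code, and the QAT scaffolds) rely on fake quantization trained with a straight-through estimator (STE). Below is a minimal, hypothetical PyTorch sketch of that building block; it is not taken from any repository listed here, and the symmetric per-tensor scale and fixed bit-width are simplifying assumptions.

```python
# Illustrative sketch only: symmetric fake quantization with a
# straight-through estimator (STE). Not the implementation of any repo above.
import torch


class FakeQuantSTE(torch.autograd.Function):
    """Quantize in the forward pass; pass gradients through unchanged."""

    @staticmethod
    def forward(ctx, x, scale, num_bits=8):
        qmax = 2 ** (num_bits - 1) - 1
        q = torch.clamp(torch.round(x / scale), -qmax - 1, qmax)
        return q * scale  # dequantized ("fake quantized") tensor

    @staticmethod
    def backward(ctx, grad_output):
        # STE: treat round() and clamp() as identity for the gradient of x;
        # no gradient is propagated to the scale or bit-width here.
        return grad_output, None, None


def fake_quantize(x, num_bits=8):
    # Per-tensor symmetric scale from the tensor's max (an assumption;
    # real frameworks learn or calibrate this value instead).
    scale = x.abs().max() / (2 ** (num_bits - 1) - 1)
    return FakeQuantSTE.apply(x, scale.clamp_min(1e-8), num_bits)


if __name__ == "__main__":
    w = torch.randn(4, 4, requires_grad=True)
    loss = fake_quantize(w, num_bits=4).sum()
    loss.backward()
    print(w.grad)  # gradients flow despite the non-differentiable rounding
```

Frameworks such as LSQ instead learn the scale as a trainable parameter, and post-training tools calibrate it from sample activations, but the gradient trick sketched here is the common denominator.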