Skip to content
View lixcli's full-sized avatar

Block or report lixcli

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

quantization

40 repositories

Caffe Implementation for Incremental network quantization

C++ 191 73 Updated Jul 29, 2018

A Out-of-box PyTorch Scaffold for Neural Network Quantization-Aware-Training (QAT) Research. Website: https://github.com/zhutmost/neuralzip

Python 25 1 Updated Dec 20, 2022

Unofficial implementation of LSQ-Net, a neural network quantization framework

Python 307 44 Updated May 8, 2024
Python 41 7 Updated Dec 15, 2022

Model Quantization Benchmark

Python 855 142 Updated Apr 20, 2025
Python 44 9 Updated Jul 14, 2021

ProxQuant: Quantized Neural Networks via Proximal Operators

Python 30 4 Updated Feb 19, 2019

A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (p…

2,300 232 Updated Mar 4, 2025

PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.

Python 1,774 274 Updated Mar 28, 2024

This project is the official implementation of our accepted ICLR 2022 paper BiBERT: Accurate Fully Binarized BERT.

Python 89 6 Updated Jun 2, 2023

Collections of model quantization algorithms. Any issues, please contact Peng Chen ([email protected])

Python 73 16 Updated Oct 7, 2021

Nonuniform-to-Uniform Quantization: Towards Accurate Quantization via Generalized Straight-Through Estimation. In CVPR 2022.

Python 136 14 Updated Apr 28, 2022

Pytorch implementation of our paper accepted by ECCV2022 -- Dynamic Dual Trainable Bounds for Ultra-low Precision Super-Resolution Networks

Python 30 4 Updated Sep 13, 2022

PyTorch implementation of SSQL (Accepted to ECCV2022 oral presentation)

Python 73 6 Updated Mar 15, 2023

Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming

Python 35 6 Updated Jun 29, 2023
Python 12 1 Updated Aug 26, 2022

Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.

Python 452 83 Updated May 15, 2023

The official implementation of the NeurIPS 2022 paper Q-ViT.

Python 102 13 Updated May 22, 2023

Code for "Adaptive Gradient Quantization for Data-Parallel SGD", published in NeurIPS 2020.

Jupyter Notebook 30 5 Updated Jan 14, 2021

The official repository for the paper LAB: Learnable Activation Binarizer for Binary Neural Networks.

Python 6 1 Updated Oct 27, 2022

Join the High Accuracy Club on ImageNet with A Binary Neural Network Ticket

Python 69 5 Updated Feb 12, 2023

This repository containts the pytorch scripts to train mixed-precision networks for microcontroller deployment, based on the memory contraints of the target device.

Python 50 16 Updated May 9, 2024

Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming

Python 98 23 Updated Jun 10, 2021

Reorder-based post-training quantization for large language model

Python 196 15 Updated May 17, 2023

Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

Python 2,240 188 Updated Mar 27, 2024

Offline Quantization Tools for Deploy.

Python 141 19 Updated Dec 28, 2023

EasyQuant(EQ) is an efficient and simple post-training quantization method via effectively optimizing the scales of weights and activations.

Python 405 69 Updated Nov 22, 2022

PyTorch implementation of Data Free Quantization Through Weight Equalization and Bias Correction.

Python 264 47 Updated Oct 3, 2023

[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

Python 1,576 189 Updated Jul 12, 2024

Example models using DeepSpeed

Python 6,752 1,112 Updated Dec 19, 2025