Quantization
1032 papers with code • 10 benchmarks • 18 datasets
Quantization is a promising technique for reducing the computational cost of neural network training: it replaces high-cost floating-point numbers (e.g., float32) with low-cost fixed-point numbers (e.g., int8/int16).
Source: Adaptive Precision Training: Quantify Back Propagation in Neural Networks with Fixed-point Numbers
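To make the float-to-fixed-point replacement concrete, here is a minimal sketch of affine (asymmetric) int8 quantization of a float32 array. The helper names (`quantize_int8`, `dequantize`) are ours for illustration, not from any particular library:

```python
import numpy as np

def quantize_int8(x):
    """Map a float32 array to int8 plus a (scale, zero_point) pair.

    Illustrative sketch: real frameworks calibrate ranges more carefully
    (e.g., per-channel, with percentile clipping).
    """
    qmin, qmax = -128, 127
    x_min, x_max = float(x.min()), float(x.max())
    scale = (x_max - x_min) / (qmax - qmin) or 1.0  # avoid 0 for constant inputs
    zero_point = int(round(qmin - x_min / scale))
    q = np.clip(np.round(x / scale) + zero_point, qmin, qmax).astype(np.int8)
    return q, scale, zero_point

def dequantize(q, scale, zero_point):
    """Recover an approximate float32 array from the int8 representation."""
    return (q.astype(np.float32) - zero_point) * scale

q, scale, zp = quantize_int8(np.array([-1.0, 0.0, 1.0], dtype=np.float32))
x_hat = dequantize(q, scale, zp)
```

Each element is stored in one byte instead of four, at the cost of a reconstruction error of at most about one quantization step.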
Libraries
Use these libraries to find Quantization models and implementations.
Most implemented papers
Polysemous codes
This paper considers the problem of approximate nearest neighbor search in the compressed domain.
Learned Step Size Quantization
Deep networks run with low precision operations at inference time offer power and space advantages over high precision alternatives, but need to overcome the challenge of maintaining high accuracy as precision decreases.
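A sketch of the forward pass of LSQ-style fake quantization, assuming the quantize-then-dequantize formulation from the paper (clip, round by step size s, rescale). Only the forward computation is shown; in training, s is a learned parameter updated via a straight-through estimator for round():

```python
import numpy as np

def lsq_fake_quant(v, s, n_bits=8, signed=True):
    """Quantize v with learnable step size s, then dequantize (sketch).

    signed: use a symmetric signed range [-2^(b-1), 2^(b-1)-1],
    otherwise an unsigned range [0, 2^b - 1].
    """
    if signed:
        q_n, q_p = 2 ** (n_bits - 1), 2 ** (n_bits - 1) - 1
        v_bar = np.round(np.clip(v / s, -q_n, q_p))  # integer code
    else:
        q_p = 2 ** n_bits - 1
        v_bar = np.round(np.clip(v / s, 0, q_p))
    return v_bar * s  # dequantized value used by the next layer

v_hat = lsq_fake_quant(np.array([0.26, -0.31]), s=0.1)
```

The paper additionally scales the gradient flowing to s to keep its updates balanced against the weight updates; that detail is omitted here.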
Improvements to Target-Based 3D LiDAR to Camera Calibration
The homogeneous transformation between a LiDAR and monocular camera is required for sensor fusion tasks, such as SLAM.
YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications
The YOLO community has prospered overwhelmingly to enrich its use in a multitude of hardware platforms and abundant scenarios.
Link and code: Fast indexing with graphs and compact regression codes
Similarity search approaches based on graph walks have recently attained outstanding speed-accuracy trade-offs, setting aside the memory requirements.
Single Path One-Shot Neural Architecture Search with Uniform Sampling
It is easy to train and fast to search.
Unsupervised Cross-lingual Representation Learning for Speech Recognition
This paper presents XLSR which learns cross-lingual speech representations by pretraining a single model from the raw waveform of speech in multiple languages.
QVRF: A Quantization-error-aware Variable Rate Framework for Learned Image Compression
In this paper, we present a Quantization-error-aware Variable Rate Framework (QVRF) that utilizes a univariate quantization regulator a to achieve wide-range variable rates within a single model.
Model compression via distillation and quantization
Deep neural networks (DNNs) continue to make significant advances, solving tasks from image classification to translation or reinforcement learning.
Data-Free Quantization Through Weight Equalization and Bias Correction
This improves quantized-model accuracy, and can be applied to many common computer vision architectures with a straightforward API call.
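The core idea of cross-layer weight equalization can be sketched for two consecutive linear layers separated by a ReLU: because ReLU commutes with positive per-channel scaling, each output channel of layer 1 can be rescaled (and the scale absorbed into layer 2) so the two layers' per-channel weight ranges match, which makes per-tensor quantization less lossy. The function below is our illustration of that idea, not the library's API:

```python
import numpy as np

def equalize(w1, b1, w2):
    """Equalize per-channel weight ranges across two layers (sketch).

    w1: (out, in) weights of layer 1, b1: its bias,
    w2: (out, in) weights of layer 2; a ReLU is assumed between them.
    """
    r1 = np.abs(w1).max(axis=1)   # range of each output channel of layer 1
    r2 = np.abs(w2).max(axis=0)   # range of each input channel of layer 2
    s = np.sqrt(r1 / r2)          # s_i = (1/r2_i) * sqrt(r1_i * r2_i)
    w1_eq = w1 / s[:, None]       # scale channel i of layer 1 down by s_i
    b1_eq = b1 / s                # bias scales with its channel
    w2_eq = w2 * s[None, :]       # compensate in layer 2 input channel i
    return w1_eq, b1_eq, w2_eq, s

rng = np.random.default_rng(0)
w1 = rng.standard_normal((3, 4))
b1 = rng.standard_normal(3)
w2 = rng.standard_normal((2, 3))
w1_eq, b1_eq, w2_eq, s = equalize(w1, b1, w2)
```

After equalization the network's function is unchanged (w2_eq · relu(w1_eq·x + b1_eq) equals the original), but both layers now share the same per-channel range sqrt(r1_i * r2_i), so a single quantization grid fits them better.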