TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Quantization	ImageNet	ResNet50-W4A4 (paper)	Top-1 Accuracy (%)	76.7	# 12
Quantization	ImageNet	ResNet50-W4A4 (paper)	Weight bits	4	# 4
Quantization	ImageNet	ResNet50-W4A4 (paper)	Activation bits	4	# 1
Model Compression	ImageNet	ADLIK-MO-ResNet50+W4A4	Top-1	77.878	# 1
Model Compression	ImageNet	ADLIK-MO-ResNet50+W3A4	Top-1	77.34	# 2
Quantization	ImageNet	ADLIK-MO-ResNet50-W4A4	Top-1 Accuracy (%)	77.878	# 9
Quantization	ImageNet	ADLIK-MO-ResNet50-W4A4	Weight bits	4	# 4
Quantization	ImageNet	ADLIK-MO-ResNet50-W4A4	Activation bits	4	# 1
Quantization	ImageNet	ADLIK-MO-ResNet50-W3A4	Top-1 Accuracy (%)	77.34	# 10
Quantization	ImageNet	ADLIK-MO-ResNet50-W3A4	Weight bits	3	# 2
Quantization	ImageNet	ADLIK-MO-ResNet50-W3A4	Activation bits	4	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/learned-step-size-quantization/model-compression-on-imagenet)](https://paperswithcode.com/sota/model-compression-on-imagenet?p=learned-step-size-quantization)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/learned-step-size-quantization/quantization-on-imagenet)](https://paperswithcode.com/sota/quantization-on-imagenet?p=learned-step-size-quantization)`

Learned Step Size Quantization

ICLR 2020 · Steven K. Esser, Jeffrey L. McKinstry, Deepika Bablani, Rathinakumar Appuswamy, Dharmendra S. Modha ·

Deep networks run with low precision operations at inference time offer power and space advantages over high precision alternatives, but need to overcome the challenge of maintaining high accuracy as precision decreases. Here, we present a method for training such networks, Learned Step Size Quantization, that achieves the highest accuracy to date on the ImageNet dataset when using models, from a variety of architectures, with weights and activations quantized to 2-, 3- or 4-bits of precision, and that can train 3-bit models that reach full precision baseline accuracy. Our approach builds upon existing methods for learning weights in quantized networks by improving how the quantizer itself is configured. Specifically, we introduce a novel means to estimate and scale the task loss gradient at each weight and activation layer's quantizer step size, such that it can be learned in conjunction with other network parameters. This approach works using different levels of precision as needed for a given system and requires only a simple modification of existing training code.

PDF Abstract ICLR 2020 PDF ICLR 2020 Abstract

Code

Add Remove Mark official

zhutmost/lsq-net

245

hustzxd/LSQuantization

114

ZouJiu1/LSQplus

Adlik/model_optimizer

DeadAt0m/LSQFakeQuantize-PyTorch

See all 8 implementations

Tasks

Add Remove

Model Compression

Quantization

Datasets

ImageNet

Results from the Paper

Edit

Ranked #1 on Model Compression on ImageNet

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Quantization	ImageNet	ResNet50-W4A4 (paper)	Top-1 Accuracy (%)	76.7	# 12	Compare
			Weight bits	4	# 4	Compare
			Activation bits	4	# 1	Compare
Model Compression	ImageNet	ADLIK-MO-ResNet50+W4A4	Top-1	77.878	# 1	Compare
Model Compression	ImageNet	ADLIK-MO-ResNet50+W3A4	Top-1	77.34	# 2	Compare
Quantization	ImageNet	ADLIK-MO-ResNet50-W4A4	Top-1 Accuracy (%)	77.878	# 9	Compare
			Weight bits	4	# 4	Compare
			Activation bits	4	# 1	Compare
Quantization	ImageNet	ADLIK-MO-ResNet50-W3A4	Top-1 Accuracy (%)	77.34	# 10	Compare
			Weight bits	3	# 2	Compare
			Activation bits	4	# 1	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Learned Step Size Quantization

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove