HPTQ: Hardware-Friendly Post Training Quantization

Neural network quantization enables the deployment of models on edge devices. An essential requirement for hardware efficiency is that the quantizers be hardware-friendly: uniform, symmetric, and with power-of-two thresholds. To the best of our knowledge, current post-training quantization methods do not support all of these constraints simultaneously. In this work, we introduce a hardware-friendly post-training quantization (HPTQ) framework, which addresses this problem by synergistically combining several known quantization methods. We perform a large-scale study on four tasks: classification, object detection, semantic segmentation, and pose estimation, over a wide variety of network architectures. Our extensive experiments show that competitive results can be obtained under these hardware-friendly constraints.
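To make these constraints concrete, here is a minimal NumPy sketch of a uniform, symmetric quantizer whose clipping threshold is restricted to a power of two, so rescaling maps to bit shifts in hardware. This is an illustration of the constraints only, not the paper's implementation; the function names and the simple max-based threshold choice are ours.

```python
import numpy as np

def pow2_threshold(x):
    """Smallest power-of-two threshold covering the tensor's max magnitude."""
    max_abs = float(np.max(np.abs(x)))
    if max_abs == 0.0:
        return 1.0  # degenerate all-zero tensor; any threshold works
    return 2.0 ** np.ceil(np.log2(max_abs))

def quantize_symmetric(x, threshold, n_bits=8):
    """Uniform symmetric quantization with a power-of-two threshold.

    Returns the dequantized ("fake-quantized") tensor, as used to
    simulate integer inference in floating point.
    """
    levels = 2 ** (n_bits - 1)      # e.g. 128 for 8 bits
    scale = threshold / levels      # power-of-two scale => shift-friendly
    q = np.clip(np.round(x / scale), -levels, levels - 1)
    return q * scale

# Example: 8-bit quantization of a random conv kernel.
w = np.random.randn(64, 3, 3, 3).astype(np.float32)
t = pow2_threshold(w)              # threshold rounded up to a power of two
w_q = quantize_symmetric(w, t)     # values on a uniform symmetric 8-bit grid
```

The W8A8 labels in the results below indicate 8-bit weights and 8-bit activations (n_bits=8 for both in the sketch's terms).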


Results from the Paper


| Task | Dataset | Model | Metric Name | Metric Value | Global Rank |
|------|---------|-------|-------------|--------------|-------------|
| Quantization | ImageNet | DenseNet-121 W8A8 | Top-1 Accuracy (%) | 73.356 | #19 |
| Quantization | ImageNet | DenseNet-121 W8A8 | Weight bits | 8 | #10 |
| Quantization | ImageNet | DenseNet-121 W8A8 | Activation bits | 8 | #9 |
| Quantization | ImageNet | EfficientNet-B0 W8A8 | Top-1 Accuracy (%) | 74.216 | #16 |
| Quantization | ImageNet | EfficientNet-B0 W8A8 | Weight bits | 8 | #10 |
| Quantization | ImageNet | EfficientNet-B0 W8A8 | Activation bits | 8 | #9 |
| Quantization | ImageNet | EfficientNet-B0 ReLU W8A8 | Top-1 Accuracy (%) | 77.092 | #11 |
| Quantization | ImageNet | EfficientNet-B0 ReLU W8A8 | Weight bits | 8 | #10 |
| Quantization | ImageNet | EfficientNet-B0 ReLU W8A8 | Activation bits | 8 | #9 |
| Quantization | ImageNet | Xception W8A8 | Top-1 Accuracy (%) | 78.972 | #8 |
| Quantization | ImageNet | Xception W8A8 | Weight bits | 8 | #10 |
| Quantization | ImageNet | Xception W8A8 | Activation bits | 8 | #9 |
| Quantization | ImageNet | MobileNetV2 W8A8 | Top-1 Accuracy (%) | 71.46 | #23 |
| Quantization | ImageNet | MobileNetV2 W8A8 | Weight bits | 8 | #10 |
| Quantization | ImageNet | MobileNetV2 W8A8 | Activation bits | 8 | #9 |
| Quantization | MS COCO | SSD ResNet50 V1 FPN 640x640 | mAP | 34.3 | #1 |
