TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Face Alignment	300W	HR-Net	NME_inter-ocular (%, Full)	3.32	# 24
Face Alignment	300W	HR-Net	NME_inter-ocular (%, Common)	2.87	# 21
Face Alignment	300W	HR-Net	NME_inter-ocular (%, Challenge)	5.15	# 24
Semantic Segmentation	ADE20K	HRNetV2	Validation mIoU	43.2	# 202
Semantic Segmentation	ADE20K val	HRNetV2 (HRNetV2-W48)	mIoU	42.99	# 87
Face Alignment	AFLW-19	HR-Net	NME_diag (%, Full)	1.57	# 13
Face Alignment	AFLW-19	HR-Net	NME_diag (%, Frontal)	1.46	# 9
Semantic Segmentation	Cityscapes test	HRNet (HRNetV2-W48)	Mean IoU (class)	81.6%	# 37
Face Alignment	COFW	HRNet	NME (inter-ocular)	3.45%	# 12
Semantic Segmentation	LIP val	HRNetV2 (HRNetV2-W48)	mIoU	55.90%	# 7

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/high-resolution-representations-for-labeling/semantic-segmentation-on-lip-val)](https://paperswithcode.com/sota/semantic-segmentation-on-lip-val?p=high-resolution-representations-for-labeling)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/high-resolution-representations-for-labeling/face-alignment-on-cofw)](https://paperswithcode.com/sota/face-alignment-on-cofw?p=high-resolution-representations-for-labeling)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/high-resolution-representations-for-labeling/face-alignment-on-aflw-19)](https://paperswithcode.com/sota/face-alignment-on-aflw-19?p=high-resolution-representations-for-labeling)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/high-resolution-representations-for-labeling/face-alignment-on-300w)](https://paperswithcode.com/sota/face-alignment-on-300w?p=high-resolution-representations-for-labeling)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/high-resolution-representations-for-labeling/semantic-segmentation-on-cityscapes)](https://paperswithcode.com/sota/semantic-segmentation-on-cityscapes?p=high-resolution-representations-for-labeling)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/high-resolution-representations-for-labeling/semantic-segmentation-on-ade20k-val)](https://paperswithcode.com/sota/semantic-segmentation-on-ade20k-val?p=high-resolution-representations-for-labeling)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/high-resolution-representations-for-labeling/semantic-segmentation-on-ade20k)](https://paperswithcode.com/sota/semantic-segmentation-on-ade20k?p=high-resolution-representations-for-labeling)`

High-Resolution Representations for Labeling Pixels and Regions

9 Apr 2019 · Ke Sun, Yang Zhao, Borui Jiang, Tianheng Cheng, Bin Xiao, Dong Liu, Yadong Mu, Xinggang Wang, Wenyu Liu, Jingdong Wang

High-resolution representation learning plays an essential role in many vision problems, e.g., pose estimation and semantic segmentation. The high-resolution network (HRNet) [SunXLW19], recently developed for human pose estimation, maintains high-resolution representations throughout the whole process by connecting high-to-low resolution convolutions in parallel, and produces strong high-resolution representations by repeatedly fusing across the parallel convolutions. In this paper, we study high-resolution representations further by introducing a simple yet effective modification and applying it to a wide range of vision tasks. We augment the high-resolution representation by aggregating the (upsampled) representations from all the parallel convolutions, rather than using only the representation from the high-resolution convolution as done in [SunXLW19]. This simple modification leads to stronger representations, evidenced by superior results. We show top results in semantic segmentation on Cityscapes, LIP, and PASCAL Context, and in facial landmark detection on AFLW, COFW, 300W, and WFLW. In addition, we build a multi-level representation from the high-resolution representation and apply it to the Faster R-CNN object detection framework and its extended frameworks. The proposed approach achieves results superior to those of existing single-model networks on COCO object detection. The code and models are publicly available at https://github.com/HRNet.
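The aggregation step described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it uses plain Python lists, nearest-neighbour upsampling as a stand-in for whatever interpolation the released code uses (likely bilinear), and hypothetical names (`upsample_nearest`, `aggregate_branches`, `make_map`). It only shows the idea of bringing every parallel branch to the highest resolution and concatenating the channels, in contrast to keeping the high-resolution branch alone.

```python
# Illustrative sketch only. A feature map is a list of channels, and each
# channel is a 2D list (rows x columns). All names here are hypothetical.

def upsample_nearest(fmap, factor):
    """Upsample every channel of a feature map by an integer factor
    using nearest-neighbour repetition (assumed stand-in for bilinear)."""
    out = []
    for channel in fmap:
        rows = []
        for row in channel:
            # Repeat each value `factor` times along the width ...
            wide = [v for v in row for _ in range(factor)]
            # ... and repeat the widened row `factor` times along the height.
            rows.extend(list(wide) for _ in range(factor))
        out.append(rows)
    return out


def aggregate_branches(branches):
    """Upsample all branches to the resolution of the first (highest
    resolution) branch and concatenate them along the channel axis."""
    target_height = len(branches[0][0])
    fused = []
    for fmap in branches:
        factor = target_height // len(fmap[0])
        fused.extend(upsample_nearest(fmap, factor))
    return fused


def make_map(channels, size, value):
    """Build a constant square feature map for the demo."""
    return [[[value] * size for _ in range(size)] for _ in range(channels)]


# Four parallel branches: resolution halves while channel count doubles.
branches = [
    make_map(1, 8, 1.0),
    make_map(2, 4, 2.0),
    make_map(4, 2, 3.0),
    make_map(8, 1, 4.0),
]

high_res_only = branches[0]           # what the earlier design would output
fused = aggregate_branches(branches)  # the aggregated representation

print(len(high_res_only), len(fused))      # channel counts: 1 15
print(len(fused[0]), len(fused[0][0]))     # spatial size: 8 8
```

In a real network the concatenated channels would be passed on to task-specific layers; those are omitted here.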

Code

Repository	Stars
leoxiaobin/deep-high-resolution-net… (official)	4,220
PaddlePaddle/PaddleDetection	12,041
PaddlePaddle/PaddleClas	5,251
CSAILVision/semantic-segmentation-p…	4,834
HRNet/HRNet-Semantic-Segmentation	3,048

Showing 5 of 39 implementations.

Tasks

Face Alignment

Facial Landmark Detection

Object Detection

Pose Estimation

Representation Learning

Semantic Segmentation

Vocal Bursts Intensity Prediction

Datasets

Cityscapes

ADE20K

300W

Helen AFW

LFPW

COFW

WFLW

LIP

AFLW-19

Results from the Paper

Ranked #7 on Semantic Segmentation on LIP val

Task	Dataset	Model	Metric Name	Metric Value	Global Rank
Semantic Segmentation	ADE20K	HRNetV2	Validation mIoU	43.2	# 202
Face Alignment	AFLW-19	HR-Net	NME_diag (%, Full)	1.57	# 13
Face Alignment	AFLW-19	HR-Net	NME_diag (%, Frontal)	1.46	# 9
Semantic Segmentation	Cityscapes test	HRNet (HRNetV2-W48)	Mean IoU (class)	81.6%	# 37
Face Alignment	COFW	HRNet	NME (inter-ocular)	3.45%	# 12
Semantic Segmentation	LIP val	HRNetV2 (HRNetV2-W48)	mIoU	55.90%	# 7

Results from Other Papers

Task	Dataset	Model	Metric Name	Metric Value	Rank
Face Alignment	300W	HR-Net	NME_inter-ocular (%, Full)	3.32	# 24
Face Alignment	300W	HR-Net	NME_inter-ocular (%, Common)	2.87	# 21
Face Alignment	300W	HR-Net	NME_inter-ocular (%, Challenge)	5.15	# 24
Semantic Segmentation	ADE20K val	HRNetV2 (HRNetV2-W48)	mIoU	42.99	# 87

Methods

Convolution • Faster R-CNN • RoIPool • RPN • Softmax
