TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Multi-Label Classification	MS-COCO	MCAR (ResNet101, 576x576)	mAP	84.5	# 25
Multi-Label Classification	MS-COCO	MCAR (ResNet101, 448x448)	mAP	83.8	# 26
Multi-Label Classification	PASCAL VOC 2007	MCAR (ResNet101, 448x448)	mAP	94.8	# 10
Multi-Label Classification	PASCAL VOC 2012	MCAR (ResNet101, 448x448)	mAP	94.3	# 2
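
The metric in the table above is mAP (mean average precision), reported on a 0-100 scale. As a rough illustration of what that number summarizes, the sketch below computes non-interpolated average precision per class and averages it over classes. It is an assumption that this matches the benchmarks' evaluation protocol: this page does not state it, and VOC 2007 in particular is often scored with an interpolated variant. The function names `average_precision` and `mean_average_precision` are hypothetical.

```python
import math


def average_precision(scores, labels):
    """Non-interpolated AP for one class.

    scores: predicted confidences, one per image.
    labels: 1 if the class is present in the image, else 0.
    """
    n_pos = sum(labels)
    if n_pos == 0:
        return math.nan  # AP is undefined when the class never occurs
    # Rank images from most to least confident (stable for ties).
    order = sorted(range(len(scores)), key=lambda i: -scores[i])
    hits = 0
    precision_sum = 0.0
    for rank, i in enumerate(order, start=1):
        if labels[i]:
            hits += 1
            precision_sum += hits / rank  # precision at this positive
    return precision_sum / n_pos


def mean_average_precision(score_rows, label_rows):
    """mAP on a 0-100 scale; rows are images, columns are classes."""
    n_classes = len(score_rows[0])
    aps = []
    for c in range(n_classes):
        ap = average_precision([row[c] for row in score_rows],
                               [row[c] for row in label_rows])
        if not math.isnan(ap):  # skip classes with no positives
            aps.append(ap)
    return 100.0 * sum(aps) / len(aps) if aps else math.nan
```

For example, a class whose two positives are ranked first and third among four images gets an AP of (1/1 + 2/3) / 2, about 0.833.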

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/multi-label-image-recognition-with-multi/multi-label-classification-on-pascal-voc-2012)](https://paperswithcode.com/sota/multi-label-classification-on-pascal-voc-2012?p=multi-label-image-recognition-with-multi)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/multi-label-image-recognition-with-multi/multi-label-classification-on-pascal-voc-2007)](https://paperswithcode.com/sota/multi-label-classification-on-pascal-voc-2007?p=multi-label-image-recognition-with-multi)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/multi-label-image-recognition-with-multi/multi-label-classification-on-ms-coco)](https://paperswithcode.com/sota/multi-label-classification-on-ms-coco?p=multi-label-image-recognition-with-multi)`

Learning to Discover Multi-Class Attentional Regions for Multi-Label Image Recognition

3 Jul 2020 · Bin-Bin Gao, Hong-Yu Zhou

Multi-label image recognition is a practical and challenging task compared to single-label image classification. However, previous works may be suboptimal because of a great number of object proposals or complex attentional region generation modules. In this paper, we propose a simple but efficient two-stream framework to recognize multi-category objects from global image to local regions, similar to how human beings perceive objects. To bridge the gap between global and local streams, we propose a multi-class attentional region module which aims to make the number of attentional regions as small as possible and keep the diversity of these regions as high as possible. Our method can efficiently and effectively recognize multi-class objects with an affordable computation cost and a parameter-free region localization module. Over three benchmarks on multi-label image classification, we create new state-of-the-art results with a single model only using image semantics without label dependency. In addition, the effectiveness of the proposed method is extensively demonstrated under different factors such as global pooling strategy, input size and network architecture. Code has been made available at https://github.com/gaobb/MCAR.
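
The abstract mentions a parameter-free region localization module but does not describe it here. Purely as an illustration of how a region could be cut out of a class activation map without learned weights, the sketch below thresholds a 2D map relative to its value range and returns the tightest bounding box around the cells that pass. This is an assumption for illustration, not the authors' procedure; the function name `localize_region` and the `threshold` argument are hypothetical, and the linked repository remains the reference for the actual method.

```python
def localize_region(activation, threshold=0.5):
    """Return (x_min, y_min, x_max, y_max) of the cells in a 2D
    activation map whose value is at least `threshold` of the way
    from the map's minimum to its maximum.

    activation: list of equal-length rows of floats.
    Illustrative only; not taken from the paper.
    """
    height, width = len(activation), len(activation[0])
    flat = [v for row in activation for v in row]
    lo, hi = min(flat), max(flat)
    if hi == lo:
        # A flat map carries no localization signal: keep the whole image.
        return (0, 0, width - 1, height - 1)
    cut = lo + threshold * (hi - lo)
    # Project the thresholded map onto each axis.
    rows = [y for y, row in enumerate(activation)
            if any(v >= cut for v in row)]
    cols = [x for x in range(width)
            if any(row[x] >= cut for row in activation)]
    return (min(cols), min(rows), max(cols), max(rows))


# A 4x4 map with a hot 2x2 block in the centre.
heat = [
    [0.0, 0.0, 0.0, 0.0],
    [0.0, 0.9, 1.0, 0.0],
    [0.0, 0.8, 0.7, 0.0],
    [0.0, 0.0, 0.0, 0.0],
]
box = localize_region(heat)
```

In a two-stream setup of the kind the abstract describes, a box like this would be scaled to image coordinates, cropped, and passed to the local stream; how the global and local predictions are combined is not stated on this page.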

Code

gaobb/MCAR official

Tasks

Multi-Label Classification

Multi-Label Image Classification

Datasets

MS COCO

PASCAL VOC

PASCAL VOC 2007

Results from the Paper

Ranked #2 on Multi-Label Classification on PASCAL VOC 2012
