Multi-label Iterated Learning for Image Classification with Label Ambiguity

Transfer learning from large-scale pre-trained models has become essential for many computer vision tasks. Recent studies have shown that datasets like ImageNet are weakly labeled, since images containing multiple object classes are assigned a single label. This ambiguity biases models towards a single prediction, which can suppress classes that tend to co-occur in the data. Inspired by the language-emergence literature, we propose multi-label iterated learning (MILe) to incorporate the inductive biases of multi-label learning from single labels, using the framework of iterated learning. MILe is a simple yet effective procedure that builds a multi-label description of the image by propagating binary predictions through successive generations of teacher and student networks with a learning bottleneck. Experiments show that our approach exhibits systematic benefits on ImageNet accuracy as well as ReaL F1 score, which indicates that MILe handles label ambiguity better than the standard training procedure, even when fine-tuning from self-supervised weights. We also show that MILe is effective at reducing label noise, achieving state-of-the-art performance on real-world large-scale noisy data such as WebVision. Furthermore, MILe improves performance in class-incremental settings such as IIRC, and it is robust to distribution shifts. Code: https://github.com/rajeswar18/MILe
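The abstract only outlines the teacher-student procedure, so below is a minimal PyTorch-style sketch of one possible reading of it. The phase lengths (`teacher_steps`, `student_steps`), the sigmoid threshold, the optimizer settings, and the use of a one-vs-all binary cross-entropy loss are all assumptions, not details taken from the paper; consult the linked repository for the authors' actual implementation.

```python
# Hedged sketch of a MILe-style iterated-learning loop (assumptions noted inline).
import copy
import torch
import torch.nn.functional as F
import torchvision


def mile_sketch(loader, num_classes=1000, generations=10,
                teacher_steps=200, student_steps=50, threshold=0.25,
                device="cuda" if torch.cuda.is_available() else "cpu"):
    """Alternate teacher/student phases; the student's short training budget
    plays the role of the learning bottleneck (schedule is an assumption)."""
    teacher = torchvision.models.resnet50(num_classes=num_classes).to(device)
    student = copy.deepcopy(teacher)
    opt_t = torch.optim.SGD(teacher.parameters(), lr=0.1, momentum=0.9)
    opt_s = torch.optim.SGD(student.parameters(), lr=0.1, momentum=0.9)

    data = iter(loader)

    def next_batch():
        # Cycle through the loader indefinitely.
        nonlocal data
        try:
            return next(data)
        except StopIteration:
            data = iter(loader)
            return next(data)

    for _ in range(generations):
        # Teacher phase: fit the single-label ground truth with a
        # one-vs-all binary cross-entropy objective (assumption).
        teacher.train()
        for _ in range(teacher_steps):
            images, labels = next_batch()
            images = images.to(device)
            targets = F.one_hot(labels, num_classes).float().to(device)
            loss = F.binary_cross_entropy_with_logits(teacher(images), targets)
            opt_t.zero_grad()
            loss.backward()
            opt_t.step()

        # Student phase (bottleneck): train briefly on the teacher's
        # thresholded, possibly multi-label, binary predictions.
        teacher.eval()
        student.train()
        for _ in range(student_steps):
            images, _ = next_batch()
            images = images.to(device)
            with torch.no_grad():
                pseudo = (torch.sigmoid(teacher(images)) > threshold).float()
            loss = F.binary_cross_entropy_with_logits(student(images), pseudo)
            opt_s.zero_grad()
            loss.backward()
            opt_s.step()

        # The student becomes the next generation's teacher.
        teacher.load_state_dict(student.state_dict())
    return teacher


# Usage (hypothetical): model = mile_sketch(train_loader, num_classes=1000)
```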

Published at CVPR 2022.
| Task | Dataset | Model | Metric Name | Metric Value | Global Rank |
|---|---|---|---|---|---|
| Image Classification | WebVision-1000 | MILe (ResNet-50-D) | Top-1 Accuracy | 76.5% | #6 |
| Image Classification | WebVision-1000 | MILe (ResNet-50-D) | Top-5 Accuracy | 90.9% | #5 |
| Image Classification | WebVision-1000 | MILe (ResNet-50-D) | ImageNet Top-1 Accuracy | 68.7% | #3 |
| Image Classification | WebVision-1000 | MILe (ResNet-50-D) | ImageNet Top-5 Accuracy | 86.4% | #6 |
| Image Classification | WebVision-1000 | MILe (ResNet-50) | Top-1 Accuracy | 75.2% | #10 |
| Image Classification | WebVision-1000 | MILe (ResNet-50) | Top-5 Accuracy | 90.3% | #8 |
| Image Classification | WebVision-1000 | MILe (ResNet-50) | ImageNet Top-1 Accuracy | 67.1% | #6 |
| Image Classification | WebVision-1000 | MILe (ResNet-50) | ImageNet Top-5 Accuracy | 85.6% | #7 |
