TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Learning with noisy labels	ANIMAL	Nested+Co-teaching (NCT)	Accuracy	84.1	# 11
Learning with noisy labels	ANIMAL	Nested+Co-teaching (NCT)	Network	Vgg19-BN	# 1
Learning with noisy labels	ANIMAL	Nested+Co-teaching (NCT)	ImageNet Pretrained	NO	# 1
Image Classification	Clothing1M	Nested+Co-teaching (ResNet-50)	Accuracy	75%	# 10

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/compressing-features-for-learning-with-noisy/image-classification-on-clothing1m)](https://paperswithcode.com/sota/image-classification-on-clothing1m?p=compressing-features-for-learning-with-noisy)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/compressing-features-for-learning-with-noisy/learning-with-noisy-labels-on-animal)](https://paperswithcode.com/sota/learning-with-noisy-labels-on-animal?p=compressing-features-for-learning-with-noisy)`

Compressing Features for Learning with Noisy Labels

27 Jun 2022 · Yingyi Chen, Shell Xu Hu, Xi Shen, Chunrong Ai, Johan A. K. Suykens ·

Supervised learning can be viewed as distilling relevant information from input data into feature representations. This process becomes difficult when supervision is noisy as the distilled information might not be relevant. In fact, recent research shows that networks can easily overfit all labels including those that are corrupted, and hence can hardly generalize to clean datasets. In this paper, we focus on the problem of learning with noisy labels and introduce compression inductive bias to network architectures to alleviate this over-fitting problem. More precisely, we revisit one classical regularization named Dropout and its variant Nested Dropout. Dropout can serve as a compression constraint for its feature dropping mechanism, while Nested Dropout further learns ordered feature representations w.r.t. feature importance. Moreover, the trained models with compression regularization are further combined with Co-teaching for performance boost. Theoretically, we conduct bias-variance decomposition of the objective function under compression regularization. We analyze it for both single model and Co-teaching. This decomposition provides three insights: (i) it shows that over-fitting is indeed an issue for learning with noisy labels; (ii) through an information bottleneck formulation, it explains why the proposed feature compression helps in combating label noise; (iii) it gives explanations on the performance boost brought by incorporating compression regularization into Co-teaching. Experiments show that our simple approach can have comparable or even better performance than the state-of-the-art methods on benchmarks with real-world label noise including Clothing1M and ANIMAL-10N. Our implementation is available at https://yingyichen-cyy.github.io/CompressFeatNoisyLabels/.

PDF Abstract

Code

Add Remove Mark official

yingyichen-cyy/Nested-Co-teaching official

Tasks

Add Remove

Feature Compression

Feature Importance

Image Classification

Inductive Bias

Learning with noisy labels

Datasets

CIFAR-10

ImageNet

CIFAR-100

Clothing1M

ANIMAL

Results from the Paper

Edit

Ranked #10 on Image Classification on Clothing1M (using extra training data)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Learning with noisy labels	ANIMAL	Nested+Co-teaching (NCT)	Accuracy	84.1	# 11	Compare
			Network	Vgg19-BN	# 1	Compare
			ImageNet Pretrained	NO	# 1	Compare
Image Classification	Clothing1M	Nested+Co-teaching (ResNet-50)	Accuracy	75%	# 10	Compare

Methods

Add Remove

Dropout

Edit Social Preview

Compressing Features for Learning with Noisy Labels

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove