Compressive Visual Representations

Learning effective visual representations that generalize well without human supervision is a fundamental problem for applying machine learning to a wide variety of tasks. Recently, two families of self-supervised methods, contrastive learning and latent bootstrapping, exemplified by SimCLR and BYOL respectively, have made significant progress. In this work, we hypothesize that adding explicit information compression to these algorithms yields better and more robust representations. We verify this by developing SimCLR and BYOL formulations compatible with the Conditional Entropy Bottleneck (CEB) objective, allowing us to both measure and control the amount of compression in the learned representation and observe its impact on downstream tasks. Furthermore, we explore the relationship between Lipschitz continuity and compression, showing a tractable lower bound on the Lipschitz constant of the encoders we learn. As Lipschitz continuity is closely related to robustness, this provides a new explanation for why compressed models are more robust. Our experiments confirm that adding compression to SimCLR and BYOL significantly improves linear evaluation accuracy and model robustness across a wide range of domain shifts. In particular, the compressed version of BYOL achieves 76.0% Top-1 linear evaluation accuracy on ImageNet with ResNet-50, and 78.8% with ResNet-50 2x.
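The abstract refers to the Conditional Entropy Bottleneck without spelling it out. As a rough illustration only, the sketch below combines a CEB-style residual-information penalty (an upper bound on I(X;Z|Y)) with an InfoNCE-style contrastive term (a lower bound on I(Y;Z)) under an isotropic-Gaussian encoder assumption. This is not the authors' implementation; the function names, Gaussian parameterization, and `beta` weighting are all illustrative assumptions.

```python
import numpy as np

def log_gaussian(z, mu, sigma):
    """Log density of an isotropic Gaussian N(mu, sigma^2 I), summed over the last axis."""
    return -0.5 * np.sum(((z - mu) / sigma) ** 2 + np.log(2.0 * np.pi * sigma ** 2), axis=-1)

def logsumexp(a, axis):
    m = np.max(a, axis=axis, keepdims=True)
    return np.squeeze(m, axis=axis) + np.log(np.sum(np.exp(a - m), axis=axis))

def ceb_contrastive_loss(mu_x, mu_y, sigma=1.0, beta=0.1, rng=None):
    """Hypothetical CEB-style loss for two augmented views of the same batch.

    mu_x: means of the forward encoder e(z|x), shape (batch, dim)
    mu_y: means of the backward encoder b(z|y), shape (batch, dim)
    beta: compression strength; smaller beta means less compression.
    """
    rng = np.random.default_rng(0) if rng is None else rng
    # Reparameterized sample z ~ e(z|x).
    z = mu_x + sigma * rng.standard_normal(mu_x.shape)

    # Residual-information term: variational upper bound on I(X; Z | Y).
    residual = log_gaussian(z, mu_x, sigma) - log_gaussian(z, mu_y, sigma)

    # Contrastive decoder term: lower bound on I(Y; Z), scoring each z
    # against every y in the batch, InfoNCE fashion.
    logits = log_gaussian(z[:, None, :], mu_y[None, :, :], sigma)  # (batch, batch)
    log_d = np.diag(logits) - logsumexp(logits, axis=1)

    # CEB objective: beta * I(X;Z|Y) - I(Y;Z), averaged over the batch.
    return float(np.mean(beta * residual - log_d))

# Toy usage with random "embeddings" standing in for encoder outputs.
rng = np.random.default_rng(0)
mu_x = rng.standard_normal((8, 16))
mu_y = mu_x + 0.05 * rng.standard_normal((8, 16))  # two views of the same images
print(ceb_contrastive_loss(mu_x, mu_y, beta=0.1, rng=rng))
```

Setting `beta` to zero recovers a purely contrastive objective; increasing it trades task-irrelevant information for more compression, which is the knob the paper studies.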

Benchmark results

Task                                  | Dataset   | Model                              | Metric         | Value | Global Rank
Self-Supervised Image Classification | ImageNet  | C-BYOL (ResNet-50 2x, 1000 epochs) | Top-1 Accuracy | 78.8% | #40
Self-Supervised Image Classification | ImageNet  | C-BYOL (ResNet-50 2x, 1000 epochs) | Top-5 Accuracy | 94.5% | #4
Self-Supervised Image Classification | ImageNet  | C-BYOL (ResNet-50, 1000 epochs)    | Top-1 Accuracy | 75.6% | #66
Self-Supervised Image Classification | ImageNet  | C-BYOL (ResNet-50, 1000 epochs)    | Top-5 Accuracy | 92.7% | #10
Image Classification                 | ObjectNet | C-BYOL                             | Top-1 Accuracy | 25.5  | #81
Image Classification                 | ObjectNet | C-SimCLR                           | Top-1 Accuracy | 20.8  | #88
