TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Zero-Shot Transfer Image Classification	ImageNet	LiT-tuning	Accuracy (Private)	84.5	# 7
Zero-Shot Transfer Image Classification	ImageNet	LiT-tuning	Accuracy (Public)	75.7	# 2
Zero-Shot Transfer Image Classification	ImageNet-A	LiT-tuning	Accuracy (Private)	79.4	# 9
Zero-Shot Transfer Image Classification	ImageNet-A	LiT-tuning	Accuracy (Public)	37.8	# 1
Zero-Shot Transfer Image Classification	ImageNet-R	LiT-tuning	Accuracy	93.9	# 8
Zero-Shot Transfer Image Classification	ImageNet ReaL	LiT-tuning	Accuracy (Private)	88.0	# 1
Zero-Shot Transfer Image Classification	ImageNet ReaL	LiT-tuning	Accuracy (Public)	82.2	# 1
Zero-Shot Transfer Image Classification	ImageNet V2	LiT-tuning	Accuracy (Private)	78.7	# 6
Zero-Shot Transfer Image Classification	ImageNet V2	LiT-tuning	Accuracy (Public)	66.6	# 1
Image Classification	ObjectNet	LiT	Top-1 Accuracy	82.5	# 2
Zero-Shot Transfer Image Classification	ObjectNet	LiT-tuning	Accuracy (Private)	81.1	# 5
Zero-Shot Transfer Image Classification	ObjectNet	LiT-tuning	Accuracy (Public)	54.5	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/lit-zero-shot-transfer-with-locked-image-text/zero-shot-transfer-image-classification-on-7)](https://paperswithcode.com/sota/zero-shot-transfer-image-classification-on-7?p=lit-zero-shot-transfer-with-locked-image-text)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/lit-zero-shot-transfer-with-locked-image-text/image-classification-on-objectnet)](https://paperswithcode.com/sota/image-classification-on-objectnet?p=lit-zero-shot-transfer-with-locked-image-text)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/lit-zero-shot-transfer-with-locked-image-text/zero-shot-transfer-image-classification-on-6)](https://paperswithcode.com/sota/zero-shot-transfer-image-classification-on-6?p=lit-zero-shot-transfer-with-locked-image-text)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/lit-zero-shot-transfer-with-locked-image-text/zero-shot-transfer-image-classification-on-3)](https://paperswithcode.com/sota/zero-shot-transfer-image-classification-on-3?p=lit-zero-shot-transfer-with-locked-image-text)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/lit-zero-shot-transfer-with-locked-image-text/zero-shot-transfer-image-classification-on-1)](https://paperswithcode.com/sota/zero-shot-transfer-image-classification-on-1?p=lit-zero-shot-transfer-with-locked-image-text)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/lit-zero-shot-transfer-with-locked-image-text/zero-shot-transfer-image-classification-on-4)](https://paperswithcode.com/sota/zero-shot-transfer-image-classification-on-4?p=lit-zero-shot-transfer-with-locked-image-text)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/lit-zero-shot-transfer-with-locked-image-text/zero-shot-transfer-image-classification-on-5)](https://paperswithcode.com/sota/zero-shot-transfer-image-classification-on-5?p=lit-zero-shot-transfer-with-locked-image-text)`

LiT: Zero-Shot Transfer with Locked-image text Tuning

CVPR 2022 · Xiaohua Zhai, Xiao Wang, Basil Mustafa, Andreas Steiner, Daniel Keysers, Alexander Kolesnikov, Lucas Beyer

This paper presents contrastive-tuning, a simple method employing contrastive training to align image and text models while still taking advantage of their pre-training. In our empirical study we find that locked pre-trained image models with unlocked text models work best. We call this instance of contrastive-tuning "Locked-image Tuning" (LiT), which just teaches a text model to read out good representations from a pre-trained image model for new tasks. A LiT model gains the capability of zero-shot transfer to new vision tasks, such as image classification or retrieval. The proposed LiT is widely applicable; it works reliably with multiple pre-training methods (supervised and unsupervised) and across diverse architectures (ResNet, Vision Transformers and MLP-Mixer) using three different image-text datasets. With the transformer-based pre-trained ViT-g/14 model, the LiT model achieves 85.2% zero-shot transfer accuracy on the ImageNet test set, and 82.5% on the challenging out-of-distribution ObjectNet test set.
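The idea in the abstract can be illustrated with a minimal NumPy sketch, under stated assumptions. The image embeddings are treated as fixed outputs of a locked (frozen) image model, so only the text side would ever receive updates. The symmetric softmax loss, the temperature value, and all function and variable names below are illustrative assumptions, not the authors' implementation. Zero-shot classification is shown as a nearest-class-text lookup in the shared embedding space.

```python
import numpy as np


def l2_normalize(x, axis=-1, eps=1e-12):
    """Scale vectors to unit length so dot products are cosine similarities."""
    return x / np.maximum(np.linalg.norm(x, axis=axis, keepdims=True), eps)


def log_softmax(logits, axis):
    """Numerically stable log-softmax along the given axis."""
    shifted = logits - logits.max(axis=axis, keepdims=True)
    return shifted - np.log(np.exp(shifted).sum(axis=axis, keepdims=True))


def contrastive_loss(image_emb, text_emb, temperature=0.1):
    """Symmetric image-text contrastive loss (an assumed, common formulation).

    Row i of image_emb and row i of text_emb form a matched pair; every
    other row in the batch serves as a negative.
    """
    img = l2_normalize(image_emb)
    txt = l2_normalize(text_emb)
    logits = img @ txt.T / temperature              # (batch, batch) similarities
    diag = np.arange(logits.shape[0])
    loss_i2t = -log_softmax(logits, axis=1)[diag, diag].mean()  # image -> text
    loss_t2i = -log_softmax(logits, axis=0)[diag, diag].mean()  # text -> image
    return 0.5 * (loss_i2t + loss_t2i)


def zero_shot_classify(image_emb, class_text_emb):
    """Assign each image to the class whose text embedding is most similar."""
    sims = l2_normalize(image_emb) @ l2_normalize(class_text_emb).T
    return sims.argmax(axis=1)


# Toy data (hypothetical): in a locked-image setup these image embeddings come
# from a pre-trained model and stay constant; only the text embeddings would
# change during tuning.
rng = np.random.default_rng(0)
locked_image_emb = rng.normal(size=(4, 8))
aligned_text_emb = locked_image_emb + 0.01 * rng.normal(size=(4, 8))
random_text_emb = rng.normal(size=(4, 8))

loss_aligned = contrastive_loss(locked_image_emb, aligned_text_emb)
loss_random = contrastive_loss(locked_image_emb, random_text_emb)
predictions = zero_shot_classify(locked_image_emb, aligned_text_emb)
```

In this toy setup, text embeddings that already sit close to the locked image embeddings yield a lower loss than unrelated ones. Tuning a text model against a frozen image model pushes in the same direction.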

Code

google-research/vision_transformer (official) · 9,224 stars

google-research/big_vision (official) · 1,539 stars

mlfoundations/open_clip · 8,381 stars

laion-ai/clip_benchmark · 486 stars

Tasks

Image Classification

Retrieval

Zero-Shot Image Classification

Zero-Shot Transfer Image Classification

Datasets

ImageNet

ImageNet-R

ImageNet-A

Conceptual Captions

YFCC100M

ObjectNet

CC12M

JFT-3B

Results from the Paper

Ranked #1 on Zero-Shot Transfer Image Classification on ImageNet ReaL

