Zero-Shot Learning

562 papers with code • 18 benchmarks • 29 datasets

Zero-shot learning (ZSL) is a model's ability to detect classes never seen during training. The condition is that the classes are not known during supervised learning.

Earlier work in zero-shot learning use attributes in a two-step approach to infer unknown classes. In the computer vision context, more recent advances learn mappings from image feature space to semantic space. Other approaches learn non-linear multimodal embeddings. In the modern NLP context, language models can be evaluated on downstream tasks without fine tuning.

Benchmark datasets for zero-shot learning include aPY, AwA, and CUB, among others.

( Image credit: Prototypical Networks for Few shot Learning in PyTorch )

Benchmarks

Add a Result

These leaderboards are used to track progress in Zero-Shot Learning

Dataset	Best Model	Compare
CUB-200-2011	DUET	See all
SUN Attribute	SPOT (VAEGAN)	See all
AwA2	ZSL-KG	See all
Oxford 102 Flower	SPOT	See all
VOC-MLT	CLIP(ResNet-50)	See all
COCO-MLT	ResNet-50	See all
CUB-200 - 0-Shot Learning	zsl_ADA	See all
PASCAL Context	ZS3Net	See all
iVQA	FrozenBiLM	See all
SNIPS	ZSL-KG	See all
aPY - 0-Shot	ZSL-KG	See all
LSMDC	FrozenBiLM	See all
MSRVTT-QA	HiTeA	See all
MSVD-QA	HiTeA	See all
TVQA	FrozenBiLM	See all
MIT-States	CZSL	See all
ImageNet_CN	$M^2$-Encoder	See all
How2QA	SeViLA	See all

Show all 18 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find Zero-Shot Learning models and implementations

mlfoundations/open_clip

3 papers

8,439

faceonlive/ai-research

3 papers

156

alibaba/EasyNLP

2 papers

1,949

sicara/easy-few-shot-learning

2 papers

899

Datasets

Subtasks

Multi-label zero-shot learning

GZSL Video Classification

Most implemented papers

Most implemented Social Latest No code

Learning Transferable Visual Models From Natural Language Supervision

openai/CLIP • • 26 Feb 2021

State-of-the-art computer vision systems are trained to predict a fixed set of predetermined object categories.

Paper
Code

Language Models are Few-Shot Learners

openai/gpt-3 • NeurIPS 2020

By contrast, humans can generally perform a new language task from only a few examples or from simple instructions - something which current NLP systems still largely struggle to do.

Paper
Code

Prototypical Networks for Few-shot Learning

jakesnell/prototypical-networks • • NeurIPS 2017

We propose prototypical networks for the problem of few-shot classification, where a classifier must generalize to new classes not seen in the training set, given only a small number of examples of each new class.

Paper
Code

Learning to Compare: Relation Network for Few-Shot Learning

floodsung/LearningToCompare_FSL • • CVPR 2018

Once trained, a RN is able to classify images of new classes by computing relation scores between query images and the few examples of each new class without further updating the network.

Paper
Code

Learning Deep Representations of Fine-grained Visual Descriptions

hanzhanggit/StackGAN-v2 • • CVPR 2016

State-of-the-art methods for zero-shot visual recognition formulate learning as a joint embedding problem of images and side information.

Paper
Code

Zero-Shot Learning -- A Comprehensive Evaluation of the Good, the Bad and the Ugly

sbharadwajj/embarrassingly-simple-zero-shot-learning • 3 Jul 2017

Due to the importance of zero-shot learning, i. e. classifying images where there is a lack of labeled training data, the number of proposed approaches has recently increased steadily.

Paper
Code

Sampling Matters in Deep Embedding Learning

CompVis/metric-learning-divide-and-conquer • • ICCV 2017

In addition, we show that a simple margin based loss is sufficient to outperform all other loss functions.

Paper
Code

Zero-shot User Intent Detection via Capsule Neural Networks

congyingxia/ZeroShotCapsule • • EMNLP 2018

User intent detection plays a critical role in question-answering and dialog systems.

Paper
Code

CPM: A Large-scale Generative Chinese Pre-trained Language Model

TsinghuaAI/CPM-Generate • • 1 Dec 2020

However, applying GPT-3 to address Chinese NLP tasks is still challenging, as the training corpus of GPT-3 is primarily English, and the parameters are not publicly available.

Paper
Code

Finetuned Language Models Are Zero-Shot Learners

google-research/flan • • ICLR 2022

We show that instruction tuning -- finetuning language models on a collection of tasks described via instructions -- substantially improves zero-shot performance on unseen tasks.

Paper
Code

Zero-Shot Learning

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Most implemented papers

Content

Benchmarks

Add a Result