Zero-Shot Learning

562 papers with code • 18 benchmarks • 29 datasets

Zero-shot learning (ZSL) is a model's ability to detect classes never seen during training. The condition is that the classes are not known during supervised learning.

Earlier work in zero-shot learning use attributes in a two-step approach to infer unknown classes. In the computer vision context, more recent advances learn mappings from image feature space to semantic space. Other approaches learn non-linear multimodal embeddings. In the modern NLP context, language models can be evaluated on downstream tasks without fine tuning.

Benchmark datasets for zero-shot learning include aPY, AwA, and CUB, among others.

( Image credit: Prototypical Networks for Few shot Learning in PyTorch )

Benchmarks

Add a Result

These leaderboards are used to track progress in Zero-Shot Learning

Dataset	Best Model	Compare
CUB-200-2011	DUET	See all
SUN Attribute	SPOT (VAEGAN)	See all
AwA2	ZSL-KG	See all
Oxford 102 Flower	SPOT	See all
VOC-MLT	CLIP(ResNet-50)	See all
COCO-MLT	ResNet-50	See all
CUB-200 - 0-Shot Learning	zsl_ADA	See all
PASCAL Context	ZS3Net	See all
iVQA	FrozenBiLM	See all
SNIPS	ZSL-KG	See all
aPY - 0-Shot	ZSL-KG	See all
LSMDC	FrozenBiLM	See all
MSRVTT-QA	HiTeA	See all
MSVD-QA	HiTeA	See all
TVQA	FrozenBiLM	See all
MIT-States	CZSL	See all
ImageNet_CN	$M^2$-Encoder	See all
How2QA	SeViLA	See all

Show all 18 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find Zero-Shot Learning models and implementations

mlfoundations/open_clip

3 papers

8,439

faceonlive/ai-research

3 papers

152

alibaba/EasyNLP

2 papers

1,949

sicara/easy-few-shot-learning

2 papers

899

Datasets

Subtasks

Multi-label zero-shot learning

GZSL Video Classification

Latest papers

Most implemented Social Latest No code

The Devil is in the Few Shots: Iterative Visual Knowledge Completion for Few-shot Learning

mark-sky/kcl • • 15 Apr 2024

Few-shot learning aims to further enhance the transfer capability of CLIP by giving few images in each class, aka 'few shots'.

15 Apr 2024

Paper
Code

CREST: Cross-modal Resonance through Evidential Deep Learning for Enhanced Zero-Shot Learning

JethroJames/CREST • • 15 Apr 2024

Zero-shot learning (ZSL) enables the recognition of novel classes by leveraging semantic knowledge transfer from known to unknown categories.

15 Apr 2024

Paper
Code

Knowledge-enhanced Visual-Language Pretraining for Computational Pathology

magic-ai4med/kep • • 15 Apr 2024

In this paper, we consider the problem of visual representation learning for computational pathology, by exploiting large-scale image-text pairs gathered from public resources, along with the domain specific knowledge in pathology.

15 Apr 2024

Paper
Code

Audio-Visual Generalized Zero-Shot Learning using Pre-Trained Large Multi-Modal Models

faceonlive/ai-research • 9 Apr 2024

However, existing benchmarks predate the popularization of large multi-modal models, such as CLIP and CLAP.

152

09 Apr 2024

Paper
Code

Forget NLI, Use a Dictionary: Zero-Shot Topic Classification for Low-Resource Languages with Application to Luxembourgish

faceonlive/ai-research • 5 Apr 2024

A common method for ZSC is to fine-tune a language model on a Natural Language Inference (NLI) dataset and then use it to infer the entailment between the input document and the target labels.

152

05 Apr 2024

Paper
Code

Label Propagation for Zero-shot Classification with Vision-Language Models

faceonlive/ai-research • 5 Apr 2024

We leverage the graph structure of the unlabeled data and introduce ZLaP, a method based on label propagation (LP) that utilizes geodesic distances for classification.

152

05 Apr 2024

Paper
Code

Emergent Abilities in Reduced-Scale Generative Language Models

text-machine-lab/mini_gpt • • 2 Apr 2024

Large language models can solve new tasks without task-specific fine-tuning.

02 Apr 2024

Paper
Code

X-MIC: Cross-Modal Instance Conditioning for Egocentric Action Generalization

annusha/xmic • 28 Mar 2024

Lately, there has been growing interest in adapting vision-language models (VLMs) to image and third-person video classification due to their success in zero-shot recognition.

28 Mar 2024

Paper
Code

VLM-CPL: Consensus Pseudo Labels from Vision-Language Models for Human Annotation-Free Pathological Image Classification

lanfz2000/vlm-cpl • • 23 Mar 2024

To address this issue, we introduce VLM-CPL, a novel approach based on consensus pseudo labels that integrates two noisy label filtering techniques with a semi-supervised learning strategy.

23 Mar 2024

Paper
Code

Long-CLIP: Unlocking the Long-Text Capability of CLIP

beichenzbc/long-clip • • 22 Mar 2024

Contrastive Language-Image Pre-training (CLIP) has been the cornerstone for zero-shot classification, text-image retrieval, and text-image generation by aligning image and text modalities.

307

22 Mar 2024

Paper
Code

Zero-Shot Learning

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers

Content

Benchmarks

Add a Result