Compositional Zero-Shot Learning
24 papers with code • 4 benchmarks • 6 datasets
Compositional Zero-Shot Learning (CZSL) is a computer vision task in which the goal is to recognize unseen compositions formed from states and objects seen during training. The key challenge in CZSL is the inherent entanglement between the state and the object within the context of an image. Example benchmarks for this task are MIT-States, UT-Zappos, and C-GQA. Models are usually evaluated with accuracy on both seen and unseen compositions, as well as their Harmonic Mean (HM).
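The evaluation protocol above can be sketched in a few lines. The function below is a minimal, hypothetical helper (not taken from any specific benchmark toolkit) that computes seen accuracy, unseen accuracy, and their harmonic mean from raw counts:

```python
def czsl_metrics(seen_correct, seen_total, unseen_correct, unseen_total):
    """Compute seen/unseen accuracy and their Harmonic Mean (HM).

    HM = 2 * acc_seen * acc_unseen / (acc_seen + acc_unseen),
    which rewards models that do well on *both* splits rather than
    trading one off against the other.
    """
    acc_seen = seen_correct / seen_total
    acc_unseen = unseen_correct / unseen_total
    if acc_seen + acc_unseen == 0:
        return acc_seen, acc_unseen, 0.0
    hm = 2 * acc_seen * acc_unseen / (acc_seen + acc_unseen)
    return acc_seen, acc_unseen, hm
```

For example, a model that gets 80/100 seen and 40/100 unseen compositions right has an HM of about 0.533, noticeably below the plain average of 0.6.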
(Image credit: Heosuab)
Libraries
Use these libraries to find Compositional Zero-Shot Learning models and implementations.
Latest papers
Synthesize, Diagnose, and Optimize: Towards Fine-Grained Vision-Language Understanding
With this in mind, we propose a simple yet effective approach to optimize VLMs in fine-grained understanding, achieving significant improvements on SPEC without compromising the zero-shot performance.
GIPCOL: Graph-Injected Soft Prompting for Compositional Zero-Shot Learning
In this work, we propose GIPCOL (Graph-Injected Soft Prompting for Compositional Learning) to better explore the compositional zero-shot learning (CZSL) ability of VLMs within the prompt-based learning framework.
Hierarchical Visual Primitive Experts for Compositional Zero-Shot Learning
Previous works on CZSL often struggle with the contextuality between attribute and object, the discriminability of visual features, and the long-tailed distribution of real-world compositional data.
Learning Conditional Attributes for Compositional Zero-Shot Learning
Compositional Zero-Shot Learning (CZSL) aims to train models to recognize novel compositional concepts based on learned concepts such as attribute-object combinations.
CAILA: Concept-Aware Intra-Layer Adapters for Compositional Zero-Shot Learning
In this paper, we study the problem of Compositional Zero-Shot Learning (CZSL), which is to recognize novel attribute-object combinations with pre-existing concepts.
Learning Attention as Disentangler for Compositional Zero-shot Learning
The key to CZSL is learning the disentanglement of the attribute-object composition.
Troika: Multi-Path Cross-Modal Traction for Compositional Zero-Shot Learning
Recent compositional zero-shot learning (CZSL) methods adapt pre-trained vision-language models (VLMs) by constructing trainable prompts only for composed state-object pairs.
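The prompt-construction step described above can be illustrated with a small sketch. This is a hypothetical example of building one text prompt per composed state-object pair (the template string and variable names are illustrative, not any paper's actual API):

```python
# Illustrative only: enumerate the composition label space and build one
# CLIP-style text prompt per composed state-object pair.
states = ["sliced", "ripe"]
objects = ["apple", "tomato"]

# In prompt-tuning methods, the template tokens (here a fixed string) would
# typically be replaced by learnable soft-prompt vectors.
prompts = {(s, o): f"a photo of a {s} {o}" for s in states for o in objects}
```

A classifier then scores an image against all such prompts, including pairs never seen together during training.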
Decomposed Soft Prompt Guided Fusion Enhancing for Compositional Zero-Shot Learning
Existing methods either learn a combined state-object representation, which hampers generalization to unseen compositions, or design two classifiers that identify state and object separately from image features, ignoring the intrinsic relationship between them.
Reference-Limited Compositional Zero-Shot Learning
Compositional zero-shot learning (CZSL) refers to recognizing unseen compositions of known visual primitives, which is an essential ability for artificial intelligence systems to learn and understand the world.
Siamese Contrastive Embedding Network for Compositional Zero-Shot Learning
Some methods recognize state and object with two separately trained classifiers, ignoring the interaction between object and state; other methods try to learn a joint representation of the state-object compositions, leading to a domain gap between the seen and unseen composition sets.