Zero-Shot Human-Object Interaction Detection
7 papers with code • 2 benchmarks • 2 datasets
Most implemented papers
RLIPv2: Fast Scaling of Relational Language-Image Pre-training
In this paper, we propose RLIPv2, a fast converging model that enables the scaling of relational pre-training to large-scale pseudo-labelled scene graph data.
ConsNet: Learning Consistency Graph for Zero-Shot Human-Object Interaction Detection
We consider the problem of Human-Object Interaction (HOI) detection, which aims to locate and recognize HOI instances in images in the form of <human, action, object> triplets.
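The <human, action, object> triplet above can be sketched as a small data structure; the class and field names below are illustrative, not taken from any of the listed papers.

```python
from dataclasses import dataclass
from typing import Tuple

# Hypothetical representation of one HOI instance: a human box, an object box
# (both as (x1, y1, x2, y2) pixel coordinates), and the action/object labels.
@dataclass
class HOIInstance:
    human_box: Tuple[float, float, float, float]
    object_box: Tuple[float, float, float, float]
    action: str          # e.g. "ride"
    object_label: str    # e.g. "bicycle"

hoi = HOIInstance((10, 20, 110, 220), (30, 150, 200, 260), "ride", "bicycle")
```

An HOI detector outputs a set of such triplets per image; zero-shot methods must produce them for (action, object) combinations never seen in training.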
End-to-End Zero-Shot HOI Detection via Vision and Language Knowledge Distillation
Extensive experiments on the HICO-Det dataset demonstrate that our model discovers potential interactive pairs and enables the recognition of unseen HOIs.
RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning
This task remains challenging for current deep learning algorithms since it requires addressing three key technical problems jointly: 1) identifying object entities and their properties, 2) inferring semantic relations between pairs of entities, and 3) generalizing to novel object-relation combinations, i.e., systematic generalization.
Boosting Human-Object Interaction Detection with Text-to-Image Diffusion Model
Extensive experiments demonstrate that DiffHOI significantly outperforms the state-of-the-art in regular detection (i.e., 41.50 mAP) and zero-shot detection.
Boosting Zero-Shot Human-Object Interaction Detection with Vision-Language Transfer
Human-Object Interaction (HOI) detection is a crucial task that involves localizing interactive human-object pairs and identifying the actions being performed.
Unseen No More: Unlocking the Potential of CLIP for Generative Zero-shot HOI Detection
Then, we extract real features of seen samples and mix them with synthetic features, allowing the model to be trained on seen and unseen classes jointly.
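The mixing step above can be sketched as pooling real features of seen classes with generated features of unseen classes into one training set; this is a minimal illustration under assumed names, not the paper's implementation.

```python
import random

# Real features extracted from images of seen HOI classes (feature, label).
seen_real = [([0.9, 0.1, 0.3], "seen:ride_bicycle"),
             ([0.8, 0.2, 0.1], "seen:hold_cup")]

# Synthetic features generated for unseen HOI classes, e.g. conditioned on
# text embeddings of the class names (values here are placeholders).
unseen_synth = [([0.2, 0.8, 0.5], "unseen:hold_umbrella")]

# Mix both pools and shuffle, so one classifier trains on seen and unseen
# classes jointly rather than on seen classes alone.
train_pool = seen_real + unseen_synth
random.shuffle(train_pool)
```

The design intent is that the classifier's decision boundary covers unseen classes at test time, instead of collapsing all predictions onto the seen classes.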