TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Human-Object Interaction Detection	HICO-DET	HOTR	mAP	23.46	# 35
Human-Object Interaction Detection	V-COCO	HOTR	AP(S1)	55.2	# 16
Human-Object Interaction Detection	V-COCO	HOTR	AP(S2)	64.4	# 13

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/hotr-end-to-end-human-object-interaction/human-object-interaction-detection-on-v-coco)](https://paperswithcode.com/sota/human-object-interaction-detection-on-v-coco?p=hotr-end-to-end-human-object-interaction)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/hotr-end-to-end-human-object-interaction/human-object-interaction-detection-on-hico)](https://paperswithcode.com/sota/human-object-interaction-detection-on-hico?p=hotr-end-to-end-human-object-interaction)`

HOTR: End-to-End Human-Object Interaction Detection with Transformers

CVPR 2021 · Bumsoo Kim, Junhyun Lee, Jaewoo Kang, Eun-Sol Kim, Hyunwoo J. Kim ·

Human-Object Interaction (HOI) detection is a task of identifying "a set of interactions" in an image, which involves the i) localization of the subject (i.e., humans) and target (i.e., objects) of interaction, and ii) the classification of the interaction labels. Most existing methods have indirectly addressed this task by detecting human and object instances and individually inferring every pair of the detected instances. In this paper, we present a novel framework, referred to by HOTR, which directly predicts a set of <human, object, interaction> triplets from an image based on a transformer encoder-decoder architecture. Through the set prediction, our method effectively exploits the inherent semantic relationships in an image and does not require time-consuming post-processing which is the main bottleneck of existing methods. Our proposed algorithm achieves the state-of-the-art performance in two HOI detection benchmarks with an inference time under 1 ms after object detection.

PDF Abstract CVPR 2021 PDF CVPR 2021 Abstract

Code

Add Remove Mark official

kakaobrain/HOTR official

133

Tasks

Add Remove

Human-Object Interaction Detection

Object

object-detection

Object Detection

Datasets

HICO-DET

V-COCO

Results from the Paper

Edit

Ranked #16 on Human-Object Interaction Detection on V-COCO

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Human-Object Interaction Detection	HICO-DET	HOTR	mAP	23.46	# 35	Compare
Human-Object Interaction Detection	V-COCO	HOTR	AP(S1)	55.2	# 16	Compare
Human-Object Interaction Detection	V-COCO	HOTR	AP(S2)	64.4	# 13	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

HOTR: End-to-End Human-Object Interaction Detection with Transformers

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove