Human-Object Interaction Detection
132 papers with code • 6 benchmarks • 22 datasets
Human-Object Interaction (HOI) detection is the task of identifying a set of interactions in an image. It involves (i) localizing the subject (the human) and the target (the object) of each interaction, and (ii) classifying the interaction label.
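As a rough illustration of the definition above, the output for one image can be represented as a list of records, each pairing a human box and an object box with an interaction label. This is only a minimal sketch: the class name HOIInstance, the helper interactions_in_image, the (x1, y1, x2, y2) box layout, the object_label field, the confidence score, and the 0.5 threshold are all assumptions made for illustration and are not taken from any particular paper or library.

```python
from dataclasses import dataclass
from typing import List, Tuple

# Assumed box layout: (x1, y1, x2, y2) corner coordinates in pixels.
Box = Tuple[float, float, float, float]


@dataclass(frozen=True)
class HOIInstance:
    """One detected interaction (hypothetical structure for illustration)."""
    human_box: Box      # localization of the subject (the human)
    object_box: Box     # localization of the target (the object)
    object_label: str   # assumed extra field: category of the target object
    interaction: str    # classified interaction label, e.g. "ride"
    score: float = 1.0  # assumed confidence score in [0, 1]


def interactions_in_image(instances: List[HOIInstance],
                          min_score: float = 0.5) -> List[Tuple[str, str]]:
    """Return (interaction, object_label) pairs whose score passes the threshold."""
    return [(inst.interaction, inst.object_label)
            for inst in instances
            if inst.score >= min_score]


# Made-up detections for a single image: the same person paired with two objects.
detections = [
    HOIInstance((10, 20, 110, 220), (90, 150, 200, 230), "bicycle", "ride", 0.9),
    HOIInstance((10, 20, 110, 220), (300, 40, 360, 100), "cup", "hold", 0.2),
]

print(interactions_in_image(detections))
```

Keeping the two boxes and the label in one record mirrors the two parts of the task: the boxes carry the localization and the interaction field carries the classification. The same human box may appear in several records, since one person can interact with more than one object.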
Latest papers with no code
FreeA: Human-object Interaction Detection using Free Annotation Labels
Recent human-object interaction (HOI) detection approaches are labor-intensive and require comprehensively annotated image datasets.
Zero-Shot Learning for the Primitives of 3D Affordance in General Objects
One of the major challenges in AI is teaching machines to respond precisely to and utilize environmental functionalities, thereby achieving the affordance awareness that humans possess.
ParaHome: Parameterizing Everyday Home Activities Towards 3D Generative Modeling of Human-Object Interactions
To address this challenge, we introduce the ParaHome system, designed to capture and parameterize dynamic 3D movements of humans and objects within a common home environment.
AffordanceLLM: Grounding Affordance from Vision Language Models
Affordance grounding refers to the task of finding the area of an object with which one can interact.
Exploring Self- and Cross-Triplet Correlations for Human-Object Interaction Detection
In this work, we propose to explore Self- and Cross-Triplet Correlations (SCTC) for HOI detection.
RHOBIN Challenge: Reconstruction of Human Object Interaction
Modeling the interaction between humans and objects has been an emerging research direction in recent years.
UnionDet: Union-Level Detector Towards Real-Time Human-Object Interaction Detection
This is a major bottleneck in HOI detection inference time.
Primitive-based 3D Human-Object Interaction Modelling and Programming
To explore an effective embedding of HAOI for the machine, we build a new benchmark on 3D HAOI consisting of primitives together with their images and propose a task requiring machines to recover 3D HAOI using primitives from images.
Few-Shot Learning from Augmented Label-Uncertain Queries in Bongard-HOI
In our proposed method, we introduce novel label-uncertain query augmentation techniques to enhance the diversity of the query inputs, aiming to distinguish the positive HOI class from the negative ones.
LEMON: Learning 3D Human-Object Interaction Relation from 2D Images
Which underexploit certain correlations between the interaction counterparts (human and object), and struggle to address the uncertainty in interactions.