Human Interaction Recognition
8 papers with code • 8 benchmarks • 8 datasets
Human Interaction Recognition (HIR) is a field of study that develops computer algorithms to detect and recognize human interactions in videos, images, and other multimedia content. The goal of HIR is to automatically identify and analyze social interactions between people, including their body language and facial expressions.
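As a toy illustration of the simplest possible interaction cue, the sketch below flags a "close interaction" between two people from 2D pose keypoints. The keypoint layout, the function names, and the distance threshold are all illustrative assumptions, not taken from any specific HIR paper; real models learn far richer relational features.

```python
# Toy sketch: flag a "close interaction" between two people from 2D pose
# keypoints. Layout and threshold are illustrative assumptions only.
import math

def min_joint_distance(pose_a, pose_b):
    """Smallest Euclidean distance between any joint of person A and any joint of person B."""
    return min(math.dist(ja, jb) for ja in pose_a for jb in pose_b)

def is_close_interaction(pose_a, pose_b, threshold=0.5):
    """A crude proximity cue; learned HIR models replace this with relational reasoning."""
    return min_joint_distance(pose_a, pose_b) < threshold

# Example: two 3-joint skeletons in normalized image coordinates.
a = [(0.2, 0.5), (0.25, 0.6), (0.3, 0.7)]
b = [(0.35, 0.7), (0.6, 0.6), (0.7, 0.5)]
print(is_close_interaction(a, b))  # True: the closest joints are 0.05 apart
```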
Most implemented papers
Slow-Fast Auditory Streams For Audio Recognition
We propose a two-stream convolutional network for audio recognition that operates on time-frequency spectrogram inputs.
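The slow/fast idea can be sketched minimally on a spectrogram-like input: the fast stream keeps full temporal resolution, while the slow stream subsamples frames to trade temporal detail for context. The stride value and shapes below are illustrative assumptions, not the paper's actual configuration.

```python
# Minimal sketch of a slow/fast stream split on a spectrogram-like input.
# Stride and shapes are illustrative assumptions, not the paper's settings.

def split_streams(spectrogram, slow_stride=4):
    """spectrogram: list of time frames, each a list of frequency-bin energies."""
    fast = spectrogram                 # full frame rate: fine temporal detail
    slow = spectrogram[::slow_stride]  # subsampled frames: coarser, wider context
    return slow, fast

# 16 frames x 8 frequency bins of dummy energy values.
spec = [[float(t + f) for f in range(8)] for t in range(16)]
slow, fast = split_streams(spec)
print(len(slow), len(fast))  # 4 16
```

In the actual architecture each stream feeds its own convolutional pathway before fusion; here only the temporal-resolution split is shown.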
Interaction Relational Network for Mutual Action Recognition
Our solution is able to achieve state-of-the-art performance on the traditional interaction recognition datasets SBU and UT, and also on the mutual actions from the large-scale dataset NTU RGB+D.
Two-person Graph Convolutional Network for Skeleton-based Human Interaction Recognition
To overcome the above shortcoming, we introduce a novel unified two-person graph to represent inter-body and intra-body correlations between joints.
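A unified two-person graph can be sketched as a single adjacency matrix over 2*N joints. The intra-body edge list below is a simplified, hypothetical skeleton, and the inter-body scheme (linking each joint to its counterpart on the other body) is one plausible choice; the paper's actual graph construction may differ.

```python
# Sketch: unified two-person skeleton graph as a (2*N x 2*N) adjacency matrix.
# Intra-body edges use a toy skeleton; inter-body edges link corresponding
# joints across the two bodies. Both choices are illustrative assumptions.

N_JOINTS = 5  # toy skeleton size; real skeletons (e.g. NTU RGB+D) use 25 joints
INTRA_EDGES = [(0, 1), (1, 2), (1, 3), (1, 4)]  # illustrative bone connections

def build_two_person_adjacency():
    size = 2 * N_JOINTS
    adj = [[0] * size for _ in range(size)]
    for person in (0, 1):
        offset = person * N_JOINTS
        for i, j in INTRA_EDGES:           # intra-body correlations
            adj[offset + i][offset + j] = 1
            adj[offset + j][offset + i] = 1
    for j in range(N_JOINTS):              # inter-body correlations
        adj[j][N_JOINTS + j] = 1
        adj[N_JOINTS + j][j] = 1
    return adj

adj = build_two_person_adjacency()
print(sum(sum(row) for row in adj))  # 26: (4+4 intra + 5 inter edges) x 2 for symmetry
```

A graph convolutional network would then propagate joint features over this matrix, so inter-body and intra-body correlations are handled in one pass.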
Interactive Spatiotemporal Token Attention Network for Skeleton-based General Interactive Action Recognition
To address these problems, we propose an Interactive Spatiotemporal Token Attention Network (ISTA-Net), which simultaneously models spatial, temporal, and interactive relations.
Learning Mutual Excitation for Hand-to-Hand and Human-to-Human Interaction Recognition
Recognizing interactive actions, including hand-to-hand interaction and human-to-human interaction, has attracted increasing attention for various applications in the field of video analysis and human-robot interaction.
SkateFormer: Skeletal-Temporal Transformer for Human Action Recognition
We categorize the key skeletal-temporal relations for action recognition into a total of four distinct types.
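One way to picture four skeletal-temporal relation types is as the cross product of a joint split and a frame split. The grouping below (near vs. distant joints crossed with near vs. distant frames) and its thresholds are a toy, illustrative reading, not the paper's exact partitioning scheme.

```python
# Illustrative partition of (joint, frame) index pairs into four
# skeletal-temporal relation types: {local, distant} joints x {local, distant}
# frames. Thresholds and labels are toy assumptions, not the paper's scheme.

def relation_type(j1, t1, j2, t2, joint_radius=1, frame_radius=2):
    near_joint = abs(j1 - j2) <= joint_radius
    near_frame = abs(t1 - t2) <= frame_radius
    if near_joint and near_frame:
        return "local-joint/local-frame"
    if near_joint:
        return "local-joint/distant-frame"
    if near_frame:
        return "distant-joint/local-frame"
    return "distant-joint/distant-frame"

print(relation_type(0, 0, 1, 1))   # local-joint/local-frame
print(relation_type(0, 0, 5, 10))  # distant-joint/distant-frame
```

A transformer could then restrict each attention head to one relation type, so every head specializes in one kind of skeletal-temporal dependency.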
Empathic Grounding: Explorations using Multimodal Interaction and Large Language Models with Conversational Agents
We introduce the concept of "empathic grounding" in conversational agents as an extension of Clark's conceptualization of grounding in conversation in which the grounding criterion includes listener empathy for the speaker's affective state.
CHASE: Learning Convex Hull Adaptive Shift for Skeleton-based Multi-Entity Action Recognition
To this end, we introduce a Convex Hull Adaptive Shift based multi-Entity action recognition method (CHASE), which mitigates inter-entity distribution gaps and unbiases subsequent backbones.