3D Action Recognition

34 papers with code • 3 benchmarks • 14 datasets

Image: Rahmani et al

Libraries

Use these libraries to find 3D Action Recognition models and implementations
2 papers
3,892

On the Utility of 3D Hand Poses for Action Recognition

s-shamil/HandFormer 14 Mar 2024

3D hand poses are an under-explored modality for action recognition.

1
14 Mar 2024

Masked Motion Predictors are Strong 3D Action Representation Learners

maoyunyao/mamp ICCV 2023

To be specific, the proposed MAMP takes as input the masked spatio-temporal skeleton sequence and predicts the corresponding temporal motion of the masked human joints.

30
14 Aug 2023

Interactive Spatiotemporal Token Attention Network for Skeleton-based General Interactive Action Recognition

Necolizer/ISTA-Net 14 Jul 2023

To address these problems, we propose an Interactive Spatiotemporal Token Attention Network (ISTA-Net), which simultaneously model spatial, temporal, and interactive relations.

10
14 Jul 2023

CMD: Self-supervised 3D Action Representation Learning with Cross-modal Mutual Distillation

maoyunyao/cmd 26 Aug 2022

In this work, we formulate the cross-modal interaction as a bidirectional knowledge distillation problem.

35
26 Aug 2022

Collaborating Domain-shared and Target-specific Feature Clustering for Cross-domain 3D Action Recognition

canbaoburen/CoDT 20 Jul 2022

Furthermore, to leverage the complementarity of domain-shared features and target-specific features, we propose a novel collaborative clustering strategy to enforce pair-wise relationship consistency between the two branches.

73
20 Jul 2022

Multi-Scale Spatial Temporal Graph Convolutional Network for Skeleton-Based Action Recognition

czhaneva/mst-gcn 27 Jun 2022

To solve this problem, we present a multi-scale spatial graph convolution (MS-GC) module and a multi-scale temporal graph convolution (MT-GC) module to enrich the receptive field of the model in spatial and temporal dimensions.

25
27 Jun 2022

PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences

hehefan/Point-Spatio-Temporal-Convolution ICLR 2021

Then, a spatial convolution is employed to capture the local structure of points in the 3D space, and a temporal convolution is used to model the dynamics of the spatial regions along the time dimension.

88
27 May 2022

Assembly101: A Large-Scale Multi-View Video Dataset for Understanding Procedural Activities

assembly101/assembly101.github.io CVPR 2022

Assembly101 is a new procedural activity dataset featuring 4321 videos of people assembling and disassembling 101 "take-apart" toy vehicles.

1
28 Mar 2022

Domain Knowledge-Informed Self-Supervised Representations for Workout Form Assessment

ParitoshParmar/Fitness-AQA 28 Feb 2022

To that end, we propose to learn exercise-oriented image and video representations from unlabeled samples such that a small dataset annotated by experts suffices for supervised error detection.

35
28 Feb 2022