3D Action Recognition

34 papers with code • 3 benchmarks • 14 datasets

Benchmarks

These leaderboards are used to track progress in 3D Action Recognition.

Dataset	Best Model
Assembly101	HandFormer-B/21
NTU RGB+D	Kinet
100 sleep nights of 8 caregivers	htf

Libraries

Use these libraries to find 3D Action Recognition models and implementations.

open-mmlab/mmaction2 • 2 papers • 3,892 stars

Latest papers

On the Utility of 3D Hand Poses for Action Recognition

s-shamil/HandFormer • 14 Mar 2024

3D hand poses are an under-explored modality for action recognition.

Masked Motion Predictors are Strong 3D Action Representation Learners

maoyunyao/mamp • ICCV 2023

To be specific, the proposed MAMP takes as input the masked spatio-temporal skeleton sequence and predicts the corresponding temporal motion of the masked human joints.

14 Aug 2023
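
The masked-motion objective summarized in this entry can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. It assumes uniform random masking over (frame, joint) tokens, defines motion as the frame-to-frame joint displacement, and leaves out the prediction network entirely. The names `masked_motion_targets` and `masked_motion_loss` are hypothetical.

```python
import numpy as np


def masked_motion_targets(skeleton, mask_ratio=0.5, seed=0):
    """Build a masked input and motion targets from a skeleton sequence.

    skeleton: array of shape (T, J, 3) with T frames and J joints.
    Returns (masked_input, motion, mask), where mask has shape (T, J).
    """
    rng = np.random.default_rng(seed)
    T, J, _ = skeleton.shape

    # Temporal motion: displacement of each joint between consecutive frames.
    # The first frame has no predecessor, so its motion is set to zero.
    motion = np.zeros_like(skeleton)
    motion[1:] = skeleton[1:] - skeleton[:-1]

    # Assumption: uniform random masking over (frame, joint) tokens.
    num_masked = int(round(mask_ratio * T * J))
    flat = np.zeros(T * J, dtype=bool)
    flat[rng.choice(T * J, size=num_masked, replace=False)] = True
    mask = flat.reshape(T, J)

    # Masked joints are zeroed out in the input that a model would see.
    masked_input = skeleton.copy()
    masked_input[mask] = 0.0
    return masked_input, motion, mask


def masked_motion_loss(pred, motion, mask):
    """Mean squared error on the masked tokens only."""
    return float(np.mean((pred[mask] - motion[mask]) ** 2))
```

A model would take `masked_input` and be trained to minimize `masked_motion_loss` between its prediction and `motion`, so only the hidden joints contribute to the loss.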

Interactive Spatiotemporal Token Attention Network for Skeleton-based General Interactive Action Recognition

Necolizer/ISTA-Net • 14 Jul 2023

To address these problems, we propose an Interactive Spatiotemporal Token Attention Network (ISTA-Net), which simultaneously models spatial, temporal, and interactive relations.

CMD: Self-supervised 3D Action Representation Learning with Cross-modal Mutual Distillation

maoyunyao/cmd • 26 Aug 2022

In this work, we formulate the cross-modal interaction as a bidirectional knowledge distillation problem.

Collaborating Domain-shared and Target-specific Feature Clustering for Cross-domain 3D Action Recognition

canbaoburen/CoDT • 20 Jul 2022

Furthermore, to leverage the complementarity of domain-shared features and target-specific features, we propose a novel collaborative clustering strategy to enforce pair-wise relationship consistency between the two branches.

Multi-Scale Spatial Temporal Graph Convolutional Network for Skeleton-Based Action Recognition

czhaneva/mst-gcn • 27 Jun 2022

To solve this problem, we present a multi-scale spatial graph convolution (MS-GC) module and a multi-scale temporal graph convolution (MT-GC) module to enrich the receptive field of the model in spatial and temporal dimensions.

PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences

hehefan/Point-Spatio-Temporal-Convolution • ICLR 2021

Then, a spatial convolution is employed to capture the local structure of points in the 3D space, and a temporal convolution is used to model the dynamics of the spatial regions along the time dimension.

27 May 2022

Assembly101: A Large-Scale Multi-View Video Dataset for Understanding Procedural Activities

assembly101/assembly101.github.io • CVPR 2022

Assembly101 is a new procedural activity dataset featuring 4321 videos of people assembling and disassembling 101 "take-apart" toy vehicles.

28 Mar 2022

No Pain, Big Gain: Classify Dynamic Point Cloud Sequences with Static Models by Fitting Feature-level Space-time Surfaces

jx-zhong-for-academic-purpose/kinet • CVPR 2022

Scene flow is a powerful tool for capturing the motion field of 3D point clouds.

21 Mar 2022

Domain Knowledge-Informed Self-Supervised Representations for Workout Form Assessment

ParitoshParmar/Fitness-AQA • 28 Feb 2022

To that end, we propose to learn exercise-oriented image and video representations from unlabeled samples such that a small dataset annotated by experts suffices for supervised error detection.
