TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Action Classification	Charades	PA3D + (GCN + I3D + NL I3D)	MAP	41	# 31
Skeleton Based Action Recognition	J-HMDB	PA3D	Accuracy (RGB+pose)	69.5	# 8
Skeleton Based Action Recognition	J-HMDB	PA3D+RPAN	Accuracy (RGB+pose)	86.1	# 2

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/pa3d-pose-action-3d-machine-for-video/skeleton-based-action-recognition-on-j-hmdb)](https://paperswithcode.com/sota/skeleton-based-action-recognition-on-j-hmdb?p=pa3d-pose-action-3d-machine-for-video)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/pa3d-pose-action-3d-machine-for-video/action-classification-on-charades)](https://paperswithcode.com/sota/action-classification-on-charades?p=pa3d-pose-action-3d-machine-for-video)`

PA3D: Pose-Action 3D Machine for Video Recognition

CVPR 2019 · An Yan, Yali Wang, Zhifeng Li, Yu Qiao ·

Recent studies have witnessed the successes of using 3D CNNs for video action recognition. However, most 3D models are built upon RGB and optical flow streams, which may not fully exploit pose dynamics, i.e., an important cue of modeling human actions. To fill this gap, we propose a concise Pose-Action 3D Machine (PA3D), which can effectively encode multiple pose modalities within a unified 3D framework, and consequently learn spatio-temporal pose representations for action recognition. More specifically, we introduce a novel temporal pose convolution to aggregate spatial poses over frames. Unlike the classical temporal convolution, our operation can explicitly learn the pose motions that are discriminative to recognize human actions. Extensive experiments on three popular benchmarks (i.e., JHMDB, HMDB, and Charades) show that, PA3D outperforms the recent pose-based approaches. Furthermore, PA3D is highly complementary to the recent 3D CNNs, e.g., I3D. Multi-stream fusion achieves the state-of-the-art performance on all evaluated data sets.

PDF Abstract

Code

Add Remove Mark official

No code implementations yet. Submit your code now

Tasks

Add Remove

Action Recognition

Optical Flow Estimation

Skeleton Based Action Recognition

Temporal Action Localization

Video Recognition

Datasets

Kinetics

Charades

JHMDB

Results from the Paper

Add Remove

Ranked #2 on Skeleton Based Action Recognition on J-HMDB

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Action Classification	Charades	PA3D + (GCN + I3D + NL I3D)	MAP	41	# 31	Compare
Skeleton Based Action Recognition	J-HMDB	PA3D	Accuracy (RGB+pose)	69.5	# 8	Compare
Skeleton Based Action Recognition	J-HMDB	PA3D+RPAN	Accuracy (RGB+pose)	86.1	# 2	Compare

Methods

Add Remove

Convolution

Edit Social Preview

PA3D: Pose-Action 3D Machine for Video Recognition

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit Add Remove

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Add Remove

Methods

Add Remove