TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK	REMOVE
3D Human Pose Estimation	Human3.6M	2D-3D-Softargmax (multi-crop + h.flip)	Average MPJPE (mm)	53.2	# 205
Action Recognition In Videos	NTU RGB+D	2D-3D-Softargmax (RGB only)	Accuracy (CS)	85.5	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/2d3d-pose-estimation-and-action-recognition/action-recognition-in-videos-on-ntu-rgb-d)](https://paperswithcode.com/sota/action-recognition-in-videos-on-ntu-rgb-d?p=2d3d-pose-estimation-and-action-recognition)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/2d3d-pose-estimation-and-action-recognition/3d-human-pose-estimation-on-human36m)](https://paperswithcode.com/sota/3d-human-pose-estimation-on-human36m?p=2d3d-pose-estimation-and-action-recognition)`

2D/3D Pose Estimation and Action Recognition using Multitask Deep Learning

CVPR 2018 · Diogo C. Luvizon, David Picard, Hedi Tabia ·

Action recognition and human pose estimation are closely related but both problems are generally handled as distinct tasks in the literature. In this work, we propose a multitask framework for jointly 2D and 3D pose estimation from still images and human action recognition from video sequences. We show that a single architecture can be used to solve the two problems in an efficient way and still achieves state-of-the-art results. Additionally, we demonstrate that optimization from end-to-end leads to significantly higher accuracy than separated learning. The proposed architecture can be trained with data from different categories simultaneously in a seamlessly way. The reported results on four datasets (MPII, Human3.6M, Penn Action and NTU) demonstrate the effectiveness of our method on the targeted tasks.

PDF Abstract CVPR 2018 PDF CVPR 2018 Abstract

Code

Add Remove Mark official

dluvizon/deephar

372

pminhtam/2D-3D_Multitask_Deep_Learn…

Tasks

Add Remove

3D Human Pose Estimation

3D Pose Estimation

Action Recognition

Action Recognition In Videos

Pose Estimation

Temporal Action Localization

Datasets

Human3.6M

MPII

NTU RGB+D

MPII Human Pose

Penn Action

Results from the Paper

Edit

Ranked #1 on Action Recognition In Videos on NTU RGB+D

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Result	Benchmark
3D Human Pose Estimation	Human3.6M	2D-3D-Softargmax (multi-crop + h.flip)	Average MPJPE (mm)	53.2	# 205		Compare
Action Recognition In Videos	NTU RGB+D	2D-3D-Softargmax (RGB only)	Accuracy (CS)	85.5	# 1		Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

2D/3D Pose Estimation and Action Recognition using Multitask Deep Learning

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove