TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
3D Human Pose Estimation	3DPW	DSD-SATN	PA-MPJPE	69.5	# 111
3D Human Pose Estimation	Human3.6M	DSD+SATN	Average MPJPE (mm)	59.1	# 242
3D Human Pose Estimation	Human3.6M	DSD+SATN	PA-MPJPE	42.4	# 70
3D Human Pose Estimation	Human3.6M	DSD+SATN	Acceleration Error	6.8	# 12

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/human-mesh-recovery-from-monocular-images-via/3d-human-pose-estimation-on-3dpw)](https://paperswithcode.com/sota/3d-human-pose-estimation-on-3dpw?p=human-mesh-recovery-from-monocular-images-via)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/human-mesh-recovery-from-monocular-images-via/3d-human-pose-estimation-on-human36m)](https://paperswithcode.com/sota/3d-human-pose-estimation-on-human36m?p=human-mesh-recovery-from-monocular-images-via)`

Human Mesh Recovery from Monocular Images via a Skeleton-disentangled Representation

ICCV 2019 · Sun Yu, Ye Yun, Liu Wu, Gao Wenpeng, Fu YiLi, Mei Tao ·

We describe an end-to-end method for recovering 3D human body mesh from single images and monocular videos. Different from the existing methods try to obtain all the complex 3D pose, shape, and camera parameters from one coupling feature, we propose a skeleton-disentangling based framework, which divides this task into multi-level spatial and temporal granularity in a decoupling manner. In spatial, we propose an effective and pluggable "disentangling the skeleton from the details" (DSD) module. It reduces the complexity and decouples the skeleton, which lays a good foundation for temporal modeling. In temporal, the self-attention based temporal convolution network is proposed to efficiently exploit the short and long-term temporal cues. Furthermore, an unsupervised adversarial training strategy, temporal shuffles and order recovery, is designed to promote the learning of motion dynamics. The proposed method outperforms the state-of-the-art 3D human mesh recovery methods by 15.4% MPJPE and 23.8% PA-MPJPE on Human3.6M. State-of-the-art results are also achieved on the 3D pose in the wild (3DPW) dataset without any fine-tuning. Especially, ablation studies demonstrate that skeleton-disentangled representation is crucial for better temporal modeling and generalization.

PDF Abstract ICCV 2019 PDF ICCV 2019 Abstract

Code

Add Remove Mark official

Arthur151/DSD-SATN official

132

JDAI-CV/DSD-SATN

132

Tasks

Add Remove

3D Human Pose Estimation

3D Pose Estimation

Human Mesh Recovery

Datasets

Human3.6M

MPII

3DPW

LSP

Results from the Paper

Edit

Ranked #111 on 3D Human Pose Estimation on 3DPW (PA-MPJPE metric, using extra training data)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
3D Human Pose Estimation	3DPW	DSD-SATN	PA-MPJPE	69.5	# 111	Compare
3D Human Pose Estimation	Human3.6M	DSD+SATN	Average MPJPE (mm)	59.1	# 242	Compare
			PA-MPJPE	42.4	# 70	Compare
			Acceleration Error	6.8	# 12	Compare

Methods

Add Remove

Convolution

Edit Social Preview

Human Mesh Recovery from Monocular Images via a Skeleton-disentangled Representation

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove