TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
3D Human Pose Estimation	3DPW	MAED	PA-MPJPE	45.7	# 39
3D Human Pose Estimation	3DPW	MAED	MPJPE	79.1	# 53
3D Human Pose Estimation	3DPW	MAED	MPVPE	92.6	# 40
3D Human Pose Estimation	3DPW	MAED	Acceleration Error	17.6	# 18
3D Human Pose Estimation	Human3.6M	MAED	Average MPJPE (mm)	56.4	# 226
3D Human Pose Estimation	Human3.6M	MAED	PA-MPJPE	38.7	# 43
3D Human Pose Estimation	MPI-INF-3DHP	MAED	MPJPE	83.6	# 40
3D Human Pose Estimation	MPI-INF-3DHP	MAED	PA-MPJPE	56.2	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/encoder-decoder-with-multi-level-attention/3d-human-pose-estimation-on-3dpw)](https://paperswithcode.com/sota/3d-human-pose-estimation-on-3dpw?p=encoder-decoder-with-multi-level-attention)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/encoder-decoder-with-multi-level-attention/3d-human-pose-estimation-on-mpi-inf-3dhp)](https://paperswithcode.com/sota/3d-human-pose-estimation-on-mpi-inf-3dhp?p=encoder-decoder-with-multi-level-attention)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/encoder-decoder-with-multi-level-attention/3d-human-pose-estimation-on-human36m)](https://paperswithcode.com/sota/3d-human-pose-estimation-on-human36m?p=encoder-decoder-with-multi-level-attention)`

Encoder-decoder with Multi-level Attention for 3D Human Shape and Pose Estimation

ICCV 2021 · Ziniu Wan, Zhengjia Li, Maoqing Tian, Jianbo Liu, Shuai Yi, Hongsheng Li ·

3D human shape and pose estimation is the essential task for human motion analysis, which is widely used in many 3D applications. However, existing methods cannot simultaneously capture the relations at multiple levels, including spatial-temporal level and human joint level. Therefore they fail to make accurate predictions in some hard scenarios when there is cluttered background, occlusion, or extreme pose. To this end, we propose Multi-level Attention Encoder-Decoder Network (MAED), including a Spatial-Temporal Encoder (STE) and a Kinematic Topology Decoder (KTD) to model multi-level attentions in a unified framework. STE consists of a series of cascaded blocks based on Multi-Head Self-Attention, and each block uses two parallel branches to learn spatial and temporal attention respectively. Meanwhile, KTD aims at modeling the joint level attention. It regards pose estimation as a top-down hierarchical process similar to SMPL kinematic tree. With the training set of 3DPW, MAED outperforms previous state-of-the-art methods by 6.2, 7.2, and 2.4 mm of PA-MPJPE on the three widely used benchmarks 3DPW, MPI-INF-3DHP, and Human3.6M respectively. Our code is available at https://github.com/ziniuwan/maed.

PDF Abstract ICCV 2021 PDF ICCV 2021 Abstract

Code

Add Remove Mark official

ziniuwan/maed official

203

Tasks

Add Remove

3D Absolute Human Pose Estimation

3D Human Pose Estimation

Pose Estimation

Datasets

Human3.6M

3DPW

MPI-INF-3DHP

Results from the Paper

Edit

Ranked #40 on 3D Human Pose Estimation on MPI-INF-3DHP

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
3D Human Pose Estimation	3DPW	MAED	PA-MPJPE	45.7	# 39	Compare
			MPJPE	79.1	# 53	Compare
			MPVPE	92.6	# 40	Compare
			Acceleration Error	17.6	# 18	Compare
3D Human Pose Estimation	Human3.6M	MAED	Average MPJPE (mm)	56.4	# 226	Compare
3D Human Pose Estimation	Human3.6M	MAED	PA-MPJPE	38.7	# 43	Compare
3D Human Pose Estimation	MPI-INF-3DHP	MAED	MPJPE	83.6	# 40	Compare
3D Human Pose Estimation	MPI-INF-3DHP	MAED	PA-MPJPE	56.2	# 1	Compare

Methods

Add Remove

Temporal attention

Edit Social Preview

Encoder-decoder with Multi-level Attention for 3D Human Shape and Pose Estimation

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove