TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
3D Human Pose Estimation	3DPW	MION	PA-MPJPE	52.34	# 69
3D Human Pose Estimation	3DPW	MION	MPJPE	81.98	# 64
3D Human Pose Estimation	Human3.6M	MION	Average MPJPE (mm)	56.88	# 230
3D Human Pose Estimation	Human3.6M	MION	PA-MPJPE	41.59	# 64

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/multi-initialization-optimization-network-for/3d-human-pose-estimation-on-3dpw)](https://paperswithcode.com/sota/3d-human-pose-estimation-on-3dpw?p=multi-initialization-optimization-network-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/multi-initialization-optimization-network-for/3d-human-pose-estimation-on-human36m)](https://paperswithcode.com/sota/3d-human-pose-estimation-on-human36m?p=multi-initialization-optimization-network-for)`

Multi-initialization Optimization Network for Accurate 3D Human Pose and Shape Estimation

24 Dec 2021 · Zhiwei Liu, Xiangyu Zhu, Lu Yang, Xiang Yan, Ming Tang, Zhen Lei, Guibo Zhu, Xuetao Feng, Yan Wang, Jinqiao Wang ·

3D human pose and shape recovery from a monocular RGB image is a challenging task. Existing learning based methods highly depend on weak supervision signals, e.g. 2D and 3D joint location, due to the lack of in-the-wild paired 3D supervision. However, considering the 2D-to-3D ambiguities existed in these weak supervision labels, the network is easy to get stuck in local optima when trained with such labels. In this paper, we reduce the ambituity by optimizing multiple initializations. Specifically, we propose a three-stage framework named Multi-Initialization Optimization Network (MION). In the first stage, we strategically select different coarse 3D reconstruction candidates which are compatible with the 2D keypoints of input sample. Each coarse reconstruction can be regarded as an initialization leads to one optimization branch. In the second stage, we design a mesh refinement transformer (MRT) to respectively refine each coarse reconstruction result via a self-attention mechanism. Finally, a Consistency Estimation Network (CEN) is proposed to find the best result from mutiple candidates by evaluating if the visual evidence in RGB image matches a given 3D reconstruction. Experiments demonstrate that our Multi-Initialization Optimization Network outperforms existing 3D mesh based methods on multiple public benchmarks.

PDF Abstract

Code

Add Remove Mark official

No code implementations yet. Submit your code now

Tasks

Add Remove

3D human pose and shape estimation

3D Human Pose Estimation

3D Reconstruction

Datasets

MS COCO

LSUN

Human3.6M

MPII

3DPW

AMASS

MPI-INF-3DHP

DensePose

LSP

SURREAL

Results from the Paper

Edit

Ranked #64 on 3D Human Pose Estimation on 3DPW (MPJPE metric)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
3D Human Pose Estimation	3DPW	MION	PA-MPJPE	52.34	# 69	Compare
3D Human Pose Estimation	3DPW	MION	MPJPE	81.98	# 64	Compare
3D Human Pose Estimation	Human3.6M	MION	Average MPJPE (mm)	56.88	# 230	Compare
3D Human Pose Estimation	Human3.6M	MION	PA-MPJPE	41.59	# 64	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Multi-initialization Optimization Network for Accurate 3D Human Pose and Shape Estimation

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove