TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
3D Human Pose Estimation	Human3.6M	Stereoscopic View Synthesis Subnetwork (GTt)	Average MPJPE (mm)	37.6	# 64
3D Human Pose Estimation	Human3.6M	Stereoscopic View Synthesis Subnetwork (GTt)	Using 2D ground-truth joints	Yes	# 2
3D Human Pose Estimation	Human3.6M	Stereoscopic View Synthesis Subnetwork (GTt)	Multi-View or Monocular	Multi-View	# 1
3D Human Pose Estimation	Human3.6M	Stereoscopic View Synthesis Subnetwork	Average MPJPE (mm)	58	# 237
3D Human Pose Estimation	Human3.6M	Stereoscopic View Synthesis Subnetwork	Using 2D ground-truth joints	No	# 2
3D Human Pose Estimation	Human3.6M	Stereoscopic View Synthesis Subnetwork	Multi-View or Monocular	Multi-View	# 1
3D Human Pose Estimation	MPI-INF-3DHP	Stereoscopic View Synthesis Subnetwork	AUC	33.8	# 75
3D Human Pose Estimation	MPI-INF-3DHP	Stereoscopic View Synthesis Subnetwork	PCK	71.2	# 79

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/generalizing-monocular-3d-human-pose/3d-human-pose-estimation-on-human36m)](https://paperswithcode.com/sota/3d-human-pose-estimation-on-human36m?p=generalizing-monocular-3d-human-pose)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/generalizing-monocular-3d-human-pose/3d-human-pose-estimation-on-mpi-inf-3dhp)](https://paperswithcode.com/sota/3d-human-pose-estimation-on-mpi-inf-3dhp?p=generalizing-monocular-3d-human-pose)`

Generalizing Monocular 3D Human Pose Estimation in the Wild

11 Apr 2019 · Luyang Wang, Yan Chen, Zhenhua Guo, Keyuan Qian, Mude Lin, Hongsheng Li, Jimmy S. Ren ·

The availability of the large-scale labeled 3D poses in the Human3.6M dataset plays an important role in advancing the algorithms for 3D human pose estimation from a still image. We observe that recent innovation in this area mainly focuses on new techniques that explicitly address the generalization issue when using this dataset, because this database is constructed in a highly controlled environment with limited human subjects and background variations. Despite such efforts, we can show that the results of the current methods are still error-prone especially when tested against the images taken in-the-wild. In this paper, we aim to tackle this problem from a different perspective. We propose a principled approach to generate high quality 3D pose ground truth given any in-the-wild image with a person inside. We achieve this by first devising a novel stereo inspired neural network to directly map any 2D pose to high quality 3D counterpart. We then perform a carefully designed geometric searching scheme to further refine the joints. Based on this scheme, we build a large-scale dataset with 400,000 in-the-wild images and their corresponding 3D pose ground truth. This enables the training of a high quality neural network model, without specialized training scheme and auxiliary loss function, which performs favorably against the state-of-the-art 3D pose estimation methods. We also evaluate the generalization ability of our model both quantitatively and qualitatively. Results show that our approach convincingly outperforms the previous methods. We make our dataset and code publicly available.

PDF Abstract

Code

Add Remove Mark official

llcshappy/Monocular-3D-Human-Pose official

145

Tasks

Add Remove

3D Human Pose Estimation

3D Pose Estimation

Monocular 3D Human Pose Estimation

Pose Estimation

Datasets

Human3.6M

MPII

MPI-INF-3DHP

LSP

Penn Action

Results from the Paper

Edit

Ranked #64 on 3D Human Pose Estimation on Human3.6M

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
3D Human Pose Estimation	Human3.6M	Stereoscopic View Synthesis Subnetwork (GTt)	Average MPJPE (mm)	37.6	# 64	Compare
			Using 2D ground-truth joints	Yes	# 2	Compare
			Multi-View or Monocular	Multi-View	# 1	Compare
3D Human Pose Estimation	Human3.6M	Stereoscopic View Synthesis Subnetwork	Average MPJPE (mm)	58	# 237	Compare
			Using 2D ground-truth joints	No	# 2	Compare
			Multi-View or Monocular	Multi-View	# 1	Compare
3D Human Pose Estimation	MPI-INF-3DHP	Stereoscopic View Synthesis Subnetwork	AUC	33.8	# 75	Compare
3D Human Pose Estimation	MPI-INF-3DHP	Stereoscopic View Synthesis Subnetwork	PCK	71.2	# 79	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Generalizing Monocular 3D Human Pose Estimation in the Wild

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove