TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
3D Human Pose Estimation	Human3.6M	c2f-vol	Average MPJPE (mm)	71.9	# 281
3D Human Pose Estimation	Human3.6M	c2f-vol	PA-MPJPE	51.9	# 96
3D Human Pose Estimation	HumanEva-I	c2f-vol	Mean Reconstruction Error (mm)	24.3	# 16

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/coarse-to-fine-volumetric-prediction-for/3d-human-pose-estimation-on-humaneva-i)](https://paperswithcode.com/sota/3d-human-pose-estimation-on-humaneva-i?p=coarse-to-fine-volumetric-prediction-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/coarse-to-fine-volumetric-prediction-for/3d-human-pose-estimation-on-human36m)](https://paperswithcode.com/sota/3d-human-pose-estimation-on-human36m?p=coarse-to-fine-volumetric-prediction-for)`

Coarse-to-Fine Volumetric Prediction for Single-Image 3D Human Pose

CVPR 2017 · Georgios Pavlakos, Xiaowei Zhou, Konstantinos G. Derpanis, Kostas Daniilidis ·

This paper addresses the challenge of 3D human pose estimation from a single color image. Despite the general success of the end-to-end learning paradigm, top performing approaches employ a two-step solution consisting of a Convolutional Network (ConvNet) for 2D joint localization and a subsequent optimization step to recover 3D pose. In this paper, we identify the representation of 3D pose as a critical issue with current ConvNet approaches and make two important contributions towards validating the value of end-to-end learning for this task. First, we propose a fine discretization of the 3D space around the subject and train a ConvNet to predict per voxel likelihoods for each joint. This creates a natural representation for 3D pose and greatly improves performance over the direct regression of joint coordinates. Second, to further improve upon initial estimates, we employ a coarse-to-fine prediction scheme. This step addresses the large dimensionality increase and enables iterative refinement and repeated processing of the image features. The proposed approach outperforms all state-of-the-art methods on standard benchmarks achieving a relative error reduction greater than 30% on average. Additionally, we investigate using our volumetric representation in a related architecture which is suboptimal compared to our end-to-end approach, but is of practical interest, since it enables training when no image with corresponding 3D groundtruth is available, and allows us to present compelling results for in-the-wild images.

PDF Abstract CVPR 2017 PDF CVPR 2017 Abstract

Code

Add Remove Mark official

geopavlakos/c2f-vol-demo

thuml/ContextWM

strawberryfg/c2f-3dhm-human-caffe

Tasks

Add Remove

3D Human Pose Estimation

Pose Estimation

Datasets

Human3.6M

MPII

MPII Human Pose

Results from the Paper

Edit

Ranked #16 on 3D Human Pose Estimation on HumanEva-I

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
3D Human Pose Estimation	Human3.6M	c2f-vol	Average MPJPE (mm)	71.9	# 281	Compare
3D Human Pose Estimation	Human3.6M	c2f-vol	PA-MPJPE	51.9	# 96	Compare
3D Human Pose Estimation	HumanEva-I	c2f-vol	Mean Reconstruction Error (mm)	24.3	# 16	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Coarse-to-Fine Volumetric Prediction for Single-Image 3D Human Pose

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove