3D Human Pose Estimation with 2D Marginal Heatmaps

5 Jun 2018  ·  Aiden Nibali, Zhen He, Stuart Morgan, Luke Prendergast ·

Automatically determining three-dimensional human pose from monocular RGB image data is a challenging problem. The two-dimensional nature of the input results in intrinsic ambiguities which make inferring depth particularly difficult. Recently, researchers have demonstrated that the flexible statistical modelling capabilities of deep neural networks are sufficient to make such inferences with reasonable accuracy. However, many of these models use coordinate output techniques which are memory-intensive, not differentiable, and/or do not spatially generalise well. We propose improvements to 3D coordinate prediction which avoid the aforementioned undesirable traits by predicting 2D marginal heatmaps under an augmented soft-argmax scheme. Our resulting model, MargiPose, produces visually coherent heatmaps whilst maintaining differentiability. We are also able to achieve state-of-the-art accuracy on publicly available 3D human pose estimation data.

PDF Abstract
Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
3D Human Pose Estimation Human3.6M MargiPose (multi-crop) Average MPJPE (mm) 55.4 # 220
PA-MPJPE 39 # 45
3D Human Pose Estimation Human3.6M MargiPose Average MPJPE (mm) 57 # 232
PA-MPJPE 40.4 # 57
3D Human Pose Estimation MPI-INF-3DHP MargiPose (multi-crop) AUC 47 # 49
MPJPE 91.3 # 46
PCK 85.4 # 44

Methods


No methods listed for this paper. Add relevant methods here