VolNet: Estimating Human Body Part Volumes from a Single RGB Image

5 Jul 2021  ·  Fabian Leinen, Vittorio Cozzolino, Torsten Schön ·

Human body volume estimation from a single RGB image is a challenging problem despite minimal attention from the research community. However VolNet, an architecture leveraging 2D and 3D pose estimation, body part segmentation and volume regression extracted from a single 2D RGB image combined with the subject's body height can be used to estimate the total body volume. VolNet is designed to predict the 2D and 3D pose as well as the body part segmentation in intermediate tasks. We generated a synthetic, large-scale dataset of photo-realistic images of human bodies with a wide range of body shapes and realistic poses called SURREALvols. By using Volnet and combining multiple stacked hourglass networks together with ResNeXt, our model correctly predicted the volume in ~82% of cases with a 10% tolerance threshold. This is a considerable improvement compared to state-of-the-art solutions such as BodyNet with only a ~38% success rate.

PDF Abstract

Datasets


Introduced in the Paper:

SURREALvols

Used in the Paper:

LSUN SURREAL
Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Human Body Volume Estimation SURREALvols Volnet Absolute Percentage Error 0.062 # 1

Methods