Learning an Animatable Detailed 3D Face Model from In-The-Wild Images

7 Dec 2020  ·  Yao Feng, Haiwen Feng, Michael J. Black, Timo Bolkart ·

While current monocular 3D face reconstruction methods can recover fine geometric details, they suffer several limitations. Some methods produce faces that cannot be realistically animated because they do not model how wrinkles vary with expression. Other methods are trained on high-quality face scans and do not generalize well to in-the-wild images. We present the first approach that regresses 3D face shape and animatable details that are specific to an individual but change with expression. Our model, DECA (Detailed Expression Capture and Animation), is trained to robustly produce a UV displacement map from a low-dimensional latent representation that consists of person-specific detail parameters and generic expression parameters, while a regressor is trained to predict detail, shape, albedo, expression, pose and illumination parameters from a single image. To enable this, we introduce a novel detail-consistency loss that disentangles person-specific details from expression-dependent wrinkles. This disentanglement allows us to synthesize realistic person-specific wrinkles by controlling expression parameters while keeping person-specific details unchanged. DECA is learned from in-the-wild images with no paired 3D supervision and achieves state-of-the-art shape reconstruction accuracy on two benchmarks. Qualitative results on in-the-wild data demonstrate DECA's robustness and its ability to disentangle identity- and expression-dependent details enabling animation of reconstructed faces. The model and code are publicly available at https://deca.is.tue.mpg.de.

PDF Abstract
Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
3D Face Reconstruction NoW Benchmark DECA Mean Reconstruction Error (mm) 1.38 # 5
Stdev Reconstruction Error (mm) 1.18 # 5
Median Reconstruction Error 1.09 # 5
3D Face Reconstruction REALY DECA-f @nose 2.138 (±0.461) # 19
@mouth 2.802 (±0.868) # 21
@forehead 2.457 (±0.559) # 18
@cheek 1.443 (±0.498) # 10
all 2.210 # 19
3D Face Reconstruction REALY DECA-c @nose 1.697 (±0.355) # 7
@mouth 2.516 (±0.839) # 19
@forehead 2.394 (±0.576) # 12
@cheek 1.479 (±0.535) # 12
all 2.010 # 13
3D Face Reconstruction REALY (side-view) DECA-f @nose 2.286 (±1.103) # 16
all 2.261 # 16
@mouth 2.684 (±1.041) # 16
@forehead 2.519 (±0.718) # 14
@cheek 1.555 (±0.822) # 9
3D Face Reconstruction REALY (side-view) DECA-c @nose 1.903 (±1.050) # 13
all 2.107 # 12
@mouth 2.472 (±1.079) # 14
@forehead 2.423 (±0.720) # 8
@cheek 1.630 (±1.135) # 11
3D Face Reconstruction Stirling-HQ (FG2018 3D face reconstruction challenge) DECA Mean Reconstruction Error (mm) 1.89 # 1
3D Face Reconstruction Stirling-LQ (FG2018 3D face reconstruction challenge) DECA Mean Reconstruction Error (mm) 1.91 # 1

Methods