“Look Ma, no landmarks!” – Unsupervised, Model-based Dense Face Alignment

ECCV 2020  ·  Tatsuro Koizumi, William A. P. Smith

In this paper, we show how to train an image-to-image network to predict dense correspondence between a face image and a 3D morphable model using only the model for supervision. We show that both geometric parameters (shape, pose and camera intrinsics) and photometric parameters (texture and lighting) can be inferred directly from the correspondence map using linear least squares and our novel inverse spherical harmonic lighting model. The least squares residuals provide an unsupervised training signal that allows us to avoid artefacts common in the literature such as shrinking and conservative underfitting. Our approach uses a network that is 10$\times$ smaller than parameter regression networks, significantly reduces sensitivity to image alignment and allows known camera calibration or multi-image constraints to be incorporated during inference. We achieve results competitive with the state of the art but without any auxiliary supervision used by previous methods.
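
The idea of recovering parameters from a correspondence map by linear least squares, and reusing the residual as a training signal, can be illustrated with a much simpler problem than the one in the paper. The sketch below is not the authors' implementation: it assumes an affine camera (the paper estimates shape, pose and camera intrinsics), and the function name `fit_affine_camera` and the synthetic data are hypothetical, chosen only for illustration.

```python
import numpy as np

def fit_affine_camera(points_3d, points_2d):
    """Fit points_2d ~= [points_3d, 1] @ P by linear least squares.

    Hypothetical helper: an affine camera stands in for the richer
    geometric model described in the abstract.
    """
    n = points_3d.shape[0]
    # Homogeneous 3D coordinates, shape (n, 4).
    X_h = np.hstack([points_3d, np.ones((n, 1))])
    # Solve for the 4x2 affine camera matrix in closed form.
    P, _, _, _ = np.linalg.lstsq(X_h, points_2d, rcond=None)
    # Per-point reprojection residual; its magnitude can act as a loss.
    residual = X_h @ P - points_2d
    return P, residual

# Synthetic stand-in for a dense correspondence map: each "pixel" has a
# 2D image position and a matched 3D model point (assumed data).
rng = np.random.default_rng(0)
points_3d = rng.normal(size=(500, 3))
true_P = rng.normal(size=(4, 2))
points_2d = np.hstack([points_3d, np.ones((500, 1))]) @ true_P

P, residual = fit_affine_camera(points_3d, points_2d)
loss = float(np.mean(residual ** 2))
```

With noise-free correspondences the fit recovers `true_P` and the residual is numerically zero; with inconsistent correspondences the residual grows, which is the property that makes it usable as an unsupervised signal.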

Results from the Paper


| Task | Dataset | Model | Metric Name | Metric Value | Global Rank |
|---|---|---|---|---|---|
| 3D Face Reconstruction | NoW Benchmark | UMDFA | Mean Reconstruction Error (mm) | 1.89 | # 15 |
| | | | Stdev Reconstruction Error (mm) | 1.57 | # 14 |
| | | | Median Reconstruction Error (mm) | 1.52 | # 16 |
