DeCaFA: Deep Convolutional Cascade for Face Alignment In The Wild

ICCV 2019  ·  Arnaud Dapogny, Kévin Bailly, Matthieu Cord ·

Face Alignment is an active computer vision domain, that consists in localizing a number of facial landmarks that vary across datasets. State-of-the-art face alignment methods either consist in end-to-end regression, or in refining the shape in a cascaded manner, starting from an initial guess. In this paper, we introduce DeCaFA, an end-to-end deep convolutional cascade architecture for face alignment. DeCaFA uses fully-convolutional stages to keep full spatial resolution throughout the cascade. Between each cascade stage, DeCaFA uses multiple chained transfer layers with spatial softmax to produce landmark-wise attention maps for each of several landmark alignment tasks. Weighted intermediate supervision, as well as efficient feature fusion between the stages allow to learn to progressively refine the attention maps in an end-to-end manner. We show experimentally that DeCaFA significantly outperforms existing approaches on 300W, CelebA and WFLW databases. In addition, we show that DeCaFA can learn fine alignment with reasonable accuracy from very few images using coarsely annotated data.

PDF Abstract ICCV 2019 PDF ICCV 2019 Abstract
No code implementations yet. Submit your code now

Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Face Alignment 300W DeCaFA NME_inter-ocular (%, Full) 3.39 # 26
NME_inter-ocular (%, Common) 2.93 # 24
NME_inter-ocular (%, Challenge) 5.26 # 29
Face Alignment WFLW DeCaFA NME (inter-ocular) 4.62 # 22
AUC@10 (inter-ocular) 56.3 # 18
FR@10 (inter-ocular) 4.84 # 19

Methods