Joint Statistical and Causal Feature Modulated Face Anti-Spoofing
In this paper, we propose a hierarchical feature modulation (HFM) approach for stable face anti-spoofing in unseen domains and unseen attacks. The conventional multi-domain based generalizable approaches likely lead to local optima due to the complicated or heuristic learning paradigm. Inspired by the fact that high-level semantic disturbances and low-level miscellaneous bias jointly cause the distribution shift, HFM aims to modulate the fine-grained feature in a hierarchical manner. Specifically, we complement the structural feature with patch-wise learnable statistical information, i.e. local difference histogram, to relieve the overfitting on high-level semantics. We further introduce the structural causal model (SCM) with imaging color model to reveal that presenting mediums and capturing devices destroy the liveness-relevant information from the low level. Thus we model this hidden entanglement as a distribution mixture problem and propose the expectation-maximization (EM) based causal intervention to remove these miscellanies. Experimental results on public datasets demonstrate the effectiveness of HFM, especially in out-of-distribution settings.
PDF Abstract