Last Layer Re-Training is Sufficient for Robustness to Spurious Correlations

6 Apr 2022  ·  Polina Kirichenko, Pavel Izmailov, Andrew Gordon Wilson ·

Neural network classifiers can largely rely on simple spurious features, such as backgrounds, to make predictions. However, even in these cases, we show that they still often learn core features associated with the desired attributes of the data, contrary to recent findings. Inspired by this insight, we demonstrate that simple last layer retraining can match or outperform state-of-the-art approaches on spurious correlation benchmarks, but with profoundly lower complexity and computational expenses. Moreover, we show that last layer retraining on large ImageNet-trained models can also significantly reduce reliance on background and texture information, improving robustness to covariate shift, after only minutes of training on a single GPU.

PDF Abstract
Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Out-of-Distribution Generalization UrbanCars DFR (CoObj) BG Gap -19.1 # 1
CoObj Gap -8.6 # 1
BG+CoObj Gap -64.9 # 1
Out-of-Distribution Generalization UrbanCars DFR (BG) BG Gap -9.8 # 1
CoObj Gap -13.6 # 1
BG+CoObj Gap -58.9 # 1
Out-of-Distribution Generalization UrbanCars DFR (BG+CoObj) BG Gap -10.7 # 1
CoObj Gap -6.9 # 1
BG+CoObj Gap -45.2 # 1

Methods


No methods listed for this paper. Add relevant methods here