Exploring the Limits of Out-of-Distribution Detection

Near out-of-distribution detection (OOD) is a major challenge for deep neural networks. We demonstrate that large-scale pre-trained transformers can significantly improve the state-of-the-art (SOTA) on a range of near OOD tasks across different data modalities. For instance, on CIFAR-100 vs CIFAR-10 OOD detection, we improve the AUROC from 85% (current SOTA) to more than 96% using Vision Transformers pre-trained on ImageNet-21k. On a challenging genomics OOD detection benchmark, we improve the AUROC from 66% to 77% using transformers and unsupervised pre-training. To further improve performance, we explore the few-shot outlier exposure setting where a few examples from outlier classes may be available; we show that pre-trained transformers are particularly well-suited for outlier exposure, and that the AUROC of OOD detection on CIFAR-100 vs CIFAR-10 can be improved to 98.7% with just 1 image per OOD class, and 99.46% with 10 images per OOD class. For multi-modal image-text pre-trained transformers such as CLIP, we explore a new way of using just the names of outlier classes as a sole source of information without any accompanying images, and show that this outperforms previous SOTA on standard vision OOD benchmark tasks.

PDF Abstract NeurIPS 2021 PDF NeurIPS 2021 Abstract

Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Uses Extra
Training Data
Result Benchmark
Out-of-Distribution Detection CIFAR-100 vs CIFAR-10 R50+ViT_B-16 finetuned on CIFAR-100 AUROC 96.23 # 4
AUPR 92.08 # 2
Out-of-Distribution Detection CIFAR-100 vs CIFAR-10 Ensemble of ViTs AUROC 98.11 # 2
Out-of-Distribution Detection CIFAR-100 vs CIFAR-10 ViT-L_16 finetuned on CIFAR-100 AUROC 97.98 # 3
Out-of-Distribution Detection CIFAR-100 vs CIFAR-10 CLIP using class name words describing the two distributions AUROC 94.68 # 7
Out-of-Distribution Detection CIFAR-100 vs CIFAR-10 MLP-Mixer_B-16 finetuned on CIFAR-100 AUROC 95.31 # 6
AUPR 90.22 # 4
Out-of-Distribution Detection CIFAR-100 vs CIFAR-10 ViT_B-16 finetuned on CIFAR-100 AUROC 95.53 # 5
AUPR 91.89 # 3
Out-of-Distribution Detection CIFAR-10 vs CIFAR-100 ViT finetuned on CIFAR-10 AUPR 97.68 # 3
AUROC 98.42 # 3
Out-of-Distribution Detection CIFAR-10 vs CIFAR-100 MLP-Mixer finetuned on CIFAR-10 AUPR 96.28 # 4
AUROC 97.85 # 4
Out-of-Distribution Detection CIFAR-10 vs CIFAR-100 R+ViT finetuned on CIFAR-10 AUPR 97.75 # 2
AUROC 98.52 # 2

Methods