TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Out-of-Distribution Detection	20 Newsgroups	2-Layered GRU	AUROC	99.6	# 1
Out-of-Distribution Detection	20 Newsgroups	2-Layered GRU	FPR95	1.78	# 1
Out-of-Distribution Detection	CIFAR-10	Wide ResNet 40x2	FPR95	2.0	# 2
Out-of-Distribution Detection	CIFAR-10	Wide ResNet 40x2	AUROC	99.9	# 2
Out-of-Distribution Detection	CIFAR-100	Wide ResNet 40x2	FPR95	23.4	# 1
Out-of-Distribution Detection	CIFAR-100	Wide ResNet 40x2	AUROC	97.7	# 1
Out-of-Distribution Detection	SST	2-Layered GRU	AUROC	99.7	# 1
Out-of-Distribution Detection	SST	2-Layered GRU	FPR95	20.9	# 1
Out-of-Distribution Detection	TREC-NEWS	2-Layered GRU	AUROC	99.9	# 1
Out-of-Distribution Detection	TREC-NEWS	2-Layered GRU	FPR95	4.7	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/an-effective-baseline-for-robustness-to/out-of-distribution-detection-on-20)](https://paperswithcode.com/sota/out-of-distribution-detection-on-20?p=an-effective-baseline-for-robustness-to)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/an-effective-baseline-for-robustness-to/out-of-distribution-detection-on-cifar-100)](https://paperswithcode.com/sota/out-of-distribution-detection-on-cifar-100?p=an-effective-baseline-for-robustness-to)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/an-effective-baseline-for-robustness-to/out-of-distribution-detection-on-sst)](https://paperswithcode.com/sota/out-of-distribution-detection-on-sst?p=an-effective-baseline-for-robustness-to)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/an-effective-baseline-for-robustness-to/out-of-distribution-detection-on-trec-news)](https://paperswithcode.com/sota/out-of-distribution-detection-on-trec-news?p=an-effective-baseline-for-robustness-to)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/an-effective-baseline-for-robustness-to/out-of-distribution-detection-on-cifar-10)](https://paperswithcode.com/sota/out-of-distribution-detection-on-cifar-10?p=an-effective-baseline-for-robustness-to)`

An Effective Baseline for Robustness to Distributional Shift

15 May 2021 · Sunil Thulasidasan, Sushil Thapa, Sayera Dhaubhadel, Gopinath Chennupati, Tanmoy Bhattacharya, Jeff Bilmes ·

Refraining from confidently predicting when faced with categories of inputs different from those seen during training is an important requirement for the safe deployment of deep learning systems. While simple to state, this has been a particularly challenging problem in deep learning, where models often end up making overconfident predictions in such situations. In this work we present a simple, but highly effective approach to deal with out-of-distribution detection that uses the principle of abstention: when encountering a sample from an unseen class, the desired behavior is to abstain from predicting. Our approach uses a network with an extra abstention class and is trained on a dataset that is augmented with an uncurated set that consists of a large number of out-of-distribution (OoD) samples that are assigned the label of the abstention class; the model is then trained to learn an effective discriminator between in and out-of-distribution samples. We compare this relatively simple approach against a wide variety of more complex methods that have been proposed both for out-of-distribution detection as well as uncertainty modeling in deep learning, and empirically demonstrate its effectiveness on a wide variety of of benchmarks and deep architectures for image recognition and text classification, often outperforming existing approaches by significant margins. Given the simplicity and effectiveness of this method, we propose that this approach be used as a new additional baseline for future work in this domain.

PDF Abstract

Code

Add Remove Mark official

Sushil-Thapa/Abstention-OoD official

Tasks

Add Remove

Out-of-Distribution Detection

Robust classification

text-classification

Datasets

CIFAR-10

CIFAR-100

SVHN

SST

IMDb Movie Reviews

SNLI

Places

Tiny ImageNet

LSUN

WMT 2016

Tiny Images 20 Newsgroups COVID-19 Twitter Chatter Dataset

Results from the Paper

Edit

Ranked #1 on Out-of-Distribution Detection on CIFAR-100 (using extra training data)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Out-of-Distribution Detection	20 Newsgroups	2-Layered GRU	AUROC	99.6	# 1	Compare
Out-of-Distribution Detection	20 Newsgroups	2-Layered GRU	FPR95	1.78	# 1	Compare
Out-of-Distribution Detection	CIFAR-10	Wide ResNet 40x2	FPR95	2.0	# 2	Compare
Out-of-Distribution Detection	CIFAR-10	Wide ResNet 40x2	AUROC	99.9	# 2	Compare
Out-of-Distribution Detection	CIFAR-100	Wide ResNet 40x2	FPR95	23.4	# 1	Compare
Out-of-Distribution Detection	CIFAR-100	Wide ResNet 40x2	AUROC	97.7	# 1	Compare
Out-of-Distribution Detection	SST	2-Layered GRU	AUROC	99.7	# 1	Compare
Out-of-Distribution Detection	SST	2-Layered GRU	FPR95	20.9	# 1	Compare
Out-of-Distribution Detection	TREC-NEWS	2-Layered GRU	AUROC	99.9	# 1	Compare
Out-of-Distribution Detection	TREC-NEWS	2-Layered GRU	FPR95	4.7	# 1	Compare

Methods

Add Remove

Average Pooling • Batch Normalization • Convolution • Dropout • Global Average Pooling • GRU • Kaiming Initialization • ReLU • Residual Connection • Wide Residual Block • WideResNet

Edit Social Preview

An Effective Baseline for Robustness to Distributional Shift

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove