A Baseline for Detecting Misclassified and Out-of-Distribution Examples in Neural Networks

7 Oct 2016 · Dan Hendrycks, Kevin Gimpel

We consider the two related problems of detecting if an example is misclassified or out-of-distribution. We present a simple baseline that utilizes probabilities from softmax distributions. Correctly classified examples tend to have greater maximum softmax probabilities than erroneously classified and out-of-distribution examples, allowing for their detection. We assess performance by defining several tasks in computer vision, natural language processing, and automatic speech recognition, showing the effectiveness of this baseline across all. We then show the baseline can sometimes be surpassed, demonstrating the room for future research on these underexplored detection tasks.
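The scoring rule described in the abstract can be sketched in a few lines. This is a minimal illustration, not the authors' released code: it assumes NumPy, the helper names (`softmax`, `msp_score`, `flag_suspect`) and the 0.9 threshold are hypothetical, and the logits are made-up values rather than outputs of a trained network.

```python
import numpy as np

def softmax(logits):
    """Row-wise softmax, shifted by each row's maximum for numerical stability."""
    shifted = logits - logits.max(axis=1, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=1, keepdims=True)

def msp_score(logits):
    """Maximum softmax probability per example.

    Higher values suggest a correctly classified, in-distribution input;
    lower values suggest a misclassified or out-of-distribution one.
    """
    return softmax(logits).max(axis=1)

def flag_suspect(logits, threshold=0.9):
    """Flag examples whose score falls below a threshold.

    The threshold is an assumption for illustration; in practice it would
    be chosen on held-out data.
    """
    return msp_score(logits) < threshold

# Hypothetical logits for two inputs over three classes.
logits = np.array([
    [8.0, 0.5, 0.1],   # one class dominates: high maximum probability
    [1.0, 0.9, 1.1],   # nearly uniform: low maximum probability
])

scores = msp_score(logits)
flags = flag_suspect(logits)
print(scores)
print(flags)
```

The second input is flagged because no class receives much more probability than the others. Because the score is a single number per example, detection quality can be summarized with threshold-free metrics such as AUROC and AUPR instead of committing to one cutoff.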

Code

hendrycks/error-detection (official), 218 stars

thuiar/textoir, 178 stars

JakobCode/UncertaintyInNeuralNetwor…

thuiar/textoir-demo

kobybibas/pnml_ood_detection

See all 14 implementations

Tasks

Anomaly Detection

Automatic Speech Recognition

Automatic Speech Recognition (ASR)

Out-of-Distribution Detection

Speech Recognition

Datasets

CIFAR-10

CIFAR-100

IMDb Movie Reviews

THCHS-30

Results from the Paper

Ranked #12 on Out-of-Distribution Detection on CIFAR-10 vs CIFAR-100

Task	Dataset	Model	Metric Name	Metric Value	Global Rank
Out-of-Distribution Detection	CIFAR-10 vs CIFAR-100	WRN 40-2 (MSP Baseline)	AUPR	55.8	#8
Out-of-Distribution Detection	CIFAR-10 vs CIFAR-100	WRN 40-2 (MSP Baseline)	AUROC	87.9	#12
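
For readers unfamiliar with the two metrics in the table, the sketch below shows one way to compute them from detection scores. It rests on several assumptions: NumPy is available, in-distribution examples are treated as the positive class (the table does not say which class is positive for AUPR), tied scores are not specially handled in the average-precision step, and the scores are invented for illustration, not taken from the paper. The function names are hypothetical.

```python
import numpy as np

def auroc(pos_scores, neg_scores):
    """Probability that a random positive outscores a random negative.

    Computed by comparing every positive/negative pair; ties count as half.
    """
    pos = np.asarray(pos_scores, dtype=float)[:, None]
    neg = np.asarray(neg_scores, dtype=float)[None, :]
    wins = (pos > neg).sum() + 0.5 * (pos == neg).sum()
    return wins / (pos.size * neg.size)

def average_precision(pos_scores, neg_scores):
    """Average precision, a common estimate of the area under the PR curve.

    Examples are ranked by descending score and precision is averaged at
    the rank of each positive example.
    """
    scores = np.concatenate([pos_scores, neg_scores]).astype(float)
    labels = np.concatenate([np.ones(len(pos_scores)), np.zeros(len(neg_scores))])
    order = np.argsort(-scores, kind="stable")
    labels = labels[order]
    hits = np.cumsum(labels)
    precision = hits / np.arange(1, len(labels) + 1)
    return (precision * labels).sum() / labels.sum()

# Made-up detection scores (for example, maximum softmax probabilities).
in_scores = [0.99, 0.95, 0.80, 0.60]    # in-distribution, treated as positive
out_scores = [0.90, 0.55, 0.40, 0.35]   # out-of-distribution, treated as negative

print(auroc(in_scores, out_scores))              # 0.875
print(average_precision(in_scores, out_scores))  # 0.8875
```

Both functions return values in [0, 1]; the table's figures appear to be the same quantities expressed as percentages.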

Methods

1x1 Convolution • Average Pooling • Batch Normalization • Bottleneck Residual Block • Convolution • Global Average Pooling • Kaiming Initialization • Max Pooling • ReLU • Residual Block • Residual Connection • ResNet • Softmax
