TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Out-of-Distribution Detection	STL-10	Baseline (Gaussian)	Percentage correct	73.28	# 5
Out-of-Distribution Detection	STL-10	Dropout(Gaussian)	Percentage correct	70.57	# 6
Out-of-Distribution Detection	STL-10	Dropout(Imagenet)	Percentage correct	78.93	# 4
Out-of-Distribution Detection	STL-10	Mixup (Gaussian)	Percentage correct	95.93	# 1
Out-of-Distribution Detection	STL-10	Mixup (Imagenet)	Percentage correct	83.28	# 2
Out-of-Distribution Detection	STL-10	Baseline (Imagenet)	Percentage correct	80.57	# 3

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/on-mixup-training-improved-calibration-and/out-of-distribution-detection-on-stl-10)](https://paperswithcode.com/sota/out-of-distribution-detection-on-stl-10?p=on-mixup-training-improved-calibration-and)`

On Mixup Training: Improved Calibration and Predictive Uncertainty for Deep Neural Networks

NeurIPS 2019 · Sunil Thulasidasan, Gopinath Chennupati, Jeff Bilmes, Tanmoy Bhattacharya, Sarah Michalak ·

Mixup~\cite{zhang2017mixup} is a recently proposed method for training deep neural networks where additional samples are generated during training by convexly combining random pairs of images and their associated labels. While simple to implement, it has been shown to be a surprisingly effective method of data augmentation for image classification: DNNs trained with mixup show noticeable gains in classification performance on a number of image classification benchmarks. In this work, we discuss a hitherto untouched aspect of mixup training -- the calibration and predictive uncertainty of models trained with mixup. We find that DNNs trained with mixup are significantly better calibrated -- i.e., the predicted softmax scores are much better indicators of the actual likelihood of a correct prediction -- than DNNs trained in the regular fashion. We conduct experiments on a number of image classification architectures and datasets -- including large-scale datasets like ImageNet -- and find this to be the case. Additionally, we find that merely mixing features does not result in the same calibration benefit and that the label smoothing in mixup training plays a significant role in improving calibration. Finally, we also observe that mixup-trained DNNs are less prone to over-confident predictions on out-of-distribution and random-noise data. We conclude that the typical overconfidence seen in neural networks, even on in-distribution data is likely a consequence of training with hard labels, suggesting that mixup be employed for classification tasks where predictive uncertainty is a significant concern.

PDF Abstract NeurIPS 2019 PDF NeurIPS 2019 Abstract

Code

Add Remove Mark official

paganpasta/onmixup

MacroMayhem/OnMixup

Tasks

Add Remove

Classification

Data Augmentation

General Classification

Image Classification

Out-of-Distribution Detection

Datasets

CIFAR-10

ImageNet

CIFAR-100

Fashion-MNIST

STL-10

Results from the Paper

Edit

Ranked #1 on Out-of-Distribution Detection on STL-10

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Out-of-Distribution Detection	STL-10	Baseline (Gaussian)	Percentage correct	73.28	# 5	Compare
Out-of-Distribution Detection	STL-10	Dropout(Gaussian)	Percentage correct	70.57	# 6	Compare
Out-of-Distribution Detection	STL-10	Dropout(Imagenet)	Percentage correct	78.93	# 4	Compare
Out-of-Distribution Detection	STL-10	Mixup (Gaussian)	Percentage correct	95.93	# 1	Compare
Out-of-Distribution Detection	STL-10	Mixup (Imagenet)	Percentage correct	83.28	# 2	Compare
Out-of-Distribution Detection	STL-10	Baseline (Imagenet)	Percentage correct	80.57	# 3	Compare

Methods

Add Remove

Label Smoothing • Mixup • Softmax

Edit Social Preview

On Mixup Training: Improved Calibration and Predictive Uncertainty for Deep Neural Networks

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove