TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Adversarial Robustness	ImageNet	ResNet-50 (SGD, Step)	Accuracy	76.9	# 2
Adversarial Robustness	ImageNet	ResNet-50 (AdamW, Cosine)	Accuracy	76.4	# 4
Adversarial Robustness	ImageNet	DeiT-S (AdamW, Cosine)	Accuracy	76.8	# 3
Adversarial Robustness	ImageNet	ResNet-50 (SGD, Cosine)	Accuracy	77.4	# 1
Adversarial Robustness	ImageNet-A	DeiT-S (AdamW, Cosine)	Accuracy	12.2	# 1
Adversarial Robustness	ImageNet-A	ResNet-50 (SGD, Cosine)	Accuracy	3.3	# 2
Adversarial Robustness	ImageNet-A	ResNet-50 (SGD, Step)	Accuracy	3.2	# 3
Adversarial Robustness	ImageNet-A	ResNet-50 (AdamW, Cosine)	Accuracy	3.1	# 4
Adversarial Robustness	ImageNet-C	ResNet-50 (AdamW, Cosine)	mean Corruption Error (mCE)	59.3	# 4
Adversarial Robustness	ImageNet-C	ResNet-50 (SGD, Step)	mean Corruption Error (mCE)	57.9	# 3
Adversarial Robustness	ImageNet-C	DeiT-S (AdamW, Cosine)	mean Corruption Error (mCE)	48.0	# 1
Adversarial Robustness	ImageNet-C	ResNet-50 (SGD, Cosine)	mean Corruption Error (mCE)	56.9	# 2
Adversarial Robustness	Stylized ImageNet	ResNet-50 (SGD, Step)	Accuracy	8.3	# 3
Adversarial Robustness	Stylized ImageNet	ResNet-50 (AdamW, Cosine)	Accuracy	8.1	# 4
Adversarial Robustness	Stylized ImageNet	ResNet-50 (SGD, Cosine)	Accuracy	8.4	# 2
Adversarial Robustness	Stylized ImageNet	DeiT-S (AdamW, Cosine)	Accuracy	13.0	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/are-transformers-more-robust-than-cnns/adversarial-robustness-on-imagenet)](https://paperswithcode.com/sota/adversarial-robustness-on-imagenet?p=are-transformers-more-robust-than-cnns)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/are-transformers-more-robust-than-cnns/adversarial-robustness-on-imagenet-a)](https://paperswithcode.com/sota/adversarial-robustness-on-imagenet-a?p=are-transformers-more-robust-than-cnns)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/are-transformers-more-robust-than-cnns/adversarial-robustness-on-imagenet-c)](https://paperswithcode.com/sota/adversarial-robustness-on-imagenet-c?p=are-transformers-more-robust-than-cnns)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/are-transformers-more-robust-than-cnns/adversarial-robustness-on-stylized-imagenet)](https://paperswithcode.com/sota/adversarial-robustness-on-stylized-imagenet?p=are-transformers-more-robust-than-cnns)`

Are Transformers More Robust Than CNNs?

NeurIPS 2021 · Yutong Bai, Jieru Mei, Alan Yuille, Cihang Xie ·

Transformer emerges as a powerful tool for visual recognition. In addition to demonstrating competitive performance on a broad range of visual benchmarks, recent works also argue that Transformers are much more robust than Convolutions Neural Networks (CNNs). Nonetheless, surprisingly, we find these conclusions are drawn from unfair experimental settings, where Transformers and CNNs are compared at different scales and are applied with distinct training frameworks. In this paper, we aim to provide the first fair & in-depth comparisons between Transformers and CNNs, focusing on robustness evaluations. With our unified training setup, we first challenge the previous belief that Transformers outshine CNNs when measuring adversarial robustness. More surprisingly, we find CNNs can easily be as robust as Transformers on defending against adversarial attacks, if they properly adopt Transformers' training recipes. While regarding generalization on out-of-distribution samples, we show pre-training on (external) large-scale datasets is not a fundamental request for enabling Transformers to achieve better performance than CNNs. Moreover, our ablations suggest such stronger generalization is largely benefited by the Transformer's self-attention-like architectures per se, rather than by other training setups. We hope this work can help the community better understand and benchmark the robustness of Transformers and CNNs. The code and models are publicly available at https://github.com/ytongbai/ViTs-vs-CNNs.

PDF Abstract NeurIPS 2021 PDF NeurIPS 2021 Abstract

Code

Add Remove Mark official

ytongbai/ViTs-vs-CNNs official

174

Tasks

Add Remove

Adversarial Robustness

Datasets

ImageNet

ImageNet-C

ImageNet-A

Stylized ImageNet

Results from the Paper

Edit

Ranked #1 on Adversarial Robustness on Stylized ImageNet

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Adversarial Robustness	ImageNet	ResNet-50 (SGD, Step)	Accuracy	76.9	# 2	Compare
Adversarial Robustness	ImageNet	ResNet-50 (AdamW, Cosine)	Accuracy	76.4	# 4	Compare
Adversarial Robustness	ImageNet	DeiT-S (AdamW, Cosine)	Accuracy	76.8	# 3	Compare
Adversarial Robustness	ImageNet	ResNet-50 (SGD, Cosine)	Accuracy	77.4	# 1	Compare
Adversarial Robustness	ImageNet-A	DeiT-S (AdamW, Cosine)	Accuracy	12.2	# 1	Compare
Adversarial Robustness	ImageNet-A	ResNet-50 (SGD, Cosine)	Accuracy	3.3	# 2	Compare
Adversarial Robustness	ImageNet-A	ResNet-50 (SGD, Step)	Accuracy	3.2	# 3	Compare
Adversarial Robustness	ImageNet-A	ResNet-50 (AdamW, Cosine)	Accuracy	3.1	# 4	Compare
Adversarial Robustness	ImageNet-C	ResNet-50 (AdamW, Cosine)	mean Corruption Error (mCE)	59.3	# 4	Compare
Adversarial Robustness	ImageNet-C	ResNet-50 (SGD, Step)	mean Corruption Error (mCE)	57.9	# 3	Compare
Adversarial Robustness	ImageNet-C	DeiT-S (AdamW, Cosine)	mean Corruption Error (mCE)	48.0	# 1	Compare
Adversarial Robustness	ImageNet-C	ResNet-50 (SGD, Cosine)	mean Corruption Error (mCE)	56.9	# 2	Compare
Adversarial Robustness	Stylized ImageNet	ResNet-50 (SGD, Step)	Accuracy	8.3	# 3	Compare
Adversarial Robustness	Stylized ImageNet	ResNet-50 (AdamW, Cosine)	Accuracy	8.1	# 4	Compare
Adversarial Robustness	Stylized ImageNet	ResNet-50 (SGD, Cosine)	Accuracy	8.4	# 2	Compare
Adversarial Robustness	Stylized ImageNet	DeiT-S (AdamW, Cosine)	Accuracy	13.0	# 1	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Are Transformers More Robust Than CNNs?

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove