Adversarial Defense

179 papers with code • 10 benchmarks • 5 datasets

Competitions with currently unpublished results:

TrojAI

Benchmarks

Add a Result

These leaderboards are used to track progress in Adversarial Defense

Dataset	Best Model	Compare
CIFAR-10	WRN-28-10	See all
ImageNet (non-targeted PGD, max perturbation=4)	LLR-ResNet-152	See all
ImageNet	ResNet101	See all
ImageNet (targeted PGD, max perturbation=16)	ResNeXt-101 DenoiseAll	See all
CIFAR-100	wideresnet-34-20	See all
MNIST	Defense GAN	See all
CAAD 2018	Feature Denoising	See all
TrojAI Round 0	Cassandra	See all
TrojAI Round 1	Cassandra	See all
miniImageNet	Auto Encoder-Block Switching defense with GradCAM	See all

Libraries

Use these libraries to find Adversarial Defense models and implementations

tensorflow/cleverhans

3 papers

6,079

openai/cleverhans

3 papers

6,079

cleverhans-lab/cleverhans

3 papers

6,079

locuslab/fast_adversarial

2 papers

408

See all 6 libraries.

Datasets

Subtasks

Provable Adversarial Defense

Most implemented papers

Most implemented Social Latest No code

Towards Deep Learning Models Resistant to Adversarial Attacks

MadryLab/mnist_challenge • • ICLR 2018

Its principled nature also enables us to identify methods for both training and attacking neural networks that are reliable and, in a certain sense, universal.

Paper
Code

Technical Report on the CleverHans v2.1.0 Adversarial Examples Library

tensorflow/cleverhans • • 3 Oct 2016

An adversarial example library for constructing attacks, building defenses, and benchmarking both

Paper
Code

Benchmarking Neural Network Robustness to Common Corruptions and Perturbations

hendrycks/robustness • • ICLR 2019

Then we propose a new dataset called ImageNet-P which enables researchers to benchmark a classifier's robustness to common perturbations.

Paper
Code

The Limitations of Deep Learning in Adversarial Settings

cleverhans-lab/cleverhans • • 24 Nov 2015

In this work, we formalize the space of adversaries against deep neural networks (DNNs) and introduce a novel class of algorithms to craft adversarial samples based on a precise understanding of the mapping between inputs and outputs of DNNs.

Paper
Code

Certified Adversarial Robustness via Randomized Smoothing

locuslab/smoothing • • 8 Feb 2019

We show how to turn any classifier that classifies well under Gaussian noise into a new classifier that is certifiably robust to adversarial perturbations under the $\ell_2$ norm.

Paper
Code

Theoretically Principled Trade-off between Robustness and Accuracy

yaodongyu/TRADES • • 24 Jan 2019

We identify a trade-off between robustness and accuracy that serves as a guiding principle in the design of defenses against adversarial examples.

Paper
Code

Adversarial Training for Free!

mahyarnajibi/FreeAdversarialTraining • • NeurIPS 2019

Adversarial training, in which a network is trained on adversarial examples, is one of the few defenses against adversarial attacks that withstands strong attacks.

Paper
Code

ZOO: Zeroth Order Optimization based Black-box Attacks to Deep Neural Networks without Training Substitute Models

huanzhang12/ZOO-Attack • • 14 Aug 2017

However, different from leveraging attack transferability from substitute models, we propose zeroth order optimization (ZOO) based attacks to directly estimate the gradients of the targeted DNN for generating adversarial examples.

Paper
Code

Defense-GAN: Protecting Classifiers Against Adversarial Attacks Using Generative Models

kabkabm/defensegan • • ICLR 2018

Defense-GAN is trained to model the distribution of unperturbed images.

Paper
Code

ResNets Ensemble via the Feynman-Kac Formalism to Improve Natural and Robust Accuracies

BaoWangMath/EnResNet • • NeurIPS 2019

However, both natural and robust accuracies, in classifying clean and adversarial images, respectively, of the trained robust models are far from satisfactory.

Paper
Code

Adversarial Defense

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Most implemented papers

Content

Benchmarks

Add a Result