Adversarial Attack Detection
14 papers with code • 0 benchmarks • 0 datasets
The detection of adversarial attacks: identifying inputs that have been deliberately perturbed to fool a model.
Latest papers
OUTFOX: LLM-Generated Essay Detection Through In-Context Learning with Adversarially Generated Examples
Experiments in the domain of student essays show that the proposed detector improves detection performance on attacker-generated texts by up to +41.3 F1 points.
Graph-based methods coupled with specific distributional distances for adversarial attack detection
We introduce a novel approach of detection and interpretation of adversarial attacks from a graph perspective.
Unfolding Local Growth Rate Estimates for (Almost) Perfect Adversarial Detection
Convolutional neural networks (CNNs) define the state-of-the-art solution on many perceptual tasks.
Detecting Adversarial Examples in Batches -- a geometrical approach
Many deep learning methods have successfully solved complex tasks in computer vision and speech recognition applications.
Residue-Based Natural Language Adversarial Attack Detection
Many popular image adversarial detection approaches can identify adversarial examples from embedding feature spaces, whereas existing state-of-the-art detection approaches in the NLP domain focus solely on input text features, without considering model embedding spaces.
Segment and Complete: Defending Object Detectors against Adversarial Patch Attacks with Robust Patch Detection
In addition, we design a robust shape completion algorithm, which is guaranteed to remove the entire patch from the images if the outputs of the patch segmenter are within a certain Hamming distance of the ground-truth patch masks.
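The guarantee above is stated in terms of the Hamming distance between the patch segmenter's output and the ground-truth patch mask. A minimal sketch of that distance on binary masks (the helper name `mask_hamming` and the toy masks are our own, not from the paper):

```python
import numpy as np

def mask_hamming(pred_mask: np.ndarray, gt_mask: np.ndarray) -> int:
    """Number of pixels where the predicted patch mask disagrees with ground truth."""
    return int(np.sum(pred_mask.astype(bool) != gt_mask.astype(bool)))

# Toy 4x4 example: the ground-truth patch occupies the top-left 2x2 block.
gt = np.zeros((4, 4), dtype=bool)
gt[:2, :2] = True

# Predicted mask misses one patch pixel and adds one spurious pixel.
pred = gt.copy()
pred[1, 1] = False   # missed patch pixel
pred[3, 3] = True    # false-positive pixel

print(mask_hamming(pred, gt))  # → 2
```

If the segmenter keeps this distance below the paper's threshold, the completion step can still recover the full patch region.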
Is RobustBench/AutoAttack a suitable Benchmark for Adversarial Robustness?
In its most commonly reported sub-task, RobustBench evaluates and ranks the adversarial robustness of trained neural networks on CIFAR10 under AutoAttack (Croce and Hein 2020b) with l-inf perturbations limited to eps = 8/255.
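The l-inf constraint in that benchmark means every pixel of the adversarial image may differ from the clean image by at most 8/255. A self-contained sketch of projecting a candidate back into that budget (illustrative helper only; AutoAttack enforces this constraint internally, this is not RobustBench code):

```python
import numpy as np

EPS = 8 / 255  # l-inf budget used by the RobustBench/AutoAttack CIFAR10 track

def project_linf(x_adv: np.ndarray, x: np.ndarray, eps: float = EPS) -> np.ndarray:
    """Project an adversarial candidate into the l-inf ball of radius eps
    around the clean input x, then clip to the valid image range [0, 1]."""
    delta = np.clip(x_adv - x, -eps, eps)
    return np.clip(x + delta, 0.0, 1.0)

# Toy 'image': the perturbed version exceeds the budget at some pixels.
x = np.array([0.2, 0.5, 0.9])
x_adv = np.array([0.3, 0.499, 0.99])

x_proj = project_linf(x_adv, x)
print(np.max(np.abs(x_proj - x)) <= EPS + 1e-12)  # → True
```

Any attack evaluated under this track must satisfy the same per-pixel bound, which is what makes the leaderboard rankings comparable.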
Two Souls in an Adversarial Image: Towards Universal Adversarial Example Detection using Multi-view Inconsistency
To this end, Argos first amplifies the discrepancies between the visual content of an image and its attack-induced misclassified label using a set of regeneration mechanisms, and then flags an image as adversarial if the reproduced views deviate beyond a preset degree.
Maximum Mean Discrepancy Test is Aware of Adversarial Attacks
However, it has been shown that the MMD test is unaware of adversarial attacks -- the MMD test failed to detect the discrepancy between natural and adversarial data.
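For context, the MMD test compares two samples via kernel mean embeddings. A generic biased estimator of squared MMD with a Gaussian kernel is sketched below (this is the standard textbook estimator in plain NumPy, not the paper's adversarial-aware variant; the shifted sample is only a stand-in for adversarial data):

```python
import numpy as np

def gaussian_kernel(a: np.ndarray, b: np.ndarray, sigma: float = 1.0) -> np.ndarray:
    """Pairwise Gaussian (RBF) kernel matrix between rows of a and b."""
    d2 = np.sum((a[:, None, :] - b[None, :, :]) ** 2, axis=-1)
    return np.exp(-d2 / (2 * sigma ** 2))

def mmd2(x: np.ndarray, y: np.ndarray, sigma: float = 1.0) -> float:
    """Biased estimate of squared Maximum Mean Discrepancy between samples x and y."""
    kxx = gaussian_kernel(x, x, sigma).mean()
    kyy = gaussian_kernel(y, y, sigma).mean()
    kxy = gaussian_kernel(x, y, sigma).mean()
    return float(kxx + kyy - 2 * kxy)

rng = np.random.default_rng(0)
natural = rng.normal(0.0, 1.0, size=(200, 2))
same    = rng.normal(0.0, 1.0, size=(200, 2))   # drawn from the same distribution
shifted = rng.normal(1.5, 1.0, size=(200, 2))   # stand-in for adversarial data

print(mmd2(natural, same))     # close to 0
print(mmd2(natural, shifted))  # clearly larger
```

The paper's point is that adversarial perturbations are crafted to keep such plain two-sample statistics near zero, so the test must be adapted before it can reliably flag adversarial batches.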
Towards Feature Space Adversarial Attack
We propose a new adversarial attack on Deep Neural Networks for image classification.