TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Model Poisoning	CIFAR-10	ADA	Attacking Task Accuracy	65.4	# 1
Model Poisoning	Fashion-MNIST	ADA	Attacking Task Accuracy	90.2	# 1
Model Poisoning	MNIST	ADA	Attacking Task Accuracy	96.8	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/semi-targeted-model-poisoning-attack-on/model-poisoning-on-cifar-10)](https://paperswithcode.com/sota/model-poisoning-on-cifar-10?p=semi-targeted-model-poisoning-attack-on)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/semi-targeted-model-poisoning-attack-on/model-poisoning-on-fashion-mnist)](https://paperswithcode.com/sota/model-poisoning-on-fashion-mnist?p=semi-targeted-model-poisoning-attack-on)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/semi-targeted-model-poisoning-attack-on/model-poisoning-on-mnist)](https://paperswithcode.com/sota/model-poisoning-on-mnist?p=semi-targeted-model-poisoning-attack-on)`

Semi-Targeted Model Poisoning Attack on Federated Learning via Backward Error Analysis

22 Mar 2022 · Yuwei Sun, Hideya Ochiai, Jun Sakuma ·

Model poisoning attacks on federated learning (FL) intrude in the entire system via compromising an edge model, resulting in malfunctioning of machine learning models. Such compromised models are tampered with to perform adversary-desired behaviors. In particular, we considered a semi-targeted situation where the source class is predetermined however the target class is not. The goal is to cause the global classifier to misclassify data of the source class. Though approaches such as label flipping have been adopted to inject poisoned parameters into FL, it has been shown that their performances are usually class-sensitive varying with different target classes applied. Typically, an attack can become less effective when shifting to a different target class. To overcome this challenge, we propose the Attacking Distance-aware Attack (ADA) to enhance a poisoning attack by finding the optimized target class in the feature space. Moreover, we studied a more challenging situation where an adversary had limited prior knowledge about a client's data. To tackle this problem, ADA deduces pair-wise distances between different classes in the latent feature space from shared model parameters based on the backward error analysis. We performed extensive empirical evaluations on ADA by varying the factor of attacking frequency in three different image classification tasks. As a result, ADA succeeded in increasing the attack performance by 1.8 times in the most challenging case with an attacking frequency of 0.01.

PDF Abstract

Code

Add Remove Mark official

yuweisunn/ADA official

Tasks

Add Remove

Backdoor Attack

Federated Learning

Image Classification

Model Poisoning

Neural Network Security

Datasets

CIFAR-10

MNIST

Fashion-MNIST

Results from the Paper

Edit

Ranked #1 on Model Poisoning on Fashion-MNIST

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Model Poisoning	CIFAR-10	ADA	Attacking Task Accuracy	65.4	# 1	Compare
Model Poisoning	Fashion-MNIST	ADA	Attacking Task Accuracy	90.2	# 1	Compare
Model Poisoning	MNIST	ADA	Attacking Task Accuracy	96.8	# 1	Compare

Methods

Add Remove

ADA

Edit Social Preview

Semi-Targeted Model Poisoning Attack on Federated Learning via Backward Error Analysis

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove