AnoShift: A Distribution Shift Benchmark for Unsupervised Anomaly Detection

30 Jun 2022  ·  Marius Dragoi, Elena Burceanu, Emanuela Haller, Andrei Manolache, Florin Brad ·

Analyzing the distribution shift of data is a growing research direction in Machine Learning (ML), leading to new benchmarks that provide suitable scenarios for studying the generalization properties of ML models. Existing benchmarks focus on supervised learning, and to the best of our knowledge, there is none for unsupervised learning. We therefore introduce an unsupervised anomaly detection benchmark with data that shifts over time, built on Kyoto-2006+, a traffic dataset for network intrusion detection. This type of data meets the premise of a shifting input distribution: it covers a large time span ($10$ years), with naturally occurring changes over time (e.g., users modifying their behavior patterns, and software updates). We first highlight the non-stationary nature of the data, using a basic per-feature analysis, t-SNE, and an Optimal Transport approach for measuring the overall distribution distances between years. Next, we propose AnoShift, a protocol that splits the data into IID, NEAR, and FAR testing splits. We validate the performance degradation over time with diverse models, ranging from classical approaches to deep learning. Finally, we show that by acknowledging the distribution shift problem and properly addressing it, performance can be improved compared to classical training, which assumes independent and identically distributed data (on average, by up to $3\%$ for our approach). Dataset and code are available at https://github.com/bit-ml/AnoShift/.
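The abstract mentions measuring distribution distances between years with an Optimal Transport approach. A minimal sketch of that idea, on synthetic stand-in data (the paper uses Kyoto-2006+ traffic features; the feature arrays and shift values below are illustrative assumptions, not the paper's exact procedure):

```python
import numpy as np
from scipy.stats import wasserstein_distance

# Toy stand-ins for one numeric feature observed in two different years.
# A real analysis would load the corresponding Kyoto-2006+ feature columns.
rng = np.random.default_rng(0)
feature_year_a = rng.normal(loc=0.0, scale=1.0, size=5000)
feature_year_b = rng.normal(loc=1.5, scale=1.2, size=5000)  # drifted distribution

# 1-D optimal transport (earth mover's) distance between the two years:
# a nonzero value quantifies how far the feature distribution has shifted.
shift = wasserstein_distance(feature_year_a, feature_year_b)
print(f"OT distance between years: {shift:.2f}")
```

Computing such per-feature (or aggregated) distances between the training year and each later year makes the growing shift measurable, which motivates the NEAR/FAR split boundaries.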


Datasets

Introduced in the paper: AnoShift
Task: Unsupervised Anomaly Detection · Dataset: AnoShift · Values are ROC-AUC (%), with the global rank per metric in parentheses.

| Model                         | ROC-AUC IID | ROC-AUC NEAR | ROC-AUC FAR | ROC-AUC-ID (In-Distribution setup) |
|-------------------------------|-------------|--------------|-------------|------------------------------------|
| COPOD                         | 85.62 (#5)  | 54.24 (#7)   | 50.42 (#5)  | 80.89 (#4)                         |
| AE for anomalies              | 81.00 (#9)  | 44.06 (#11)  | 19.96 (#15) | 64.08 (#10)                        |
| LUNAR                         | 85.75 (#4)  | 49.03 (#9)   | 28.19 (#11) | 78.53 (#7)                         |
| OC-SVM                        | 76.86 (#10) | 71.43 (#5)   | 49.57 (#6)  | 68.73 (#8)                         |
| Internal Contrastive Learning | 84.86 (#6)  | 52.26 (#8)   | 22.45 (#14) | 66.99 (#9)                         |
| deepSVDD                      | 92.67 (#1)  | 87.00 (#1)   | 34.53 (#10) | 88.24 (#1)                         |
| SO-GAAL                       | 50.48 (#11) | 54.55 (#6)   | 49.35 (#7)  | 49.90 (#11)                        |
| ECOD (Li et al., 2022)        | 84.76 (#7)  | 44.87 (#10)  | 49.19 (#8)  | 79.41 (#6)                         |
| IsoForest                     | 86.09 (#3)  | 75.26 (#4)   | 27.16 (#13) | 81.27 (#3)                         |
| BERT                          | 84.54 (#8)  | 86.05 (#2)   | 28.15 (#12) | 79.62 (#5)                         |
| LOF                           | 91.50 (#2)  | 79.29 (#3)   | 34.96 (#9)  | 87.61 (#2)                         |
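The table above reports ROC-AUC on temporally ordered IID, NEAR, and FAR splits. A minimal sketch of that evaluation pattern, using an IsolationForest on synthetic data (the split construction, shift magnitudes, and anomaly placement below are illustrative assumptions; the benchmark's real splits are defined by Kyoto-2006+ years):

```python
import numpy as np
from sklearn.ensemble import IsolationForest
from sklearn.metrics import roc_auc_score

rng = np.random.default_rng(0)

def make_split(drift):
    """Toy test split: normal traffic drifts by `drift`; anomalies stay far out."""
    normal = rng.normal(loc=drift, scale=1.0, size=(500, 4))
    anomalous = rng.normal(loc=6.0, scale=1.0, size=(50, 4))
    X = np.vstack([normal, anomalous])
    y = np.r_[np.zeros(500), np.ones(50)]  # 1 = anomaly
    return X, y

# Train unsupervised, on (assumed mostly clean) in-distribution data only.
X_train = rng.normal(loc=0.0, scale=1.0, size=(2000, 4))
model = IsolationForest(random_state=0).fit(X_train)

# Evaluate on increasingly shifted splits, mirroring the IID/NEAR/FAR protocol.
aucs = {}
for name, drift in [("IID", 0.0), ("NEAR", 1.5), ("FAR", 3.0)]:
    X, y = make_split(drift)
    scores = -model.score_samples(X)  # higher score = more anomalous
    aucs[name] = roc_auc_score(y, scores)
    print(f"{name}: ROC-AUC = {aucs[name]:.2f}")
```

As the normal data drifts away from the training distribution, the detector's notion of "normal" becomes stale, which is the degradation the benchmark is designed to expose.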
