Outlier Detection

195 papers with code • 11 benchmarks • 11 datasets

Outlier detection is the task of identifying the instances in a data set that are considered anomalous because they differ markedly from the other instances. It is one of the core data mining tasks and is central to many applications. In security, it can be used to identify potentially threatening users; in manufacturing, it can be used to identify parts that are likely to fail.

Source: Coverage-based Outlier Explanation

Benchmarks

These leaderboards are used to track progress in Outlier Detection.

Dataset	Best Model
ECG5000	VRAE+SVM
Breast cancer Wisconsin_class 2	ASVDD
Breast cancer Wisconsin_class 4	ASVDD
Balance scale_class 1	
Glass identification	ASVDD
Ionosphere_class b	ASVDD
Fashion-MNIST	PAE
Hepatitis	MIX
Heart-C	MIX
Internet Ad	MIX
SKAB	LSTMCaps

Libraries

Use these libraries to find Outlier Detection models and implementations.

yzhao062/pyod • 5 papers • 7,985 stars

pygod-team/pygod • 2 papers • 1,216 stars

elki-project/elki • 2 papers • 771 stars

xuhongzuo/DeepOD • 2 papers • 299 stars

See all 6 libraries.

Most implemented papers

TOD: GPU-accelerated Outlier Detection via Tensor Operations

yzhao062/pytod • 26 Oct 2021

Outlier detection (OD) is a key learning task for finding rare and deviant data samples, with many time-critical applications such as fraud detection and intrusion detection.

ECOD: Unsupervised Outlier Detection Using Empirical Cumulative Distribution Functions

yzhao062/pyod • 2 Jan 2022

To address these issues, we present a simple yet effective algorithm called ECOD (Empirical-Cumulative-distribution-based Outlier Detection), which is inspired by the fact that outliers are often the "rare events" that appear in the tails of a distribution.
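
The tail intuition in this abstract can be illustrated with a minimal sketch. This is not the PyOD implementation of ECOD and it omits details of the published method (such as its skewness-based tail selection); it only assumes that a point is suspicious when, per feature, few samples lie at or beyond it on one side. The function name `ecdf_tail_scores` and the toy data are hypothetical.

```python
import numpy as np


def ecdf_tail_scores(X):
    """Score rows by how deep into the empirical tails their features fall.

    Simplified illustration only: per feature, estimate left and right tail
    probabilities from the empirical CDF, then sum negative log probabilities
    across features and keep the larger of the two sides.
    """
    X = np.asarray(X, dtype=float)
    n, d = X.shape
    left = np.empty_like(X)
    right = np.empty_like(X)
    for j in range(d):
        col = X[:, j]
        sorted_col = np.sort(col)
        # Fraction of samples <= x (left tail); always >= 1/n, so log is finite.
        left[:, j] = np.searchsorted(sorted_col, col, side="right") / n
        # Fraction of samples >= x (right tail); always >= 1/n as well.
        right[:, j] = (n - np.searchsorted(sorted_col, col, side="left")) / n
    # A small tail probability gives a large negative log, hence a large score.
    left_score = -np.log(left).sum(axis=1)
    right_score = -np.log(right).sum(axis=1)
    return np.maximum(left_score, right_score)


# Toy data: 50 points on an anti-diagonal plus one point far out in both features.
base = np.arange(50, dtype=float)
X = np.column_stack([base, base[::-1]])
X = np.vstack([X, [100.0, 100.0]])

scores = ecdf_tail_scores(X)
most_outlying = int(np.argmax(scores))  # index of the appended point (50)
```

Because the score is built from ranks rather than distances, this sketch needs no hyperparameters, but it treats features independently and so cannot flag points that are unusual only in combination.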

ODBO: Bayesian Optimization with Search Space Prescreening for Directed Protein Evolution

tencent-quantum-lab/odbo • 19 May 2022

Directed evolution is a versatile technique in protein engineering that mimics the process of natural selection by iteratively alternating between mutagenesis and screening in order to search for sequences that optimize a given property of interest, such as catalytic activity and binding affinity to a specified target.

DeepCAVE: An Interactive Analysis Tool for Automated Machine Learning

automl/deepcave • 7 Jun 2022

Automated Machine Learning (AutoML) is used more than ever before to support users in determining efficient hyperparameters, neural architectures, or even full machine learning pipelines.

BOND: Benchmarking Unsupervised Outlier Node Detection on Static Attributed Graphs

pygod-team/pygod • 21 Jun 2022

To bridge this gap, we present--to the best of our knowledge--the first comprehensive benchmark for unsupervised outlier node detection on static attributed graphs called BOND, with the following highlights.

Computationally Assisted Quality Control for Public Health Data Streams

cmu-delphi/covidcast-indicators • 29 Jun 2023

However, existing outlier detection frameworks perform poorly on this task because they do not account for the data volume or for the statistical properties of public health streams.

A Fast Greedy Algorithm for Outlier Mining

ducthanhtran/greedy_outlier_entropy • Advances in Knowledge Discovery and Data Mining 2006

The task of outlier detection is to find small groups of data objects that are exceptional when compared with the remaining large amount of data.

Condition Number Analysis of Kernel-based Density Ratio Estimation

JohnYKiyo/density_ratio_estimation • 15 Dec 2009

We show that the kernel least-squares method has a smaller condition number than a version of kernel mean matching and other M-estimators, implying that the kernel least-squares method has preferable numerical properties.

Anomaly Detection via oversampling Principal Component Analysis

SohanLalYadav2304/Anomaly-detection-via-oversampling-principal-component-analysis • 11th IEEE International Conference on Intelligent Data Acquisition and Advanced Computing Systems 2013

Based on this idea, an over-sampling principal component analysis outlier detection method is proposed for emphasizing the influence of an abnormal instance (or an outlier).

A Framework for Clustering Uncertain Data

elki-project/elki • VLDB 2015

The challenges associated with handling uncertain data, in particular with querying and mining, are finding increasing attention in the research community.

