Approaches Toward Physical and General Video Anomaly Detection

14 Dec 2021 · Laura Kart, Niv Cohen

In recent years, many works have addressed the problem of finding never-seen-before anomalies in videos. Yet most of this work has focused on detecting anomalous frames in surveillance videos taken from security cameras. Meanwhile, the task of anomaly detection (AD) in videos exhibiting anomalous mechanical behavior has been mostly overlooked. Anomaly detection in such videos is of both academic and practical interest, as it may enable automatic detection of malfunctions in many manufacturing, maintenance, and real-life settings. To assess the potential of different approaches to detect such anomalies, we evaluate two simple baseline approaches: (i) temporally pooled image AD techniques, and (ii) density estimation of videos represented with features pretrained for video classification. Developing such methods calls for new benchmarks that allow the evaluation of different possible approaches. We introduce the Physical Anomalous Trajectory or Motion (PHANTOM) dataset, which contains six different video classes. Each class consists of normal and anomalous videos. The classes differ in the presented phenomena, the variability of the normal class, and the kind of anomalies in the videos. We also suggest an even harder benchmark in which anomalous activities must be spotted in highly variable scenes.
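The first baseline named in the abstract, temporally pooled image features scored against normal videos, can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration: mean pooling over frames, Euclidean distance, `k=2`, and the hypothetical helper `fake_video`, which substitutes random vectors for the per-frame features of a pretrained image encoder. It is not the authors' implementation.

```python
import math
import random

def temporal_pool(frame_features):
    """Average per-frame feature vectors (T lists of length D) into one vector of length D."""
    n_frames = len(frame_features)
    return [sum(column) / n_frames for column in zip(*frame_features)]

def knn_anomaly_score(train_vectors, query, k=2):
    """Mean Euclidean distance from `query` to its k nearest normal training vectors.

    A larger score means the video lies farther from the normal training set.
    """
    distances = sorted(math.dist(query, t) for t in train_vectors)
    k = min(k, len(distances))
    return sum(distances[:k]) / k

def fake_video(rng, mean, n_frames=8, dim=16):
    """Hypothetical stand-in for per-frame features of one video (not real encoder output)."""
    return [[rng.gauss(mean, 1.0) for _ in range(dim)] for _ in range(n_frames)]

rng = random.Random(0)

# The training set contains only normal videos, each pooled to a single vector.
train_vectors = [temporal_pool(fake_video(rng, 0.0)) for _ in range(20)]

# Score one normal-looking and one shifted (anomalous-looking) test video.
normal_score = knn_anomaly_score(train_vectors, temporal_pool(fake_video(rng, 0.0)))
anomalous_score = knn_anomaly_score(train_vectors, temporal_pool(fake_video(rng, 3.0)))
print(normal_score, anomalous_score)
```

The second baseline would, under the same assumptions, reuse `knn_anomaly_score` unchanged on one feature vector per video taken from a video-classification model, skipping the pooling step.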


Datasets


Introduced in the Paper:

PHANTOM

Used in the Paper:

Something-Something V2

Results from the Paper


 Ranked #1 on Physical Video Anomaly Detection on PHANTOM (using extra training data)

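Task                                   | Dataset                | Model                    | Metric Name  | Metric Value | Global Rank
Physical Video Anomaly Detection       | PHANTOM                | Pooled Image Level kNN   | Avg. ROC-AUC | 0.78         | # 1
Physical Video Anomaly Detection       | PHANTOM                | Pooled Image Level kNN   | Architecture | ViT          | # 1
Physical Video Anomaly Detection       | PHANTOM                | Video Level features kNN | Avg. ROC-AUC | 0.76         | # 2
Physical Video Anomaly Detection       | PHANTOM                | Video Level features kNN | Architecture | TimeSformer  | # 1
General Action Video Anomaly Detection | Something-Something V2 | Video Level features kNN | Avg. ROC-AUC | 0.52         | # 2
General Action Video Anomaly Detection | Something-Something V2 | Video Level features kNN | Architecture | TimeSformer  | # 1
General Action Video Anomaly Detection | Something-Something V2 | Pooled Image Level kNN   | Avg. ROC-AUC | 0.58         | # 1
General Action Video Anomaly Detection | Something-Something V2 | Pooled Image Level kNN   | Architecture | ViT          | # 1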
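
The "Avg. ROC-AUC" metric reported above can be illustrated with a small sketch. It assumes that the average is an unweighted mean of per-class ROC-AUC values and that a higher anomaly score means "more anomalous"; the listing itself does not state either point. The class names and scores in `toy_results` are made up for illustration and are not results from the paper.

```python
def roc_auc(scores, labels):
    """ROC-AUC as the fraction of (anomalous, normal) pairs ranked correctly.

    labels: 1 for anomalous videos, 0 for normal ones. Ties count as half a win.
    """
    anomalous = [s for s, y in zip(scores, labels) if y == 1]
    normal = [s for s, y in zip(scores, labels) if y == 0]
    if not anomalous or not normal:
        raise ValueError("need at least one normal and one anomalous video")
    wins = sum(
        1.0 if a > n else 0.5 if a == n else 0.0
        for a in anomalous
        for n in normal
    )
    return wins / (len(anomalous) * len(normal))

def average_roc_auc(per_class):
    """Unweighted mean of per-class ROC-AUC (the averaging scheme is an assumption)."""
    aucs = [roc_auc(scores, labels) for scores, labels in per_class.values()]
    return sum(aucs) / len(aucs)

# Made-up anomaly scores and labels for two hypothetical classes.
toy_results = {
    "class_a": ([0.1, 0.2, 0.9, 0.8], [0, 0, 1, 1]),
    "class_b": ([0.5, 0.4, 0.3, 0.6], [0, 1, 0, 1]),
}
print(average_roc_auc(toy_results))
```

A value of 0.5 corresponds to chance-level ranking, which gives context for the 0.52 and 0.58 figures on Something-Something V2.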
