VT-ADL: A Vision Transformer Network for Image Anomaly Detection and Localization

20 Apr 2021  ·  Pankaj Mishra, Riccardo Verk, Daniele Fornasier, Claudio Piciarelli, Gian Luca Foresti ·

We present a transformer-based image anomaly detection and localization network. Our proposed model is a combination of a reconstruction-based approach and patch embedding. The use of transformer networks helps to preserve the spatial information of the embedded patches, which are later processed by a Gaussian mixture density network to localize the anomalous areas. In addition, we also publish BTAD, a real-world industrial anomaly dataset. Our results are compared with other state-of-the-art algorithms using publicly available datasets like MNIST and MVTec.

PDF Abstract

Datasets


Introduced in the Paper:

BTAD

Results from the Paper


Ranked #9 on Anomaly Detection on BTAD (using extra training data)

     Get a GitHub badge
Task Dataset Model Metric Name Metric Value Global Rank Uses Extra
Training Data
Result Benchmark
Anomaly Detection BTAD VT-ADL Segmentation AUROC 81.8 # 9

Methods


No methods listed for this paper. Add relevant methods here