…There are two common metrics: Detection AUROC and Segmentation (or pixelwise) AUROC Detection (or, classification) methods output single float (anomaly score) per input test image. Segmentation methods output anomaly probability for each pixel. "To assess segmentation performance, we evaluate the relative per-region overlap of the segmentation with the ground truth. We define the true positive rate as the percentage of pixels that were correctly classified as anomalous" [1] Later segmentation metric was improved to balance regions with small and large area, see PRO-AUC
287 PAPERS • 4 BENCHMARKS