Fine-Grained Visual Classification using Self Assessment Classifier

21 May 2022  ·  Tuong Do, Huy Tran, Erman Tjiputra, Quang D. Tran, Anh Nguyen ·

Extracting discriminative features plays a crucial role in the fine-grained visual classification task. Most of the existing methods focus on developing attention or augmentation mechanisms to achieve this goal. However, addressing the ambiguity in the top-k prediction classes is not fully investigated. In this paper, we introduce a Self Assessment Classifier, which simultaneously leverages the representation of the image and top-k prediction classes to reassess the classification results. Our method is inspired by continual learning with coarse-grained and fine-grained classifiers to increase the discrimination of features in the backbone and produce attention maps of informative areas on the image. In practice, our method works as an auxiliary branch and can be easily integrated into different architectures. We show that by effectively addressing the ambiguity in the top-k prediction classes, our method achieves new state-of-the-art results on CUB200-2011, Stanford Dog, and FGVC Aircraft datasets. Furthermore, our method also consistently improves the accuracy of different existing fine-grained classifiers with a unified setup.

PDF Abstract
Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Fine-Grained Image Classification CUB-200-2011 ViT-SAC Accuracy 91.8% # 8
Fine-Grained Image Classification FGVC Aircraft ViT-SAC Accuracy 93.1% # 26
Fine-Grained Image Classification Stanford Dogs WS_DAN-SAC Accuracy 93.1% # 4

Methods


No methods listed for this paper. Add relevant methods here