Classification-Specific Parts for Improving Fine-Grained Visual Categorization

16 Sep 2019  ·  Dimitri Korsch, Paul Bodesheim, Joachim Denzler ·

Fine-grained visual categorization is a classification task for distinguishing categories with high intra-class and small inter-class variance. While global approaches aim at using the whole image for performing the classification, part-based solutions gather additional local information in terms of attentions or parts. We propose a novel classification-specific part estimation that uses an initial prediction as well as back-propagation of feature importance via gradient computations in order to estimate relevant image regions. The subsequently detected parts are then not only selected by a-posteriori classification knowledge, but also have an intrinsic spatial extent that is determined automatically. This is in contrast to most part-based approaches and even to available ground-truth part annotations, which only provide point coordinates and no additional scale information. We show in our experiments on various widely-used fine-grained datasets the effectiveness of the mentioned part selection method in conjunction with the extracted part features.

PDF Abstract
Task Dataset Model Metric Name Metric Value Global Rank Uses Extra
Training Data
Result Benchmark
Fine-Grained Image Classification CUB-200-2011 CS-Parts Accuracy 89.5% # 32
Image Classification Flowers-102 CS-Parts Accuracy 96.9% # 38
Fine-Grained Image Classification NABirds CS-Part Accuracy 88.5% # 14
Fine-Grained Image Classification NABirds CS-Parts Accuracy 88.5% # 14
Fine-Grained Image Classification Stanford Cars CS-Part Accuracy 92.5% # 65
Fine-Grained Image Classification Stanford Cars CS-Parts Accuracy 92.5% # 65

Methods


No methods listed for this paper. Add relevant methods here