Pairwise Confusion for Fine-Grained Visual Classification

Fine-Grained Visual Classification (FGVC) datasets contain small sample sizes, along with significant intra-class variation and inter-class similarity. While prior work has addressed intra-class variation using localization and segmentation techniques, inter-class similarity may also affect feature learning and reduce classification performance. In this work, we address this problem using a novel optimization procedure for the end-to-end neural network training on FGVC tasks. Our procedure, called Pairwise Confusion (PC) reduces overfitting by intentionally {introducing confusion} in the activations. With PC regularization, we obtain state-of-the-art performance on six of the most widely-used FGVC datasets and demonstrate improved localization ability. {PC} is easy to implement, does not need excessive hyperparameter tuning during training, and does not add significant overhead during test time.

PDF Abstract ECCV 2018 PDF ECCV 2018 Abstract
Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Fine-Grained Image Classification CUB-200-2011 PC-DenseNet-161 Accuracy 86.87% # 58
Fine-Grained Image Classification CUB-200-2011 PC Accuracy 86.9 # 21
Fine-Grained Image Classification NABirds PC-DenseNet-161 Accuracy 82.79% # 20
Fine-Grained Image Classification Oxford 102 Flowers PC Bilinear CNN Accuracy 93.65% # 23
Fine-Grained Image Classification Stanford Cars PC-DenseNet-161 Accuracy 92.86% # 61
Fine-Grained Image Classification Stanford Dogs PC-DenseNet-161 Accuracy 83.75% # 16

Methods


No methods listed for this paper. Add relevant methods here