Hierarchical Shot Detector

Single shot detector simultaneously predicts object categories and regression offsets of the default boxes. Despite of high efficiency, this structure has some inappropriate designs: (1) The classification result of the default box is improperly assigned to that of the regressed box during inference, (2) Only regression once is not good enough for accurate object detection. To solve the first problem, a novel reg-offset-cls (ROC) module is proposed. It contains three hierarchical steps: box regression, the feature sampling location predication, and the regressed box classification with the features of offset locations. To further solve the second problem, a hierarchical shot detector (HSD) is proposed, which stacks two ROC modules and one feature enhanced module. The second ROC treats the regressed boxes and the feature sampling locations of features in the first ROC as the inputs. Meanwhile, the feature enhanced module injected between two ROCs aims to extract the local and non-local context. Experiments on the MS COCO and PASCAL VOC datasets demonstrate the superiority of proposed HSD. Without the bells or whistles, HSD outperforms all one-stage methods at real-time speed.

PDF Abstract

Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Benchmark
Object Detection COCO test-dev HSD (Rest101, 768x768, single-scale test) box mAP 42.3 # 171
AP50 61.2 # 120
AP75 46.9 # 102
APS 22.8 # 114
APM 47.3 # 83
APL 55.9 # 94
Hardware Burden None # 1
Operations per network pass None # 1
Object Detection PASCAL VOC 2007 HSD (VGG16, 320x320, single-scale test) MAP 81.7% # 7
Object Detection PASCAL VOC 2007 HSD (VGG16, 512x512, single-scale test) MAP 83.0% # 4

Methods


No methods listed for this paper. Add relevant methods here