RDSNet: A New Deep Architecture for Reciprocal Object Detection and Instance Segmentation

11 Dec 2019  ·  Shaoru Wang, Yongchao Gong, Junliang Xing, Lichao Huang, Chang Huang, Weiming Hu ·

Object detection and instance segmentation are two fundamental computer vision tasks. They are closely correlated but their relations have not yet been fully explored in most previous work. This paper presents RDSNet, a novel deep architecture for reciprocal object detection and instance segmentation. To reciprocate these two tasks, we design a two-stream structure to learn features on both the object level (i.e., bounding boxes) and the pixel level (i.e., instance masks) jointly. Within this structure, information from the two streams is fused alternately, namely information on the object level introduces the awareness of instance and translation variance to the pixel level, and information on the pixel level refines the localization accuracy of objects on the object level in return. Specifically, a correlation module and a cropping module are proposed to yield instance masks, as well as a mask based boundary refinement module for more accurate bounding boxes. Extensive experimental analyses and comparisons on the COCO dataset demonstrate the effectiveness and efficiency of RDSNet. The source code is available at https://github.com/wangsr126/RDSNet.

PDF Abstract

Datasets


Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Instance Segmentation COCO test-dev RDSNet (data aug) mask AP 36.4% # 94
AP50 57.9% # 34
AP75 39.0% # 30
APS 16.4% # 35
APM 39.5% # 32
APL 51.6% # 27
Object Detection COCO test-dev RDSNet (ResNet-101, RetinaNet, mask, MBRM) box mAP 40.3 # 178
AP50 60.1 # 131
AP75 43 # 138
APS 22.1 # 121
APM 43.5 # 122
APL 51.5 # 126
Hardware Burden None # 1
Operations per network pass None # 1

Methods