MUM : Mix Image Tiles and UnMix Feature Tiles for Semi-Supervised Object Detection

22 Nov 2021  ·  Jongmok Kim, Jooyoung Jang, Seunghyeon Seo, Jisoo Jeong, Jongkeun Na, Nojun Kwak ·

Many recent semi-supervised learning (SSL) studies build teacher-student architecture and train the student network by the generated supervisory signal from the teacher. Data augmentation strategy plays a significant role in the SSL framework since it is hard to create a weak-strong augmented input pair without losing label information. Especially when extending SSL to semi-supervised object detection (SSOD), many strong augmentation methodologies related to image geometry and interpolation-regularization are hard to utilize since they possibly hurt the location information of the bounding box in the object detection task. To address this, we introduce a simple yet effective data augmentation method, Mix/UnMix (MUM), which unmixes feature tiles for the mixed image tiles for the SSOD framework. Our proposed method makes mixed input image tiles and reconstructs them in the feature space. Thus, MUM can enjoy the interpolation-regularization effect from non-interpolated pseudo-labels and successfully generate a meaningful weak-strong pair. Furthermore, MUM can be easily equipped on top of various SSOD methods. Extensive experiments on MS-COCO and PASCAL VOC datasets demonstrate the superiority of MUM by consistently improving the mAP performance over the baseline in all the tested SSOD benchmark protocols.

PDF Abstract
Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Semi-Supervised Object Detection COCO 0.5% labeled data MUM mAP 18.54 # 4
Semi-Supervised Object Detection COCO 100% labeled data MUM mAP 42.11 # 10
Semi-Supervised Object Detection COCO 10% labeled data MUM mAP 31.87 # 21
Semi-Supervised Object Detection COCO 1% labeled data MUM mAP 21.88 # 13
Semi-Supervised Object Detection COCO 2% labeled data MUM mAP 24.84 # 12
Semi-Supervised Object Detection COCO 5% labeled data MUM mAP 28.52 # 18

Methods


No methods listed for this paper. Add relevant methods here