ICNet for Real-Time Semantic Segmentation on High-Resolution Images

We focus on the challenging task of real-time semantic segmentation in this paper. It finds many practical applications and yet is with fundamental difficulty of reducing a large portion of computation for pixel-wise label inference. We propose an image cascade network (ICNet) that incorporates multi-resolution branches under proper label guidance to address this challenge. We provide in-depth analysis of our framework and introduce the cascade feature fusion unit to quickly achieve high-quality segmentation. Our system yields real-time inference on a single GPU card with decent quality results evaluated on challenging datasets like Cityscapes, CamVid and COCO-Stuff.

PDF Abstract ECCV 2018 PDF ECCV 2018 Abstract
Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Real-Time Semantic Segmentation CamVid ICNet mIoU 67.1% # 21
Time (ms) 36 # 12
Frame (fps) 27.8 # 12
Semantic Segmentation Cityscapes test ICNet Mean IoU (class) 70.6% # 77
Real-Time Semantic Segmentation Cityscapes test ICNet mIoU 70.6% # 29
Time (ms) 33 # 18
Frame (fps) 30.3 # 19
Dichotomous Image Segmentation DIS-TE1 ICNet max F-Measure 0.631 # 15
weighted F-measure 0.535 # 14
MAE 0.095 # 13
S-Measure 0.716 # 15
E-measure 0.784 # 13
HCE 234 # 10
Dichotomous Image Segmentation DIS-TE2 ICNet max F-Measure 0.716 # 13
weighted F-measure 0.627 # 12
MAE 0.095 # 11
S-Measure 0.759 # 12
E-measure 0.826 # 12
HCE 512 # 10
Dichotomous Image Segmentation DIS-TE3 ICNet max F-Measure 0.752 # 10
weighted F-measure 0.664 # 10
MAE 0.091 # 11
S-Measure 0.780 # 9
E-measure 0.852 # 12
HCE 1001 # 11
Dichotomous Image Segmentation DIS-TE4 ICNet max F-Measure 0.749 # 10
weighted F-measure 0.663 # 10
MAE 0.099 # 10
S-Measure 0.776 # 9
E-measure 0.837 # 12
HCE 3690 # 12
Dichotomous Image Segmentation DIS-VD ICNet max F-Measure 0.697 # 10
weighted F-measure 0.609 # 10
MAE 0.102 # 9
S-Measure 0.747 # 10
E-measure 0.811 # 10
HCE 1503 # 10
Semantic Segmentation Trans10K ICNet mIoU 23.39% # 15
GFLOPs 10.64 # 2

Methods