Rethinking Atrous Convolution for Semantic Image Segmentation

17 Jun 2017  ·  Liang-Chieh Chen, George Papandreou, Florian Schroff, Hartwig Adam ·

In this work, we revisit atrous convolution, a powerful tool to explicitly adjust filter's field-of-view as well as control the resolution of feature responses computed by Deep Convolutional Neural Networks, in the application of semantic image segmentation. To handle the problem of segmenting objects at multiple scales, we design modules which employ atrous convolution in cascade or in parallel to capture multi-scale context by adopting multiple atrous rates. Furthermore, we propose to augment our previously proposed Atrous Spatial Pyramid Pooling module, which probes convolutional features at multiple scales, with image-level features encoding global context and further boost performance. We also elaborate on implementation details and share our experience on training our system. The proposed `DeepLabv3' system significantly improves over our previous DeepLab versions without DenseCRF post-processing and attains comparable performance with other state-of-art models on the PASCAL VOC 2012 semantic image segmentation benchmark.

PDF Abstract

Results from the Paper


Ranked #3 on Semantic Segmentation on PASCAL VOC 2012 test (using extra training data)

     Get a GitHub badge
Task Dataset Model Metric Name Metric Value Global Rank Uses Extra
Training Data
Result Benchmark
Semantic Segmentation Cityscapes test DeepLabv3 (ResNet-101, coarse) Mean IoU (class) 81.3% # 42
Semantic Segmentation Cityscapes val DeepLabv3 (Dilated-ResNet-101) mIoU 78.5% # 52
Dichotomous Image Segmentation DIS-TE1 DeeplabV3+ max F-Measure 0.601 # 18
weighted F-measure 0.506 # 17
MAE 0.102 # 16
S-Measure 0.694 # 20
E-measure 0.772 # 15
HCE 234 # 10
Dichotomous Image Segmentation DIS-TE2 DeeplabV3+ max F-Measure 0.681 # 18
weighted F-measure 0.587 # 17
MAE 0.105 # 16
S-Measure 0.729 # 20
E-measure 0.813 # 15
HCE 516 # 11
Dichotomous Image Segmentation DIS-TE3 DeeplabV3+ max F-Measure 0.717 # 18
weighted F-measure 0.623 # 17
MAE 0.102 # 17
S-Measure 0.749 # 17
E-measure 0.833 # 16
HCE 999 # 10
Dichotomous Image Segmentation DIS-TE4 DeeplabV3+ max F-Measure 0.715 # 17
weighted F-measure 0.621 # 17
MAE 0.111 # 17
S-Measure 0.744 # 16
E-measure 0.820 # 16
HCE 3709 # 13
Dichotomous Image Segmentation DIS-VD DeeplabV3+ max F-Measure 0.660 # 19
weighted F-measure 0.568 # 17
MAE 0.114 # 17
S-Measure 0.716 # 20
E-measure 0.796 # 15
HCE 1520 # 11
Semantic Segmentation PASCAL VOC 2012 test DeepLabv3-JFT Mean IoU 86.9% # 3
Semantic Segmentation PASCAL VOC 2012 val DeepLabv3-JFT mIoU 82.7% # 6
Semantic Segmentation SELMA DeepLabV3 mIoU 70.7 # 3

Methods