Perturbed and Strict Mean Teachers for Semi-supervised Semantic Segmentation

Consistency learning using input image, feature, or network perturbations has shown remarkable results in semi-supervised semantic segmentation, but this approach can be seriously affected by inaccurate predictions of unlabelled training images. There are two consequences of these inaccurate predictions: 1) the training based on the "strict" cross-entropy (CE) loss can easily overfit prediction mistakes, leading to confirmation bias; and 2) the perturbations applied to these inaccurate predictions will use potentially erroneous predictions as training signals, degrading consistency learning. In this paper, we address the prediction accuracy problem of consistency learning methods with novel extensions of the mean-teacher (MT) model, which include a new auxiliary teacher, and the replacement of MT's mean square error (MSE) by a stricter confidence-weighted cross-entropy (Conf-CE) loss. The accurate prediction by this model allows us to use a challenging combination of network, input data and feature perturbations to improve the consistency learning generalisation, where the feature perturbations consist of a new adversarial perturbation. Results on public benchmarks show that our approach achieves remarkable improvements over the previous SOTA methods in the field. Our code is available at https://github.com/yyliu01/PS-MT.

PDF Abstract CVPR 2022 PDF CVPR 2022 Abstract

Datasets


Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Semi-Supervised Semantic Segmentation Cityscapes 12.5% labeled PS-MT (DeepLab v3+ with ImageNet-pretrained ResNet50, single scale inference) Validation mIoU 77.12% # 8
Semi-Supervised Semantic Segmentation Cityscapes 25% labeled PS-MT (DeepLab v3+ with ImageNet-pretrained ResNet-50, single scale inference) Validation mIoU 78.38% # 10
Semi-Supervised Semantic Segmentation Cityscapes 50% labeled PS-MT (DeepLab v3+ with ImageNet-pretrained ResNet-50, single scale inference) Validation mIoU 79.22% # 8
Semi-Supervised Semantic Segmentation Pascal VOC 2012 12.5% labeled PS-MT Validation mIoU 78.20% # 6
Validation mIoU 75.70% # 12
Semi-Supervised Semantic Segmentation PASCAL VOC 2012 1464 labels PS-MT (DeepLab v3+ with ImageNet-pretrained ResNet-101, single scale inference) Validation mIoU 80.01 # 5
Semi-Supervised Semantic Segmentation PASCAL VOC 2012 1464 labels PS-MT (DeepLab v3+ with ImageNet-pretrained ResNet-50, single scale inference) Validation mIoU 78.08 # 7
Semi-Supervised Semantic Segmentation PASCAL VOC 2012 25% labeled PS-MT (DeepLab v3+ with ImageNet-pretrained ResNet-101, single scale inference) Validation mIoU 78.72 # 7
Semi-Supervised Semantic Segmentation PASCAL VOC 2012 50% PS-MT (DeepLab v3+ with ImageNet-pretrained ResNet-101, single scale inference) Validation mIoU 79.76% # 6

Methods


No methods listed for this paper. Add relevant methods here