Semi-Supervised Semantic Segmentation Using Unreliable Pseudo-Labels

The crux of semi-supervised semantic segmentation is to assign adequate pseudo-labels to the pixels of unlabeled images. A common practice is to select the highly confident predictions as the pseudo ground-truth, but it leads to a problem that most pixels may be left unused due to their unreliability. We argue that every pixel matters to the model training, even its prediction is ambiguous. Intuitively, an unreliable prediction may get confused among the top classes (i.e., those with the highest probabilities), however, it should be confident about the pixel not belonging to the remaining classes. Hence, such a pixel can be convincingly treated as a negative sample to those most unlikely categories. Based on this insight, we develop an effective pipeline to make sufficient use of unlabeled data. Concretely, we separate reliable and unreliable pixels via the entropy of predictions, push each unreliable pixel to a category-wise queue that consists of negative samples, and manage to train the model with all candidate pixels. Considering the training evolution, where the prediction becomes more and more accurate, we adaptively adjust the threshold for the reliable-unreliable partition. Experimental results on various benchmarks and training settings demonstrate the superiority of our approach over the state-of-the-art alternatives.

PDF Abstract CVPR 2022 PDF CVPR 2022 Abstract

Datasets


Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Semi-Supervised Semantic Segmentation Cityscapes 12.5% labeled U2PL (DeepLab v3+ with ResNet-101 pretraind on ImageNet-1K, AEL) Validation mIoU 76.48% # 10
Semi-Supervised Semantic Segmentation Cityscapes 25% labeled U2PL (DeepLab v3+ with ResNet-101 pretraind on ImageNet-1K, AEL) Validation mIoU 78.51% # 7
Semi-Supervised Semantic Segmentation Cityscapes 50% labeled U2PL (DeepLab v3+ with ResNet-101 pretraind on ImageNet-1K, AEL) Validation mIoU 79.12% # 9
Semi-Supervised Semantic Segmentation Cityscapes 6.25% labeled U2PL (DeepLab v3+ with ResNet-101 pretraind on ImageNet-1K, AEL) Validation mIoU 74.90% # 7
Semi-Supervised Semantic Segmentation Pascal VOC 2012 12.5% labeled U2PL (DeepLab v3+ with ResNet-101 pretraind on ImageNet-1K, CutMix) Validation mIoU 79.01% # 5
Semi-Supervised Semantic Segmentation PASCAL VOC 2012 1464 labels U2PL (DeepLab v3+ with ResNet-101) Validation mIoU 79.5 # 6
Semi-Supervised Semantic Segmentation PASCAL VOC 2012 183 labeled U2PL (DeepLab v3+ with ResNet-101) Validation mIoU 69.2 # 6
Semi-Supervised Semantic Segmentation PASCAL VOC 2012 25% labeled U2PL (DeepLab v3+ with ResNet-101 pretraind on ImageNet-1K, CutMix) Validation mIoU 79.3 # 5
Semi-Supervised Semantic Segmentation PASCAL VOC 2012 366 labeled U2PL (DeepLab v3+ with ResNet-101) Validation mIoU 73.7 # 6
Semi-Supervised Semantic Segmentation PASCAL VOC 2012 50% U2PL (DeepLab v3+ with ResNet-101 pretraind on ImageNet-1K, CutMix) Validation mIoU 80.5% # 3
Semi-Supervised Semantic Segmentation Pascal VOC 2012 6.25% labeled U2PL (DeepLab v3+ with ResNet-101 pretraind on ImageNet-1K, CutMix) Validation mIoU 77.21 # 5
Semi-Supervised Semantic Segmentation PASCAL VOC 2012 732 labeled U2PL (DeepLab v3+ with ResNet-101) Validation mIoU 76.2 # 6
Semi-Supervised Semantic Segmentation PASCAL VOC 2012 92 labeled U2PL (DeepLab v3+ with ResNet-101) Validation mIoU 68.0 # 6

Methods


No methods listed for this paper. Add relevant methods here