Context-Aware Mixup for Domain Adaptive Semantic Segmentation

Unsupervised domain adaptation (UDA) aims to adapt a model trained on a labeled source domain to an unlabeled target domain. Existing UDA approaches to semantic segmentation typically reduce the domain shift at the pixel, feature, and output levels. However, almost all of them neglect contextual dependency, which is generally shared across domains, and this leads to suboptimal performance. In this paper, we propose a novel Context-Aware Mixup (CAMix) framework for domain adaptive semantic segmentation, which exploits context dependency as explicit prior knowledge in a fully end-to-end trainable manner to improve adaptability to the target domain. First, we present a contextual mask generation strategy that leverages accumulated spatial distributions and prior contextual relationships. The generated contextual mask is central to this work and guides the context-aware domain mixup at three different levels. In addition, given this context knowledge, we introduce a significance-reweighted consistency loss that penalizes inconsistency between the mixed student prediction and the mixed teacher prediction, which alleviates negative transfer during adaptation, e.g., early performance degradation. Extensive experiments and analysis demonstrate the effectiveness of our method against state-of-the-art approaches on widely used UDA benchmarks.
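
The two ingredients named in the abstract, mask-guided mixing of the two domains and a reweighted student/teacher consistency term, can be illustrated with a minimal NumPy sketch. This is not the CAMix implementation: the abstract does not specify how the contextual mask or the significance weights are computed, so both are treated here as given inputs. The function names `mix_with_mask` and `reweighted_consistency_loss`, the binary mask, and the per-pixel `weights` array are all hypothetical stand-ins chosen for illustration.

```python
import numpy as np


def mix_with_mask(source, target, mask):
    """Paste source content onto the target wherever mask == 1.

    source, target: arrays of shape (H, W) or (H, W, C).
    mask: binary array of shape (H, W). It is a given input here and only
    stands in for the contextual mask (assumption, not the paper's method).
    """
    if source.ndim == 3:
        mask = mask[..., None]  # broadcast the mask over the last axis
    return mask * source + (1 - mask) * target


def reweighted_consistency_loss(student_probs, teacher_probs, weights):
    """Weighted per-pixel cross-entropy of the student against the teacher.

    student_probs, teacher_probs: (H, W, K) class probabilities.
    weights: (H, W) non-negative per-pixel weights, a hypothetical stand-in
    for the significance weights mentioned in the abstract.
    """
    eps = 1e-8
    ce = -np.sum(teacher_probs * np.log(student_probs + eps), axis=-1)
    return float(np.sum(weights * ce) / (np.sum(weights) + eps))


# Toy example: the top half of the mixed image comes from the source domain.
rng = np.random.default_rng(0)
src_img = rng.random((4, 4, 3))
tgt_img = rng.random((4, 4, 3))
mask = np.zeros((4, 4))
mask[:2] = 1
mixed_img = mix_with_mask(src_img, tgt_img, mask)

# The same mask mixes predictions, so the loss compares mixed with mixed.
teacher_mixed = np.eye(3)[rng.integers(0, 3, size=(4, 4))]  # one-hot, (4, 4, 3)
student_mixed = np.full((4, 4, 3), 1.0 / 3.0)               # uniform guess
loss = reweighted_consistency_loss(student_mixed, teacher_mixed, np.ones((4, 4)))
```

In this sketch a uniform student prediction against a one-hot teacher gives a loss close to log 3, and setting a pixel's weight to zero removes it from the average, which is the general mechanism by which a reweighting scheme can down-weight unreliable regions.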

| Task | Dataset | Model | Metric Name | Metric Value | Global Rank |
|---|---|---|---|---|---|
| Unsupervised Domain Adaptation | GTAV-to-Cityscapes Labels | CAMix (w Deeplabv2 ResNet 101) | mIoU | 55.2 | # 16 |
| Image-to-Image Translation | GTAV-to-Cityscapes Labels | CAMix (w DAFormer) | mIoU | 70.0 | # 6 |
| Image-to-Image Translation | GTAV-to-Cityscapes Labels | CAMix (w Deeplabv2 ResNet 101) | mIoU | 55.2 | # 16 |
| Synthetic-to-Real Translation | GTAV-to-Cityscapes Labels | CAMix (w Deeplabv2 ResNet101) | mIoU | 55.2 | # 27 |
| Synthetic-to-Real Translation | GTAV-to-Cityscapes Labels | CAMix (w DAFormer) | mIoU | 70.0 | # 9 |
| Unsupervised Domain Adaptation | GTAV-to-Cityscapes Labels | CAMix (w DAFormer) | mIoU | 70.0 | # 8 |
| Synthetic-to-Real Translation | SYNTHIA-to-Cityscapes | CAMix (ResNet 101) | MIoU (13 classes) | 59.7 | # 17 |
| Synthetic-to-Real Translation | SYNTHIA-to-Cityscapes | CAMix (w DAFormer) | MIoU (13 classes) | 69.2 | # 6 |
| Image-to-Image Translation | SYNTHIA-to-Cityscapes | CAMix (w Deeplabv2 ResNet 101) | mIoU (13 classes) | 59.7 | # 12 |
| Image-to-Image Translation | SYNTHIA-to-Cityscapes | CAMix (w DAFormer) | mIoU (13 classes) | 69.2 | # 5 |

Methods