HRDA: Context-Aware High-Resolution Domain-Adaptive Semantic Segmentation

27 Apr 2022  ·  Lukas Hoyer, Dengxin Dai, Luc van Gool ·

Unsupervised domain adaptation (UDA) aims to adapt a model trained on the source domain (e.g. synthetic data) to the target domain (e.g. real-world data) without requiring further annotations on the target domain. This work focuses on UDA for semantic segmentation as real-world pixel-wise annotations are particularly expensive to acquire. As UDA methods for semantic segmentation are usually GPU memory intensive, most previous methods operate only on downscaled images. We question this design as low-resolution predictions often fail to preserve fine details. The alternative of training with random crops of high-resolution images alleviates this problem but falls short in capturing long-range, domain-robust context information. Therefore, we propose HRDA, a multi-resolution training approach for UDA, that combines the strengths of small high-resolution crops to preserve fine segmentation details and large low-resolution crops to capture long-range context dependencies with a learned scale attention, while maintaining a manageable GPU memory footprint. HRDA enables adapting small objects and preserving fine segmentation details. It significantly improves the state-of-the-art performance by 5.5 mIoU for GTA-to-Cityscapes and 4.9 mIoU for Synthia-to-Cityscapes, resulting in unprecedented 73.8 and 65.8 mIoU, respectively. The implementation is available at https://github.com/lhoyer/HRDA.

PDF Abstract
Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Domain Adaptation Cityscapes to ACDC HRDA mIoU 68.0 # 5
Semantic Segmentation Dark Zurich HRDA mIoU 55.9 # 5
Domain Adaptation GTA5 to Cityscapes HRDA mIoU 73.8 # 5
Image-to-Image Translation GTAV-to-Cityscapes Labels HRDA mIoU 73.8 # 3
Semantic Segmentation GTAV-to-Cityscapes Labels HRDA mIoU 73.8 # 3
Unsupervised Domain Adaptation GTAV-to-Cityscapes Labels HRDA mIoU 73.8 # 4
Synthetic-to-Real Translation GTAV-to-Cityscapes Labels HRDA mIoU 73.8 # 5
Synthetic-to-Real Translation SYNTHIA-to-Cityscapes HRDA MIoU (13 classes) 72.4 # 4
MIoU (16 classes) 65.8 # 5
Image-to-Image Translation SYNTHIA-to-Cityscapes HRDA mIoU (13 classes) 72.4 # 3
Domain Adaptation SYNTHIA-to-Cityscapes HRDA mIoU 65.8 # 6
Semantic Segmentation SYNTHIA-to-Cityscapes HRDA Mean IoU 65.8 # 3
Unsupervised Domain Adaptation SYNTHIA-to-Cityscapes HRDA mIoU (13 classes) 72.4 # 4
mIoU 65.8 # 4

Methods


No methods listed for this paper. Add relevant methods here