FarSeg++: Foreground-Aware Relation Network for Geospatial Object Segmentation in High Spatial Resolution Remote Sensing Imagery

Geospatial object segmentation, a fundamental Earth vision task, always suffers from scale variation, the larger intraclass variance of background, and foreground-background imbalance in high spatial resolution (HSR) remote sensing imagery. Generic semantic segmentation methods mainly focus on the scale variation in natural scenarios. However, the other two problems are insufficiently considered in large area Earth observation scenarios. In this paper, we propose a foreground-aware relation network (FarSeg++) from the perspectives of relation-based, optimizationbased, and objectness-based foreground modeling, alleviating the above two problems. From the perspective of the relations, the foreground-scene relation module improves the discrimination of the foreground features via the foreground-correlated contexts associated with the object-scene relation. From the perspective of optimization, foreground-aware optimization is proposed to focus on foreground examples and hard examples of the background during training to achieve a balanced optimization. Besides, from the perspective of objectness, a foreground-aware decoder is proposed to improve the objectness representation, alleviating the objectness prediction problem that is the main bottleneck revealed by an empirical upper bound analysis. We also introduce a new large-scale high-resolution urban vehicle segmentation dataset to verify the effectiveness of the proposed method and push the development of objectness prediction further forward. The experimental results suggest that FarSeg++ is superior to the state-of-the-art generic semantic segmentation methods and can achieve a better trade-off between speed and accuracy.

PDF Abstract

Datasets


Task Dataset Model Metric Name Metric Value Global Rank Benchmark
Semantic Segmentation iSAID FarSeg++@Swin-T mIoU 66.3 # 11
Semantic Segmentation iSAID FarSeg++@MiT-B2 mIoU 67.9 # 7
Semantic Segmentation iSAID FarSeg++@ResNet-50 mIoU 67.6 # 8
The Semantic Segmentation Of Remote Sensing Imagery UV6K FarSeg++@Swin-T IoU (%) 65.8 # 2
The Semantic Segmentation Of Remote Sensing Imagery UV6K FarSeg++@MiT-B2 IoU (%) 66.4 # 1
The Semantic Segmentation Of Remote Sensing Imagery UV6K FarSeg++@ResNet-50 IoU (%) 64.4 # 3

Methods