LaRS: A Diverse Panoptic Maritime Obstacle Detection Dataset and Benchmark

ICCV 2023  ·  Lojze Žust, Janez Perš, Matej Kristan ·

The progress in maritime obstacle detection is hindered by the lack of a diverse dataset that adequately captures the complexity of general maritime environments. We present the first maritime panoptic obstacle detection benchmark LaRS, featuring scenes from Lakes, Rivers and Seas. Our major contribution is the new dataset, which boasts the largest diversity in recording locations, scene types, obstacle classes, and acquisition conditions among the related datasets. LaRS is composed of over 4000 per-pixel labeled key frames with nine preceding frames to allow utilization of the temporal texture, amounting to over 40k frames. Each key frame is annotated with 8 thing, 3 stuff classes and 19 global scene attributes. We report the results of 27 semantic and panoptic segmentation methods, along with several performance insights and future research directions. To enable objective evaluation, we have implemented an online evaluation server. The LaRS dataset, evaluation toolkit and benchmark are publicly available at: https://lojzezust.github.io/lars-dataset

PDF Abstract ICCV 2023 PDF ICCV 2023 Abstract

Datasets


Introduced in the Paper:

LaRS

Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Benchmark
Panoptic Segmentation LaRS Mask2Former (Swin-B) PQ 41.7 # 1
Video Semantic Segmentation LaRS CSANet (ResNet-101) F1 52.1 # 3
μ 63.7 # 3
mIoU 94.2 # 2
Q 49.1 # 3
Video Semantic Segmentation LaRS TMANet (ResNet-50) F1 61.1 # 2
μ 77.1 # 1
mIoU 94.1 # 3
Q 57.5 # 2
Panoptic Segmentation LaRS MaX-DeepLab PQ 31.9 # 8
Panoptic Segmentation LaRS Mask2Former (ResNet-101) PQ 37.2 # 6
Panoptic Segmentation LaRS Mask2Former (ResNet-50) PQ 37.6 # 5
Panoptic Segmentation LaRS Panoptic FPN (ResNet-101) PQ 38.7 # 4
Panoptic Segmentation LaRS Mask2Former (Swin-T) PQ 39.2 # 3
Panoptic Segmentation LaRS Panoptic FPN (ResNet-50) PQ 40.1 # 2
Semantic Segmentation LaRS UNet F1 15.4 # 20
μ 75.7 # 12
mIoU 90.1 # 18
Q 13.9 # 20
Semantic Segmentation LaRS BiSeNetv1 (ResNet-50) F1 42.8 # 19
μ 73.3 # 15
mIoU 92.2 # 17
Q 39.4 # 18
Semantic Segmentation LaRS IntCatchAI F1 44.9 # 18
μ 62.4 # 20
mIoU 45.6 # 20
Q 20.5 # 19
Semantic Segmentation LaRS WODIS (ResNet-101) F1 47.5 # 17
μ 63.0 # 19
mIoU 85.7 # 19
Q 40.7 # 17
Semantic Segmentation LaRS BiSeNetv2 F1 54.7 # 16
μ 73.9 # 14
mIoU 93.5 # 15
Q 51.2 # 16
Semantic Segmentation LaRS Segmenter (ViT-B) F1 55.2 # 15
μ 72.2 # 16
mIoU 95.1 # 10
Q 52.6 # 15
Semantic Segmentation LaRS FCN (ResNet-50) F1 57.9 # 14
μ 76.8 # 10
mIoU 92.6 # 16
Q 53.6 # 14
Semantic Segmentation LaRS WaSR (ResNet-101) F1 61.6 # 12
μ 71.0 # 17
mIoU 96.6 # 6
Q 59.5 # 11
Semantic Segmentation LaRS STDC1 F1 61.8 # 11
μ 75.6 # 13
mIoU 93.6 # 14
Q 57.8 # 12
Semantic Segmentation LaRS FCN (ResNet-101) F1 63.4 # 10
μ 77.4 # 9
mIoU 95.0 # 11
Q 60.2 # 10
Semantic Segmentation LaRS DeepLabv3+ (ResNet-101) F1 64.0 # 9
μ 77.8 # 6
mIoU 95.4 # 8
Q 61.0 # 8
Semantic Segmentation LaRS STDC2 F1 64.3 # 8
μ 76.5 # 11
mIoU 94.5 # 13
Q 60.8 # 9
Semantic Segmentation LaRS PointRend F1 65.4 # 7
μ 77.5 # 7
mIoU 94.9 # 12
Q 62.1 # 7
Semantic Segmentation LaRS DeepLabv3 (ResNet-101) F1 66.1 # 6
μ 77.5 # 7
mIoU 95.2 # 9
Q 62.9 # 6
Semantic Segmentation LaRS SegFormer (MiT-B2) F1 70.0 # 5
μ 78.6 # 4
mIoU 96.8 # 5
Q 67.8 # 5
Semantic Segmentation LaRS KNet (Swin-T) F1 73.4 # 4
μ 78.8 # 3
mIoU 97.2 # 3
Q 71.3 # 4
Video Semantic Segmentation LaRS WaSR-T (ResNet-101) F1 62.1 # 1
μ 71.1 # 2
mIoU 96.7 # 1
Q 60.1 # 1
Panoptic Segmentation LaRS Panoptic Deeplab (ResNet-50) PQ 34.7 # 7

Methods


No methods listed for this paper. Add relevant methods here