TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK	REMOVE
Weakly-Supervised Semantic Segmentation	Cityscapes test	CARB	mIoU	51.8	# 1
Weakly-Supervised Semantic Segmentation	Cityscapes val	CARB	mIoU	52.1	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/weakly-supervised-semantic-segmentation-for-1/weakly-supervised-semantic-segmentation-on-17)](https://paperswithcode.com/sota/weakly-supervised-semantic-segmentation-on-17?p=weakly-supervised-semantic-segmentation-for-1)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/weakly-supervised-semantic-segmentation-for-1/weakly-supervised-semantic-segmentation-on-16)](https://paperswithcode.com/sota/weakly-supervised-semantic-segmentation-on-16?p=weakly-supervised-semantic-segmentation-for-1)`

Weakly Supervised Semantic Segmentation for Driving Scenes

21 Dec 2023 · Dongseob Kim, Seungho Lee, Junsuk Choe, Hyunjung Shim ·

State-of-the-art techniques in weakly-supervised semantic segmentation (WSSS) using image-level labels exhibit severe performance degradation on driving scene datasets such as Cityscapes. To address this challenge, we develop a new WSSS framework tailored to driving scene datasets. Based on extensive analysis of dataset characteristics, we employ Contrastive Language-Image Pre-training (CLIP) as our baseline to obtain pseudo-masks. However, CLIP introduces two key challenges: (1) pseudo-masks from CLIP lack in representing small object classes, and (2) these masks contain notable noise. We propose solutions for each issue as follows. (1) We devise Global-Local View Training that seamlessly incorporates small-scale patches during model training, thereby enhancing the model's capability to handle small-sized yet critical objects in driving scenes (e.g., traffic light). (2) We introduce Consistency-Aware Region Balancing (CARB), a novel technique that discerns reliable and noisy regions through evaluating the consistency between CLIP masks and segmentation predictions. It prioritizes reliable pixels over noisy pixels via adaptive loss weighting. Notably, the proposed method achieves 51.8\% mIoU on the Cityscapes test dataset, showcasing its potential as a strong WSSS baseline on driving scene datasets. Experimental results on CamVid and WildDash2 demonstrate the effectiveness of our method across diverse datasets, even with small-scale datasets or visually challenging conditions. The code is available at https://github.com/k0u-id/CARB.

PDF Abstract

Code

Add Remove Mark official

k0u-id/carb official

Tasks

Add Remove

Semantic Segmentation

Weakly supervised Semantic Segmentation

Weakly-Supervised Semantic Segmentation

Datasets

MS COCO

Cityscapes

CamVid

Results from the Paper

Add Remove

Ranked #1 on Weakly-Supervised Semantic Segmentation on Cityscapes test

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Weakly-Supervised Semantic Segmentation	Cityscapes test	CARB	mIoU	51.8	# 1	Compare
Weakly-Supervised Semantic Segmentation	Cityscapes val	CARB	mIoU	52.1	# 1	Compare

Methods

Add Remove

Adaptive Loss • CLIP

Edit Social Preview

Weakly Supervised Semantic Segmentation for Driving Scenes

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit Add Remove

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Add Remove

Methods

Add Remove