TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Semantic Segmentation	CamVid	DeepLabV3Plus + SDCNetAug	Mean IoU	81.7%	# 4
Semantic Segmentation	KITTI Semantic Segmentation	DeepLabV3Plus + SDCNetAug	Mean IoU (class)	72.83	# 2
Semantic Segmentation	KITTI Semantic Segmentation	DeepLabV3Plus + SDCNetAug	class iIoU	48.68	# 1
Semantic Segmentation	KITTI Semantic Segmentation	DeepLabV3Plus + SDCNetAug	Category IoU	88.99	# 1
Semantic Segmentation	KITTI Semantic Segmentation	DeepLabV3Plus + SDCNetAug	Category iIoU	75.26	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/improving-semantic-segmentation-via-video/semantic-segmentation-on-kitti-semantic)](https://paperswithcode.com/sota/semantic-segmentation-on-kitti-semantic?p=improving-semantic-segmentation-via-video)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/improving-semantic-segmentation-via-video/semantic-segmentation-on-camvid)](https://paperswithcode.com/sota/semantic-segmentation-on-camvid?p=improving-semantic-segmentation-via-video)`

Improving Semantic Segmentation via Video Propagation and Label Relaxation

CVPR 2019 · Yi Zhu, Karan Sapra, Fitsum A. Reda, Kevin J. Shih, Shawn Newsam, Andrew Tao, Bryan Catanzaro ·

Semantic segmentation requires large amounts of pixel-wise annotations to learn accurate models. In this paper, we present a video prediction-based methodology to scale up training sets by synthesizing new training samples in order to improve the accuracy of semantic segmentation networks. We exploit video prediction models' ability to predict future frames in order to also predict future labels. A joint propagation strategy is also proposed to alleviate mis-alignments in synthesized samples. We demonstrate that training segmentation models on datasets augmented by the synthesized samples leads to significant improvements in accuracy. Furthermore, we introduce a novel boundary label relaxation technique that makes training robust to annotation noise and propagation artifacts along object boundaries. Our proposed methods achieve state-of-the-art mIoUs of 83.5% on Cityscapes and 82.9% on CamVid. Our single model, without model ensembles, achieves 72.8% mIoU on the KITTI semantic segmentation test set, which surpasses the winning entry of the ROB challenge 2018. Our code and videos can be found at https://nv-adlr.github.io/publication/2018-Segmentation.

PDF Abstract CVPR 2019 PDF CVPR 2019 Abstract

Code

Add Remove Mark official

NVIDIA/semantic-segmentation

1,751

YeLyuUT/SSeg

ganlumomo/mtl-segmentation

ganlumomo/semantic-segmentation

tobiasriedlinger/uncertainty-gradie…

Tasks

Add Remove

Segmentation

Semantic Segmentation

Video Propagation

Datasets

KITTI

CamVid

PASCAL VOC

Results from the Paper

Edit

Ranked #2 on Semantic Segmentation on KITTI Semantic Segmentation (using extra training data)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Semantic Segmentation	CamVid	DeepLabV3Plus + SDCNetAug	Mean IoU	81.7%	# 4	Compare
Semantic Segmentation	KITTI Semantic Segmentation	DeepLabV3Plus + SDCNetAug	Mean IoU (class)	72.83	# 2	Compare
			class iIoU	48.68	# 1	Compare
			Category IoU	88.99	# 1	Compare
			Category iIoU	75.26	# 1	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Improving Semantic Segmentation via Video Propagation and Label Relaxation

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove