TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Video Instance Segmentation	YouTube-VIS validation	MaskTrack R-CNN (ResNet-50, single-scale training and test)	mask AP	30.3	# 49
Video Instance Segmentation	YouTube-VIS validation	MaskTrack R-CNN (ResNet-50, single-scale training and test)	AP50	51.1	# 45
Video Instance Segmentation	YouTube-VIS validation	MaskTrack R-CNN (ResNet-50, single-scale training and test)	AP75	32.6	# 47
Video Instance Segmentation	YouTube-VIS validation	MaskTrack R-CNN (ResNet-50, single-scale training and test)	AR1	31	# 39
Video Instance Segmentation	YouTube-VIS validation	MaskTrack R-CNN (ResNet-50, single-scale training and test)	AR10	35.5	# 39

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/video-instance-segmentation/video-instance-segmentation-on-youtube-vis-1)](https://paperswithcode.com/sota/video-instance-segmentation-on-youtube-vis-1?p=video-instance-segmentation)`

Video Instance Segmentation

ICCV 2019 · Linjie Yang, Yuchen Fan, Ning Xu ·

In this paper we present a new computer vision task, named video instance segmentation. The goal of this new task is simultaneous detection, segmentation and tracking of instances in videos. In words, it is the first time that the image instance segmentation problem is extended to the video domain. To facilitate research on this new task, we propose a large-scale benchmark called YouTube-VIS, which consists of 2883 high-resolution YouTube videos, a 40-category label set and 131k high-quality instance masks. In addition, we propose a novel algorithm called MaskTrack R-CNN for this task. Our new method introduces a new tracking branch to Mask R-CNN to jointly perform the detection, segmentation and tracking tasks simultaneously. Finally, we evaluate the proposed method and several strong baselines on our new dataset. Experimental results clearly demonstrate the advantages of the proposed algorithm and reveal insight for future improvement. We believe the video instance segmentation task will motivate the community along the line of research for video understanding.

PDF Abstract ICCV 2019 PDF ICCV 2019 Abstract

Code

Add Remove Mark official

Epiphqny/VisTR official

734

JonathonLuiten/TrackEval

878

JonathonLuiten/HOTA-metrics

877

youtubevos/MaskTrackRCNN

433

bbednarski9/MaskTrackRCNN_DeepSort

Tasks

Add Remove

Instance Segmentation

Segmentation

Semantic Segmentation

Video Instance Segmentation

Video Understanding

Datasets

Introduced in the Paper:

YouTube-VIS 2019 YouTube-VIS 2021

Youtube-VIS 2022 Validation

Used in the Paper:

MS COCO

YouTube-VOS 2018

Results from the Paper

Edit

Ranked #49 on Video Instance Segmentation on YouTube-VIS validation

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Video Instance Segmentation	YouTube-VIS validation	MaskTrack R-CNN (ResNet-50, single-scale training and test)	mask AP	30.3	# 49	Compare
			AP50	51.1	# 45	Compare
			AP75	32.6	# 47	Compare
			AR1	31	# 39	Compare
			AR10	35.5	# 39	Compare

Methods

Add Remove

Convolution • Mask R-CNN • RoIAlign • RPN • Softmax

Edit Social Preview

Video Instance Segmentation

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove