TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Video Instance Segmentation	OVIS validation	CrossVIS (ResNet-50, calibration)	mask AP	18.1	# 34
Video Instance Segmentation	OVIS validation	CrossVIS (ResNet-50, calibration)	AP50	35.5	# 33
Video Instance Segmentation	OVIS validation	CrossVIS (ResNet-50, calibration)	AP75	16.9	# 33
Video Instance Segmentation	OVIS validation	CrossVIS (ResNet-50)	mask AP	14.9	# 41
Video Instance Segmentation	OVIS validation	CrossVIS (ResNet-50)	AP50	32.7	# 39
Video Instance Segmentation	OVIS validation	CrossVIS (ResNet-50)	AP75	12.1	# 41
Video Instance Segmentation	YouTube-VIS validation	CrossVIS (ResNet-101)	mask AP	36.6	# 37
Video Instance Segmentation	YouTube-VIS validation	CrossVIS (ResNet-101)	AP50	57.3	# 34
Video Instance Segmentation	YouTube-VIS validation	CrossVIS (ResNet-101)	AP75	39.7	# 32
Video Instance Segmentation	YouTube-VIS validation	CrossVIS (ResNet-101)	AR1	36	# 32
Video Instance Segmentation	YouTube-VIS validation	CrossVIS (ResNet-101)	AR10	42	# 31

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/crossover-learning-for-fast-online-video/video-instance-segmentation-on-ovis-1)](https://paperswithcode.com/sota/video-instance-segmentation-on-ovis-1?p=crossover-learning-for-fast-online-video)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/crossover-learning-for-fast-online-video/video-instance-segmentation-on-youtube-vis-1)](https://paperswithcode.com/sota/video-instance-segmentation-on-youtube-vis-1?p=crossover-learning-for-fast-online-video)`

Crossover Learning for Fast Online Video Instance Segmentation

ICCV 2021 · Shusheng Yang, Yuxin Fang, Xinggang Wang, Yu Li, Chen Fang, Ying Shan, Bin Feng, Wenyu Liu

Modeling temporal visual context across frames is critical for video instance segmentation (VIS) and other video understanding tasks. In this paper, we propose a fast online VIS model named CrossVIS. For temporal information modeling in VIS, we present a novel crossover learning scheme that uses the instance feature in the current frame to localize the same instance pixel-wise in other frames. Unlike previous schemes, crossover learning does not require any additional network parameters for feature enhancement. By integrating with the instance segmentation loss, crossover learning enables efficient cross-frame instance-to-pixel relation learning and brings cost-free improvement during inference. In addition, a global balanced instance embedding branch is proposed for more accurate and more stable online instance association. We conduct extensive experiments on three challenging VIS benchmarks, i.e., YouTube-VIS-2019, OVIS, and YouTube-VIS-2021, to evaluate our methods. To our knowledge, CrossVIS achieves state-of-the-art performance among all online VIS methods and shows a decent trade-off between latency and accuracy. Code will be available to facilitate future research.
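The crossover idea in the abstract can be illustrated with a toy sketch. It is not the authors' implementation. It assumes that each instance is represented by a small per-frame filter vector, that a mask is predicted by taking the dot product of that filter with every pixel's feature vector, and that masks are supervised with a Dice loss. The names `apply_instance_filter`, `dice_loss`, and `crossover_loss`, and all of the toy data, are hypothetical.

```python
import math

def apply_instance_filter(instance_filter, frame_features):
    """Predict a soft mask: sigmoid of (filter . pixel feature) at every pixel."""
    return [
        [1.0 / (1.0 + math.exp(-sum(w * f for w, f in zip(instance_filter, pixel))))
         for pixel in row]
        for row in frame_features
    ]

def dice_loss(pred, target, eps=1e-6):
    """Dice loss between a soft mask and a binary mask (assumed loss choice)."""
    inter = sum(p * t for pr, tr in zip(pred, target) for p, t in zip(pr, tr))
    p_sq = sum(p * p for pr in pred for p in pr)
    t_sq = sum(t * t for tr in target for t in tr)
    return 1.0 - 2.0 * inter / (p_sq + t_sq + eps)

def crossover_loss(filter_t, feats_t, gt_t, filter_s, feats_s, gt_s):
    """Same-frame mask losses plus crossover terms for one instance in two frames."""
    # Standard terms: each frame's filter segments the instance in its own frame.
    same = (dice_loss(apply_instance_filter(filter_t, feats_t), gt_t)
            + dice_loss(apply_instance_filter(filter_s, feats_s), gt_s))
    # Crossover terms: the filter from one frame must localize the same
    # instance in the other frame. No extra parameters are introduced.
    cross = (dice_loss(apply_instance_filter(filter_t, feats_s), gt_s)
             + dice_loss(apply_instance_filter(filter_s, feats_t), gt_t))
    return same + cross

# Toy 2x2 frames with 2-channel features; the instance moves from the
# top-left pixel (frame t) to the bottom-right pixel (frame s).
feats_t = [[[4.0, 0.0], [-4.0, 0.0]], [[-4.0, 0.0], [-4.0, 0.0]]]
gt_t = [[1, 0], [0, 0]]
feats_s = [[[-4.0, 0.0], [-4.0, 0.0]], [[-4.0, 0.0], [4.0, 0.0]]]
gt_s = [[0, 0], [0, 1]]

# Filters that respond to the instance's channel transfer across frames.
good = crossover_loss([2.0, 0.0], feats_t, gt_t, [1.5, 0.0], feats_s, gt_s)
# Filters with the opposite sign fail in both frames.
bad = crossover_loss([-2.0, 0.0], feats_t, gt_t, [-1.5, 0.0], feats_s, gt_s)
print(good, bad)
```

In this sketch the crossover terms reuse the existing filters and features, which is one way to read the abstract's statement that the scheme adds no network parameters and no inference cost: the extra terms exist only in the training loss.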

Code

hustvl/CrossVIS official

Tasks

Instance Segmentation

Semantic Segmentation

Video Instance Segmentation

Video Understanding

Datasets

MS COCO

YouTube-VIS 2019

OVIS

YouTube-VIS 2021

Results from the Paper

Ranked #34 on Video Instance Segmentation on OVIS validation
