TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Unsupervised Video Object Segmentation	DAVIS 2016 val	MATNet	G	81.6	# 17
Unsupervised Video Object Segmentation	DAVIS 2016 val	MATNet	J	82.4	# 17
Unsupervised Video Object Segmentation	DAVIS 2016 val	MATNet	F	80.7	# 16
Unsupervised Video Object Segmentation	FBMS test	MATNet	J	76.1	# 9
Unsupervised Video Object Segmentation	YouTube-Objects	MATNet	J	69.0	# 10

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/motion-attentive-transition-for-zero-shot/unsupervised-video-object-segmentation-on-11)](https://paperswithcode.com/sota/unsupervised-video-object-segmentation-on-11?p=motion-attentive-transition-for-zero-shot)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/motion-attentive-transition-for-zero-shot/unsupervised-video-object-segmentation-on-12)](https://paperswithcode.com/sota/unsupervised-video-object-segmentation-on-12?p=motion-attentive-transition-for-zero-shot)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/motion-attentive-transition-for-zero-shot/unsupervised-video-object-segmentation-on-10)](https://paperswithcode.com/sota/unsupervised-video-object-segmentation-on-10?p=motion-attentive-transition-for-zero-shot)`

Motion-Attentive Transition for Zero-Shot Video Object Segmentation

9 Mar 2020 · Tianfei Zhou, Shunzhou Wang, Yi Zhou, Yazhou Yao, Jianwu Li, Ling Shao ·

In this paper, we present a novel Motion-Attentive Transition Network (MATNet) for zero-shot video object segmentation, which provides a new way of leveraging motion information to reinforce spatio-temporal object representation. An asymmetric attention block, called Motion-Attentive Transition (MAT), is designed within a two-stream encoder, which transforms appearance features into motion-attentive representations at each convolutional stage. In this way, the encoder becomes deeply interleaved, allowing for closely hierarchical interactions between object motion and appearance. This is superior to the typical two-stream architecture, which treats motion and appearance separately in each stream and often suffers from overfitting to appearance information. Additionally, a bridge network is proposed to obtain a compact, discriminative and scale-sensitive representation for multi-level encoder features, which is further fed into a decoder to achieve segmentation results. Extensive experiments on three challenging public benchmarks (i.e. DAVIS-16, FBMS and Youtube-Objects) show that our model achieves compelling performance against the state-of-the-arts.

PDF Abstract

Code

Add Remove Mark official

tfzhou/MATNet official

190

Tasks

Add Remove

Object

Segmentation

Semantic Segmentation

Unsupervised Video Object Segmentation

Video Object Segmentation

Video Semantic Segmentation

Zero-Shot Video Object Segmentation

Datasets

DAVIS

DAVIS 2016

FBMS

Results from the Paper

Edit

Ranked #9 on Unsupervised Video Object Segmentation on FBMS test

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Unsupervised Video Object Segmentation	DAVIS 2016 val	MATNet	G	81.6	# 17	Compare
			J	82.4	# 17	Compare
			F	80.7	# 16	Compare
Unsupervised Video Object Segmentation	FBMS test	MATNet	J	76.1	# 9	Compare
Unsupervised Video Object Segmentation	YouTube-Objects	MATNet	J	69.0	# 10	Compare

Methods

Add Remove

Convolution

Edit Social Preview

Motion-Attentive Transition for Zero-Shot Video Object Segmentation

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove