TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Unsupervised Video Object Segmentation	DAVIS 2016 val	DPA	G	87.6	# 3
Unsupervised Video Object Segmentation	DAVIS 2016 val	DPA	J	86.8	# 4
Unsupervised Video Object Segmentation	DAVIS 2016 val	DPA	F	88.4	# 3
Unsupervised Video Object Segmentation	FBMS test	DPA	J	83.4	# 1
Unsupervised Video Object Segmentation	YouTube-Objects	DPA	J	73.7	# 2

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/domain-alignment-and-temporal-aggregation-for/unsupervised-video-object-segmentation-on-11)](https://paperswithcode.com/sota/unsupervised-video-object-segmentation-on-11?p=domain-alignment-and-temporal-aggregation-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/domain-alignment-and-temporal-aggregation-for/unsupervised-video-object-segmentation-on-12)](https://paperswithcode.com/sota/unsupervised-video-object-segmentation-on-12?p=domain-alignment-and-temporal-aggregation-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/domain-alignment-and-temporal-aggregation-for/unsupervised-video-object-segmentation-on-10)](https://paperswithcode.com/sota/unsupervised-video-object-segmentation-on-10?p=domain-alignment-and-temporal-aggregation-for)`

Dual Prototype Attention for Unsupervised Video Object Segmentation

22 Nov 2022 · Suhwan Cho, Minhyeok Lee, Seunghoon Lee, Dogyoon Lee, Heeseung Choi, Ig-Jae Kim, Sangyoun Lee ·

Unsupervised video object segmentation (VOS) aims to detect and segment the most salient object in videos. The primary techniques used in unsupervised VOS are 1) the collaboration of appearance and motion information; and 2) temporal fusion between different frames. This paper proposes two novel prototype-based attention mechanisms, inter-modality attention (IMA) and inter-frame attention (IFA), to incorporate these techniques via dense propagation across different modalities and frames. IMA densely integrates context information from different modalities based on a mutual refinement. IFA injects global context of a video to the query frame, enabling a full utilization of useful properties from multiple frames. Experimental results on public benchmark datasets demonstrate that our proposed approach outperforms all existing methods by a substantial margin. The proposed two components are also thoroughly validated via ablative study.

PDF Abstract

Code

Add Remove Mark official

hydragon516/dpa official

Tasks

Add Remove

Object

Semantic Segmentation

Unsupervised Video Object Segmentation

Video Object Segmentation

Video Semantic Segmentation

Datasets

DAVIS 2016

YouTube-VOS 2018

FBMS

Results from the Paper

Edit

Ranked #1 on Unsupervised Video Object Segmentation on FBMS test

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Unsupervised Video Object Segmentation	DAVIS 2016 val	DPA	G	87.6	# 3	Compare
			J	86.8	# 4	Compare
			F	88.4	# 3	Compare
Unsupervised Video Object Segmentation	FBMS test	DPA	J	83.4	# 1	Compare
Unsupervised Video Object Segmentation	YouTube-Objects	DPA	J	73.7	# 2	Compare

Methods

Add Remove

TAM • VOS

Edit Social Preview

Dual Prototype Attention for Unsupervised Video Object Segmentation

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove