TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Semi-Supervised Video Object Segmentation	DAVIS 2016	RMNet	Jaccard (Mean)	88.9	# 30
Semi-Supervised Video Object Segmentation	DAVIS 2016	RMNet	F-measure (Mean)	88.7	# 42
Semi-Supervised Video Object Segmentation	DAVIS 2016	RMNet	J&F	88.8	# 39
Semi-Supervised Video Object Segmentation	DAVIS 2017 (test-dev)	RMNet	J&F	75.0	# 33
Semi-Supervised Video Object Segmentation	DAVIS 2017 (test-dev)	RMNet	Jaccard (Mean)	71.9	# 31
Semi-Supervised Video Object Segmentation	DAVIS 2017 (test-dev)	RMNet	F-measure (Mean)	78.1	# 34
Semi-Supervised Video Object Segmentation	DAVIS 2017 (val)	RMNet	Jaccard (Mean)	81.0	# 31
Semi-Supervised Video Object Segmentation	DAVIS 2017 (val)	RMNet	F-measure (Mean)	86.0	# 32
Semi-Supervised Video Object Segmentation	DAVIS 2017 (val)	RMNet	J&F	83.5	# 32
Semi-Supervised Video Object Segmentation	DAVIS (no YouTube-VOS training)	RMNet	FPS	11.9	# 12
Semi-Supervised Video Object Segmentation	DAVIS (no YouTube-VOS training)	RMNet	D16 val (G)	81.5	# 16
Semi-Supervised Video Object Segmentation	DAVIS (no YouTube-VOS training)	RMNet	D16 val (J)	80.6	# 14
Semi-Supervised Video Object Segmentation	DAVIS (no YouTube-VOS training)	RMNet	D16 val (F)	82.3	# 14
Semi-Supervised Video Object Segmentation	DAVIS (no YouTube-VOS training)	RMNet	D17 val (G)	75.0	# 9
Semi-Supervised Video Object Segmentation	DAVIS (no YouTube-VOS training)	RMNet	D17 val (J)	72.8	# 10
Semi-Supervised Video Object Segmentation	DAVIS (no YouTube-VOS training)	RMNet	D17 val (F)	77.2	# 10
Semi-Supervised Video Object Segmentation	YouTube-VOS 2018	RMNet	F-Measure (Seen)	85.7	# 37
Semi-Supervised Video Object Segmentation	YouTube-VOS 2018	RMNet	F-Measure (Unseen)	82.4	# 40
Semi-Supervised Video Object Segmentation	YouTube-VOS 2018	RMNet	Overall	81.5	# 36
Semi-Supervised Video Object Segmentation	YouTube-VOS 2018	RMNet	Jaccard (Seen)	82.1	# 30
Semi-Supervised Video Object Segmentation	YouTube-VOS 2018	RMNet	Jaccard (Unseen)	75.7	# 35
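
The aggregate rows in the table above appear to follow the usual benchmark conventions: J&F on DAVIS as the arithmetic mean of Jaccard (Mean) and F-measure (Mean), and Overall on YouTube-VOS 2018 as the mean of the four seen/unseen Jaccard and F-Measure scores. The sketch below checks that reading against the reported numbers. The dictionary keys, function names, and the rounding tolerance are illustrative assumptions, not part of the leaderboard.

```python
from statistics import mean

# Values copied from the RMNet rows of the results table.
DAVIS = {
    "DAVIS 2016": {"J": 88.9, "F": 88.7, "J&F": 88.8},
    "DAVIS 2017 (test-dev)": {"J": 71.9, "F": 78.1, "J&F": 75.0},
    "DAVIS 2017 (val)": {"J": 81.0, "F": 86.0, "J&F": 83.5},
}
YOUTUBE_VOS_2018 = {
    "J_seen": 82.1, "J_unseen": 75.7,
    "F_seen": 85.7, "F_unseen": 82.4,
    "Overall": 81.5,
}


def j_and_f(row):
    """Assumed DAVIS convention: J&F is the mean of J and F."""
    return mean([row["J"], row["F"]])


def youtube_vos_overall(row):
    """Assumed YouTube-VOS convention: mean of the four seen/unseen scores."""
    return mean(row[k] for k in ("J_seen", "J_unseen", "F_seen", "F_unseen"))


def matches_reported(computed, reported, tol=0.051):
    """Reported values have one decimal, so allow for rounding error."""
    return abs(computed - reported) <= tol


for name, row in DAVIS.items():
    print(name, round(j_and_f(row), 2), matches_reported(j_and_f(row), row["J&F"]))

overall = youtube_vos_overall(YOUTUBE_VOS_2018)
print("YouTube-VOS 2018", round(overall, 2),
      matches_reported(overall, YOUTUBE_VOS_2018["Overall"]))
```

Under these assumptions every reported aggregate agrees with the per-metric values to within rounding.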

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/efficient-regional-memory-network-for-video/semi-supervised-video-object-segmentation-on-20)](https://paperswithcode.com/sota/semi-supervised-video-object-segmentation-on-20?p=efficient-regional-memory-network-for-video)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/efficient-regional-memory-network-for-video/visual-object-tracking-on-davis-2017)](https://paperswithcode.com/sota/visual-object-tracking-on-davis-2017?p=efficient-regional-memory-network-for-video)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/efficient-regional-memory-network-for-video/semi-supervised-video-object-segmentation-on-1)](https://paperswithcode.com/sota/semi-supervised-video-object-segmentation-on-1?p=efficient-regional-memory-network-for-video)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/efficient-regional-memory-network-for-video/video-object-segmentation-on-youtube-vos)](https://paperswithcode.com/sota/video-object-segmentation-on-youtube-vos?p=efficient-regional-memory-network-for-video)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/efficient-regional-memory-network-for-video/visual-object-tracking-on-davis-2016)](https://paperswithcode.com/sota/visual-object-tracking-on-davis-2016?p=efficient-regional-memory-network-for-video)`

Efficient Regional Memory Network for Video Object Segmentation

CVPR 2021 · Haozhe Xie, Hongxun Yao, Shangchen Zhou, Shengping Zhang, Wenxiu Sun

Recently, several Space-Time Memory based networks have shown that the object cues (e.g., video frames as well as the segmented object masks) from the past frames are useful for segmenting objects in the current frame. However, these methods exploit the information from the memory by global-to-global matching between the current and past frames, which leads to mismatches with similar objects and high computational complexity. To address these problems, we propose a novel local-to-local matching solution for semi-supervised VOS, namely Regional Memory Network (RMNet). In RMNet, the precise regional memory is constructed by memorizing local regions where the target objects appear in the past frames. For the current query frame, the query regions are tracked and predicted based on the optical flow estimated from the previous frame. The proposed local-to-local matching effectively alleviates the ambiguity of similar objects in both memory and query frames, which allows the information to be passed from the regional memory to the query region efficiently and effectively. Experimental results indicate that the proposed RMNet performs favorably against state-of-the-art methods on the DAVIS and YouTube-VOS datasets.
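
To make the contrast between global-to-global and local-to-local matching concrete, here is a toy sketch in plain Python. It is not the authors' implementation: the feature maps are small nested lists, the regions are axis-aligned boxes passed in by hand (in the paper the query region is predicted from optical flow, which is not modeled here), and the names `crop`, `regional_read`, and `num_pairs` are hypothetical. The readout is a softmax-weighted sum of memory values, a common form of memory matching assumed for illustration.

```python
import math


def crop(feat, box):
    """Feature vectors inside box = (y0, x0, y1, x1), in row-major order."""
    y0, x0, y1, x1 = box
    return [feat[y][x] for y in range(y0, y1) for x in range(x0, x1)]


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def regional_read(query_key, mem_key, mem_val, query_box, mem_box):
    """Match only query-region pixels against memory-region pixels."""
    q = crop(query_key, query_box)
    k = crop(mem_key, mem_box)
    v = crop(mem_val, mem_box)
    dim = len(v[0])
    read = []
    for qi in q:
        # Dot-product similarity of one query pixel to every memory pixel.
        weights = softmax([sum(a * b for a, b in zip(qi, kj)) for kj in k])
        read.append([sum(w * vj[d] for w, vj in zip(weights, v))
                     for d in range(dim)])
    return read


def num_pairs(query_box, mem_box):
    """Number of query/memory pixel pairs that must be compared."""
    def area(b):
        return (b[2] - b[0]) * (b[3] - b[1])
    return area(query_box) * area(mem_box)


# Toy 4x4 feature maps with 2-channel keys and constant 2-channel values.
H = W = 4
keys = [[[float(y), float(x)] for x in range(W)] for y in range(H)]
vals = [[[1.0, 2.0] for _ in range(W)] for _ in range(H)]

full = (0, 0, H, W)       # global-to-global: whole frame on both sides
region = (1, 1, 3, 3)     # local-to-local: a 2x2 box around the object

read = regional_read(keys, keys, vals, region, region)
print(len(read), "query pixels read from memory")
print(num_pairs(full, full), "pairs (global) vs",
      num_pairs(region, region), "pairs (regional)")
```

In this toy setting the regional variant compares 16 pixel pairs instead of 256, and pixels outside the boxes can never be matched, which is the intuition behind both the efficiency and the reduced ambiguity claimed in the abstract.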

Code

hzxie/RMNet official

Tasks

Object

One-shot visual object segmentation

Optical Flow Estimation

Semantic Segmentation

Semi-Supervised Video Object Segmentation

Video Object Segmentation

Video Semantic Segmentation

Datasets

DAVIS

DAVIS 2017

DAVIS 2016

YouTube-VOS 2018

Referring Expressions for DAVIS 2016 & 2017

Results from the Paper

Ranked #9 on Semi-Supervised Video Object Segmentation on DAVIS (no YouTube-VOS training)

Methods

VOS
