TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Semi-Supervised Video Object Segmentation	DAVIS 2016	STCN	Jaccard (Mean)	90.4	# 17
Semi-Supervised Video Object Segmentation	DAVIS 2016	STCN	Jaccard (Recall)	98.1	# 1
Semi-Supervised Video Object Segmentation	DAVIS 2016	STCN	Jaccard (Decay)	4.1	# 31
Semi-Supervised Video Object Segmentation	DAVIS 2016	STCN	F-measure (Mean)	93.0	# 18
Semi-Supervised Video Object Segmentation	DAVIS 2016	STCN	F-measure (Recall)	97.1	# 1
Semi-Supervised Video Object Segmentation	DAVIS 2016	STCN	F-measure (Decay)	4.3	# 30
Semi-Supervised Video Object Segmentation	DAVIS 2016	STCN	J&F	91.7	# 17
Semi-Supervised Video Object Segmentation	DAVIS 2016	STCN	Speed (FPS)	26.9	# 20
Semi-Supervised Video Object Segmentation	DAVIS 2017 (test-dev)	STCN	J&F	79.9	# 18
Semi-Supervised Video Object Segmentation	DAVIS 2017 (test-dev)	STCN	Jaccard (Mean)	76.3	# 18
Semi-Supervised Video Object Segmentation	DAVIS 2017 (test-dev)	STCN	Jaccard (Recall)	85.5	# 1
Semi-Supervised Video Object Segmentation	DAVIS 2017 (test-dev)	STCN	Jaccard (Decay)	10.5	# 1
Semi-Supervised Video Object Segmentation	DAVIS 2017 (test-dev)	STCN	F-measure (Mean)	83.5	# 19
Semi-Supervised Video Object Segmentation	DAVIS 2017 (test-dev)	STCN	F-measure (Recall)	89.7	# 1
Semi-Supervised Video Object Segmentation	DAVIS 2017 (test-dev)	STCN	F-measure (Decay)	10.3	# 1
Semi-Supervised Video Object Segmentation	DAVIS 2017 (val)	STCN	Jaccard (Mean)	82.0	# 23
Semi-Supervised Video Object Segmentation	DAVIS 2017 (val)	STCN	Jaccard (Recall)	91.3	# 2
Semi-Supervised Video Object Segmentation	DAVIS 2017 (val)	STCN	Jaccard (Decay)	6.2	# 1
Semi-Supervised Video Object Segmentation	DAVIS 2017 (val)	STCN	F-measure (Mean)	88.6	# 17
Semi-Supervised Video Object Segmentation	DAVIS 2017 (val)	STCN	F-measure (Recall)	94.6	# 1
Semi-Supervised Video Object Segmentation	DAVIS 2017 (val)	STCN	F-measure (Decay)	85.3	# 23
Semi-Supervised Video Object Segmentation	DAVIS 2017 (val)	STCN	J&F	85.3	# 20
Semi-Supervised Video Object Segmentation	DAVIS 2017 (val)	STCN	Speed (FPS)	20.2	# 20
Semi-Supervised Video Object Segmentation	MOSE	STCN	J&F	50.8	# 15
Semi-Supervised Video Object Segmentation	MOSE	STCN	J	46.6	# 15
Semi-Supervised Video Object Segmentation	MOSE	STCN	F	55.0	# 14
Semi-Supervised Video Object Segmentation	YouTube-VOS 2018	STCN	F-Measure (Seen)	87.9	# 23
Semi-Supervised Video Object Segmentation	YouTube-VOS 2018	STCN	F-Measure (Unseen)	87.3	# 13
Semi-Supervised Video Object Segmentation	YouTube-VOS 2018	STCN	Jaccard (Seen)	83.2	# 23
Semi-Supervised Video Object Segmentation	YouTube-VOS 2018	STCN	Jaccard (Unseen)	79.0	# 13
Semi-Supervised Video Object Segmentation	YouTube-VOS 2019	STCN	Overall	84.2	# 16
Semi-Supervised Video Object Segmentation	YouTube-VOS 2019	STCN	Jaccard (Seen)	82.6	# 17
Semi-Supervised Video Object Segmentation	YouTube-VOS 2019	STCN	Jaccard (Unseen)	79.4	# 12
Semi-Supervised Video Object Segmentation	YouTube-VOS 2019	STCN	F-Measure (Seen)	87.0	# 17
Semi-Supervised Video Object Segmentation	YouTube-VOS 2019	STCN	F-Measure (Unseen)	87.7	# 11
Semi-Supervised Video Object Segmentation	YouTube-VOS 2019	STCN (MS)	Overall	85.2	# 9
Semi-Supervised Video Object Segmentation	YouTube-VOS 2019	STCN (MS)	Jaccard (Seen)	83.5	# 13
Semi-Supervised Video Object Segmentation	YouTube-VOS 2019	STCN (MS)	Jaccard (Unseen)	80.8	# 7
Semi-Supervised Video Object Segmentation	YouTube-VOS 2019	STCN (MS)	F-Measure (Seen)	87.8	# 15
Semi-Supervised Video Object Segmentation	YouTube-VOS 2019	STCN (MS)	F-Measure (Unseen)	88.8	# 7
Video Object Segmentation	YouTube-VOS 2019	STCN	Mean Jaccard & F-Measure	82.7	# 7
Video Object Segmentation	YouTube-VOS 2019	STCN	Jaccard (Seen)	81.1	# 8
Video Object Segmentation	YouTube-VOS 2019	STCN	Jaccard (Unseen)	78.2	# 6
Video Object Segmentation	YouTube-VOS 2019	STCN	F-Measure (Seen)	85.4	# 8
Video Object Segmentation	YouTube-VOS 2019	STCN	F-Measure (Unseen)	85.9	# 6

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/rethinking-space-time-networks-with-improved/video-object-segmentation-on-youtube-vos-2019-2)](https://paperswithcode.com/sota/video-object-segmentation-on-youtube-vos-2019-2?p=rethinking-space-time-networks-with-improved)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/rethinking-space-time-networks-with-improved/semi-supervised-video-object-segmentation-on-18)](https://paperswithcode.com/sota/semi-supervised-video-object-segmentation-on-18?p=rethinking-space-time-networks-with-improved)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/rethinking-space-time-networks-with-improved/video-object-segmentation-on-youtube-vos)](https://paperswithcode.com/sota/video-object-segmentation-on-youtube-vos?p=rethinking-space-time-networks-with-improved)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/rethinking-space-time-networks-with-improved/semi-supervised-video-object-segmentation-on-21)](https://paperswithcode.com/sota/semi-supervised-video-object-segmentation-on-21?p=rethinking-space-time-networks-with-improved)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/rethinking-space-time-networks-with-improved/visual-object-tracking-on-davis-2016)](https://paperswithcode.com/sota/visual-object-tracking-on-davis-2016?p=rethinking-space-time-networks-with-improved)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/rethinking-space-time-networks-with-improved/semi-supervised-video-object-segmentation-on-1)](https://paperswithcode.com/sota/semi-supervised-video-object-segmentation-on-1?p=rethinking-space-time-networks-with-improved)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/rethinking-space-time-networks-with-improved/visual-object-tracking-on-davis-2017)](https://paperswithcode.com/sota/visual-object-tracking-on-davis-2017?p=rethinking-space-time-networks-with-improved)`

Rethinking Space-Time Networks with Improved Memory Coverage for Efficient Video Object Segmentation

NeurIPS 2021 · Ho Kei Cheng, Yu-Wing Tai, Chi-Keung Tang ·

This paper presents a simple yet effective approach to modeling space-time correspondences in the context of video object segmentation. Unlike most existing approaches, we establish correspondences directly between frames without re-encoding the mask features for every object, leading to a highly efficient and robust framework. With the correspondences, every node in the current query frame is inferred by aggregating features from the past in an associative fashion. We cast the aggregation process as a voting problem and find that the existing inner-product affinity leads to poor use of memory with a small (fixed) subset of memory nodes dominating the votes, regardless of the query. In light of this phenomenon, we propose using the negative squared Euclidean distance instead to compute the affinities. We validated that every memory node now has a chance to contribute, and experimentally showed that such diversified voting is beneficial to both memory efficiency and inference accuracy. The synergy of correspondence networks and diversified voting works exceedingly well, achieves new state-of-the-art results on both DAVIS and YouTubeVOS datasets while running significantly faster at 20+ FPS for multiple objects without bells and whistles.

PDF Abstract NeurIPS 2021 PDF NeurIPS 2021 Abstract

Code

Add Remove Mark official

hkchengrex/STCN official

523

limingxing00/rde-vos-cvpr2022

alipga/AMM_VOS

Tasks

Add Remove

Semantic Segmentation

Semi-Supervised Video Object Segmentation

Video Object Segmentation

Video Semantic Segmentation

Datasets

DAVIS

DAVIS 2017

DAVIS 2016

YouTube-VOS 2018

Referring Expressions for DAVIS 2016 & 2017

MOSE

BL30K

Results from the Paper

Edit

Ranked #7 on Video Object Segmentation on YouTube-VOS 2019

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Semi-Supervised Video Object Segmentation	DAVIS 2016	STCN	Jaccard (Mean)	90.4	# 17	Compare
			Jaccard (Recall)	98.1	# 1	Compare
			Jaccard (Decay)	4.1	# 31	Compare
			F-measure (Mean)	93.0	# 18	Compare
			F-measure (Recall)	97.1	# 1	Compare
			F-measure (Decay)	4.3	# 30	Compare
			J&F	91.7	# 17	Compare
			Speed (FPS)	26.9	# 20	Compare
Semi-Supervised Video Object Segmentation	DAVIS 2017 (test-dev)	STCN	J&F	79.9	# 18	Compare
			Jaccard (Mean)	76.3	# 18	Compare
			Jaccard (Recall)	85.5	# 1	Compare
			Jaccard (Decay)	10.5	# 1	Compare
			F-measure (Mean)	83.5	# 19	Compare
			F-measure (Recall)	89.7	# 1	Compare
			F-measure (Decay)	10.3	# 1	Compare
Semi-Supervised Video Object Segmentation	DAVIS 2017 (val)	STCN	Jaccard (Mean)	82.0	# 23	Compare
			Jaccard (Recall)	91.3	# 2	Compare
			Jaccard (Decay)	6.2	# 1	Compare
			F-measure (Mean)	88.6	# 17	Compare
			F-measure (Recall)	94.6	# 1	Compare
			F-measure (Decay)	85.3	# 23	Compare
			J&F	85.3	# 20	Compare
			Speed (FPS)	20.2	# 20	Compare
Semi-Supervised Video Object Segmentation	MOSE	STCN	J&F	50.8	# 15	Compare
			J	46.6	# 15	Compare
			F	55.0	# 14	Compare
Semi-Supervised Video Object Segmentation	YouTube-VOS 2018	STCN	F-Measure (Seen)	87.9	# 23	Compare
			F-Measure (Unseen)	87.3	# 13	Compare
			Jaccard (Seen)	83.2	# 23	Compare
			Jaccard (Unseen)	79.0	# 13	Compare
Semi-Supervised Video Object Segmentation	YouTube-VOS 2019	STCN	Overall	84.2	# 16	Compare
			Jaccard (Seen)	82.6	# 17	Compare
			Jaccard (Unseen)	79.4	# 12	Compare
			F-Measure (Seen)	87.0	# 17	Compare
			F-Measure (Unseen)	87.7	# 11	Compare
Semi-Supervised Video Object Segmentation	YouTube-VOS 2019	STCN (MS)	Overall	85.2	# 9	Compare
			Jaccard (Seen)	83.5	# 13	Compare
			Jaccard (Unseen)	80.8	# 7	Compare
			F-Measure (Seen)	87.8	# 15	Compare
			F-Measure (Unseen)	88.8	# 7	Compare
Video Object Segmentation	YouTube-VOS 2019	STCN	Mean Jaccard & F-Measure	82.7	# 7	Compare
			Jaccard (Seen)	81.1	# 8	Compare
			Jaccard (Unseen)	78.2	# 6	Compare
			F-Measure (Seen)	85.4	# 8	Compare
			F-Measure (Unseen)	85.9	# 6	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Rethinking Space-Time Networks with Improved Memory Coverage for Efficient Video Object Segmentation

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove