Semi-Supervised Video Object Segmentation

94 papers with code • 15 benchmarks • 13 datasets

The semi-supervised scenario assumes the user inputs a full mask of the object(s) of interest in the first frame of a video sequence. Methods have to produce the segmentation mask for that object(s) in the subsequent frames.

Benchmarks

Add a Result

These leaderboards are used to track progress in Semi-Supervised Video Object Segmentation

Dataset	Best Model	Compare
DAVIS 2017 (val)	Cutie+ (base)	See all
DAVIS 2016	ISVOS (BL30K, MS)	See all
DAVIS 2017 (test-dev)	Cutie+ (base, MEGA)	See all
YouTube-VOS 2018	Cutie+ (base, MEGA)	See all
DAVIS (no YouTube-VOS training)	HMMN	See all
YouTube-VOS 2019	Cutie+ (base, MEGA)	See all
VOT2020	SwinB-DeAOT-L	See all
MOSE	Cutie+ (base, MEGA)	See all
Long Video Dataset	ISVOS	See all
YouTube	FEELVOS	See all
DAVIS-2017	STCN + TrickVOS (PT)	See all
Long Video Dataset (3X)	XMem	See all
BURST-val	Cutie (base, MEGA, 600 pixels)	See all
BURST-test	Cutie (base, MEGA, 600 pixels)	See all
DAVIS-2016	STCN + TrickVOS (PT)	See all

Show all 15 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find Semi-Supervised Video Object Segmentation models and implementations

yoxu515/aot-benchmark

4 papers

563

hkchengrex/Mask-Propagation

3 papers

124

z-x-yang/AOT

3 papers

118

Datasets

Subtasks

One-shot visual object segmentation

Latest papers

Most implemented Social Latest No code

CLVOS23: A Long Video Object Segmentation Dataset for Continual Learning

amir4g/clvos23 • 9 Apr 2023

Continual learning in real-world scenarios is a major challenge.

09 Apr 2023

Paper
Code

Learning to Learn Better for Video Object Segmentation

vitae-transformer/vos-llb • • 5 Dec 2022

Recently, the joint learning framework (JOINT) integrates matching based transductive reasoning and online inductive learning to achieve accurate and robust semi-supervised video object segmentation (SVOS).

05 Dec 2022

Paper
Code

Decoupling Features in Hierarchical Propagation for Video Object Segmentation

yoxu515/aot-benchmark • • 18 Oct 2022

To solve such a problem and further facilitate the learning of visual embeddings, this paper proposes a Decoupling Features in Hierarchical Propagation (DeAOT) approach.

563

18 Oct 2022

Paper
Code

Global Spectral Filter Memory Network for Video Object Segmentation

workforai/gsfm • • 11 Oct 2022

Besides, we empirically find low frequency feature should be enhanced in encoder (backbone) while high frequency for decoder (segmentation head).

11 Oct 2022

Paper
Code

SWEM: Towards Real-Time Video Object Segmentation with Sequential Weighted Expectation-Maximization

lmm077/SWEM • • CVPR 2022

Matching-based methods, especially those based on space-time memory, are significantly ahead of other solutions in semi-supervised video object segmentation (VOS).

22 Aug 2022

Paper
Code

Per-Clip Video Object Segmentation

pkyong95/PCVOS • • CVPR 2022

In this per-clip inference scheme, we update the memory with an interval and simultaneously process a set of consecutive frames (i. e. clip) between the memory updates.

03 Aug 2022

Paper
Code

Learning Quality-aware Dynamic Memory for Video Object Segmentation

workforai/qdmn • • 16 Jul 2022

However, they mainly focus on better matching between the current frame and the memory frames without explicitly paying attention to the quality of the memory.

140

16 Jul 2022

Paper
Code

XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model

hkchengrex/XMem • • 14 Jul 2022

We present XMem, a video object segmentation architecture for long videos with unified feature memory stores inspired by the Atkinson-Shiffrin memory model.

1,595

14 Jul 2022

Paper
Code

Tackling Background Distraction in Video Object Segmentation

suhwan-cho/tbd • • 14 Jul 2022

Semi-supervised video object segmentation (VOS) aims to densely track certain designated objects in videos.

14 Jul 2022

Paper
Code

Towards Robust Video Object Segmentation with Adaptive Object Calibration

jerryx1110/robust-video-object-segmentation • • 2 Jul 2022

We consolidate this conditional mask calibration process in a progressive manner, where the object representations and proto-masks evolve to be discriminative iteratively.

02 Jul 2022

Paper
Code

Semi-Supervised Video Object Segmentation

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers

Content

Benchmarks

Add a Result