Semi-Supervised Video Object Segmentation

94 papers with code • 15 benchmarks • 13 datasets

The semi-supervised scenario assumes the user provides a full mask of the object(s) of interest in the first frame of a video sequence. Methods must then produce the segmentation mask for the same object(s) in every subsequent frame.
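The input/output contract described above can be sketched in a few lines of Python. This is only a toy baseline, not any of the methods listed on this page: it assumes frames are small grids of RGB tuples, that object id 0 means background, and it labels each later pixel by the nearest mean colour seen in the annotated first frame. The names `mean_colours`, `nearest_id`, and `segment_video` are hypothetical.

```python
from typing import Dict, List, Tuple

Pixel = Tuple[float, float, float]
Frame = List[List[Pixel]]  # H x W grid of RGB pixels
Mask = List[List[int]]     # H x W grid of object ids (assumed: 0 = background)


def mean_colours(frame: Frame, mask: Mask) -> Dict[int, Pixel]:
    """Average colour of each object id in the annotated first frame."""
    sums: Dict[int, List[float]] = {}
    counts: Dict[int, int] = {}
    for pixel_row, id_row in zip(frame, mask):
        for pixel, obj_id in zip(pixel_row, id_row):
            acc = sums.setdefault(obj_id, [0.0, 0.0, 0.0])
            for channel in range(3):
                acc[channel] += pixel[channel]
            counts[obj_id] = counts.get(obj_id, 0) + 1
    return {
        obj_id: (acc[0] / counts[obj_id], acc[1] / counts[obj_id], acc[2] / counts[obj_id])
        for obj_id, acc in sums.items()
    }


def nearest_id(pixel: Pixel, prototypes: Dict[int, Pixel]) -> int:
    """Object id whose mean colour is closest (squared Euclidean distance)."""
    def sq_dist(obj_id: int) -> float:
        return sum((a - b) ** 2 for a, b in zip(pixel, prototypes[obj_id]))
    return min(prototypes, key=sq_dist)


def segment_video(frames: List[Frame], first_mask: Mask) -> List[Mask]:
    """Semi-supervised protocol: the first mask is given, the rest are predicted."""
    prototypes = mean_colours(frames[0], first_mask)
    masks: List[Mask] = [first_mask]  # frame 0 keeps the user-provided annotation
    for frame in frames[1:]:
        masks.append([[nearest_id(pixel, prototypes) for pixel in row] for row in frame])
    return masks
```

Real methods replace the colour prototypes with learned features and a memory of past frames, but the interface stays the same: one annotated frame in, one mask per frame out.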

Benchmarks

These leaderboards are used to track progress in Semi-Supervised Video Object Segmentation.

Dataset	Best Model
DAVIS 2017 (val)	Cutie+ (base)
DAVIS 2016	ISVOS (BL30K, MS)
DAVIS 2017 (test-dev)	Cutie+ (base, MEGA)
YouTube-VOS 2018	Cutie+ (base, MEGA)
DAVIS (no YouTube-VOS training)	HMMN
YouTube-VOS 2019	Cutie+ (base, MEGA)
VOT2020	SwinB-DeAOT-L
MOSE	Cutie+ (base, MEGA)
Long Video Dataset	ISVOS
YouTube	FEELVOS
DAVIS-2017	STCN + TrickVOS (PT)
Long Video Dataset (3X)	XMem
BURST-val	Cutie (base, MEGA, 600 pixels)
BURST-test	Cutie (base, MEGA, 600 pixels)
DAVIS-2016	STCN + TrickVOS (PT)

Libraries

Use these libraries to find Semi-Supervised Video Object Segmentation models and implementations.

yoxu515/aot-benchmark • 4 papers • 560 stars

hkchengrex/Mask-Propagation • 3 papers • 124 stars

z-x-yang/AOT • 3 papers • 116 stars

Datasets

Subtasks

One-shot visual object segmentation

Latest papers

Efficient Video Object Segmentation via Modulated Cross-Attention Memory

amshaker/mavos • 26 Mar 2024

Recently, transformer-based approaches have shown promising results for semi-supervised video object segmentation.

Video Object Segmentation with Dynamic Query Modulation

zht8506/qmvos • 18 Mar 2024

Storing intermediate frame segmentations as memory for long-range context modeling, spatial-temporal memory-based methods have recently showcased impressive results in semi-supervised video object segmentation (SVOS).

Lester: rotoscope animation through video object segmentation and tracking

rtous/lester • 15 Feb 2024

This article introduces Lester, a novel method to automatically synthetise retro-style 2D animations from videos.

ODTrack: Online Dense Temporal Token Learning for Visual Tracking

gxnu-zhonglab/odtrack • 3 Jan 2024

To alleviate the above problem, we propose a simple, flexible and effective video-level tracking pipeline, named ODTrack, which densely associates the contextual relationships of video frames in an online token propagation manner.

Putting the Object Back into Video Object Segmentation

hkchengrex/Cutie • 19 Oct 2023

We present Cutie, a video object segmentation (VOS) network with object-level memory reading, which puts the object representation from memory back into the video object segmentation result.

455 stars

Tracking Anything with Decoupled Video Segmentation

hkchengrex/Tracking-Anything-with-DEVA • ICCV 2023

To 'track anything' without training on video data for every individual task, we develop a decoupled video segmentation approach (DEVA), composed of task-specific image-level segmentation and class/task-agnostic bi-directional temporal propagation.

1,050 stars

07 Sep 2023

XMem++: Production-level Video Segmentation From Few Annotated Frames

max810/XMem2 • ICCV 2023

Despite advancements in user-guided video segmentation, extracting complex objects consistently for highly complex scenes is still a labor-intensive task, especially for production.

130 stars

29 Jul 2023

Tracking Anything in High Quality

jiawen-zhu/hqtrack • 26 Jul 2023

To further improve the quality of tracking masks, a pretrained MR model is employed to refine the tracking results.

732 stars

READMem: Robust Embedding Association for a Diverse Memory in Unconstrained Video Object Segmentation

Vujas-Eteph/READMem • 22 May 2023

We present READMem (Robust Embedding Association for a Diverse Memory), a modular framework for semi-automatic video object segmentation (sVOS) methods designed to handle unconstrained videos.

Video Object Segmentation in Panoptic Wild Scenes

yoxu515/aot-benchmark • 8 May 2023

Considering the challenges in panoptic VOS, we propose a strong baseline method named panoptic object association with transformers (PAOT), which uses panoptic identification to associate objects with a pyramid architecture on multiple scales.

560 stars