Video Object Segmentation

240 papers with code • 9 benchmarks • 17 datasets

Video object segmentation is a binary labeling problem aiming to separate foreground object(s) from the background region of a video.

For leaderboards please refer to the different subtasks.

Benchmarks

Add a Result

These leaderboards are used to track progress in Video Object Segmentation

Dataset	Best Model	Compare
DAVIS 2016	ISVOS (BL30K, MS)	See all
DAVIS 2017 (val)	XMem (BLK30K, MS)	See all
YouTube-VOS 2018	XMem (BL30K, MS)	See all
DAVIS 2017 (test-dev)	BATMAN	See all
YouTube-VOS 2019	XMem (BL30K,MS)	See all
DAVIS 2017	AOC-MF (val)	See all
FBMS	Ours	See all
DAVIS-2017 (test-dev)	XMem (BL30K, MS)	See all
YouTube	Ours	See all

Libraries

Use these libraries to find Video Object Segmentation models and implementations

yoxu515/aot-benchmark

4 papers

560

visionml/pytracking

3 papers

3,082

hkchengrex/Mask-Propagation

3 papers

124

z-x-yang/AOT

3 papers

116

Datasets

Subtasks

Video Salient Object Detection

Interactive Video Object Segmentation

Long-tail Video Object Segmentation

Most implemented papers

Most implemented Social Latest No code

Emerging Properties in Self-Supervised Vision Transformers

facebookresearch/dino • • ICCV 2021

In this paper, we question if self-supervised learning provides new properties to Vision Transformer (ViT) that stand out compared to convolutional networks (convnets).

Paper
Code

One-Shot Video Object Segmentation

kmaninis/OSVOS-PyTorch • • CVPR 2017

This paper tackles the task of semi-supervised video object segmentation, i. e., the separation of an object from the background in a video, given the mask of the first frame.

Paper
Code

PReMVOS: Proposal-generation, Refinement and Merging for Video Object Segmentation

JonathonLuiten/PReMVOS • • 24 Jul 2018

We address semi-supervised video object segmentation, the task of automatically generating accurate and consistent pixel masks for objects in a video sequence, given the first-frame ground truth annotations.

Paper
Code

Modular Interactive Video Object Segmentation: Interaction-to-Mask, Propagation and Difference-Aware Fusion

hkchengrex/MiVOS • • CVPR 2021

We present Modular interactive VOS (MiVOS) framework which decouples interaction-to-mask and mask propagation, allowing for higher generalizability and better performance.

Paper
Code

Rethinking Self-supervised Correspondence Learning: A Video Frame-level Similarity Perspective

xvjiarui/VFS • • ICCV 2021

To learn generalizable representation for correspondence in large-scale, a variety of self-supervised pretext tasks are proposed to explicitly perform object-level or patch-level similarity learning.

Paper
Code

Lucid Data Dreaming for Video Object Segmentation

omkar13/MaskTrack • • 28 Mar 2017

Our approach is suitable for both single and multiple object segmentation.

Paper
Code

YouTube-VOS: Sequence-to-Sequence Video Object Segmentation

BehradToghi/ECCV_Youtube_VOS • • ECCV 2018

End-to-end sequential learning to explore spatial-temporal features for video segmentation is largely limited by the scale of available video segmentation datasets, i. e., even the largest video segmentation dataset only contains 90 short video clips.

Paper
Code

Interactive Video Object Segmentation Using Global and Local Transfer Modules

yuk6heo/IVOS-ATNet • • ECCV 2020

The global transfer module conveys the segmentation information in an annotated frame to a target frame, while the local transfer module propagates the segmentation information in a temporally adjacent frame to the target frame.

Paper
Code

Make One-Shot Video Object Segmentation Efficient Again

dvl-tum/e-osvos • • NeurIPS 2020

In the semi-supervised setting, the first mask of each object is provided at test time.

Paper
Code

Video Polyp Segmentation: A Deep Learning Perspective

DengPingFan/PraNet • • 27 Mar 2022

We present the first comprehensive video polyp segmentation (VPS) study in the deep learning era.

Paper
Code

Video Object Segmentation

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Most implemented papers

Content

Benchmarks

Add a Result