Video Object Segmentation

240 papers with code • 9 benchmarks • 17 datasets

Video object segmentation is a binary labeling problem aiming to separate foreground object(s) from the background region of a video.

For leaderboards please refer to the different subtasks.

Libraries

Use these libraries to find Video Object Segmentation models and implementations

Towards Temporally Consistent Referring Video Object Segmentation

bo-miao/HTR 28 Mar 2024

Referring Video Object Segmentation (R-VOS) methods face challenges in maintaining consistent object segmentation due to temporal context variability and the presence of other visually similar objects.

2
28 Mar 2024

Efficient Video Object Segmentation via Modulated Cross-Attention Memory

amshaker/mavos 26 Mar 2024

Recently, transformer-based approaches have shown promising results for semi-supervised video object segmentation.

40
26 Mar 2024

PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model

zamling/psalm 21 Mar 2024

PSALM is a powerful extension of the Large Multi-modal Model (LMM) to address the segmentation task challenges.

106
21 Mar 2024

Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation

buxiangzhiren/vd-it 18 Mar 2024

We hypothesize that the latent representation learned from a pretrained generative T2V model encapsulates rich semantics and coherent temporal correspondences, thereby naturally facilitating video understanding.

13
18 Mar 2024

Video Object Segmentation with Dynamic Query Modulation

zht8506/qmvos 18 Mar 2024

Storing intermediate frame segmentations as memory for long-range context modeling, spatial-temporal memory-based methods have recently showcased impressive results in semi-supervised video object segmentation (SVOS).

4
18 Mar 2024

VideoMAC: Video Masked Autoencoders Meet ConvNets

nust-machine-intelligence-laboratory/videomac 29 Feb 2024

In this paper, we propose a new approach termed as \textbf{VideoMAC}, which combines video masked autoencoders with resource-friendly ConvNets.

8
29 Feb 2024

UniVS: Unified and Universal Video Segmentation with Prompts as Queries

minghanli/univs 28 Feb 2024

Despite the recent advances in unified image segmentation (IS), developing a unified video segmentation (VS) model remains a challenge.

111
28 Feb 2024

Lester: rotoscope animation through video object segmentation and tracking

rtous/lester 15 Feb 2024

This article introduces Lester, a novel method to automatically synthetise retro-style 2D animations from videos.

3
15 Feb 2024

Vivim: a Video Vision Mamba for Medical Video Object Segmentation

scott-yjyang/vivim 25 Jan 2024

Traditional convolutional neural networks have a limited receptive field while transformer-based networks are mediocre in constructing long-term dependency from the perspective of computational complexity.

92
25 Jan 2024

OMG-Seg: Is One Model Good Enough For All Segmentation?

lxtgh/omg-seg 18 Jan 2024

In this work, we address various segmentation tasks, each traditionally tackled by distinct or partially unified models.

679
18 Jan 2024