Video Semantic Segmentation
322 papers with code • 5 benchmarks • 8 datasets
Libraries
Use these libraries to find Video Semantic Segmentation models and implementations.

Latest papers
UniVS: Unified and Universal Video Segmentation with Prompts as Queries
Despite the recent advances in unified image segmentation (IS), developing a unified video segmentation (VS) model remains a challenge.
PolypNextLSTM: A lightweight and fast polyp video segmentation network using ConvNext and ConvLSTM
Our primary novelty lies in PolypNextLSTM, the leanest and fastest of the models compared, which surpasses five state-of-the-art image- and video-based deep learning models.
Lester: rotoscope animation through video object segmentation and tracking
This article introduces Lester, a novel method to automatically synthesize retro-style 2D animations from videos.
We're Not Using Videos Effectively: An Updated Domain Adaptive Video Segmentation Baseline
While the vast majority of prior work has studied this as a frame-level Image-DAS problem, a few Video-DAS works have sought to additionally leverage the temporal signal present in adjacent frames.
Vivim: a Video Vision Mamba for Medical Video Object Segmentation
Traditional convolutional neural networks have a limited receptive field, while transformer-based networks can only model long-term dependencies at high computational cost.
OMG-Seg: Is One Model Good Enough For All Segmentation?
In this work, we address various segmentation tasks, each traditionally tackled by distinct or partially unified models.
RAP-SAM: Towards Real-Time All-Purpose Segment Anything
The Segment Anything Model (SAM) is a remarkable model that achieves generalized segmentation.
1st Place Solution for 5th LSVOS Challenge: Referring Video Object Segmentation
Recent transformer-based models have dominated the Referring Video Object Segmentation (RVOS) task due to their superior performance.
Tracking with Human-Intent Reasoning
The perception component then generates the tracking results based on the embeddings.
UniRef++: Segment Every Reference Object in Spatial and Temporal Spaces
We evaluate our unified models on various benchmarks.