Video Semantic Segmentation

325 papers with code • 5 benchmarks • 8 datasets

Benchmarks

These leaderboards track progress in Video Semantic Segmentation.

Dataset	Best Model
Cityscapes val	TMANet-50
CamVid	TMANet-50
VSPW	DVIS++ (VIT-L)
LaRS	WaSR-T (ResNet-101)
Multispectral Video Semantic Segmentation	MVNet (DeepLabV3)

Libraries

Use these libraries to find Video Semantic Segmentation models and implementations

yoxu515/aot-benchmark • 4 papers • 563 stars
PaddlePaddle/PaddleSeg • 3 papers • 8,263 stars
visionml/pytracking • 3 papers • 3,089 stars
hkchengrex/Mask-Propagation • 3 papers • 124 stars

See all 9 libraries.

Subtasks

Camera shot segmentation

Latest papers with no code

Point-VOS: Pointing Up Video Object Segmentation

no code yet • 8 Feb 2024

We propose a novel Point-VOS task with a spatio-temporally sparse point-wise annotation scheme that substantially reduces the annotation effort.

Is Two-shot All You Need? A Label-efficient Approach for Video Segmentation in Breast Ultrasound

no code yet • 7 Feb 2024

Breast lesion segmentation from breast ultrasound (BUS) videos could assist in early diagnosis and treatment.

Self-supervised Video Object Segmentation with Distillation Learning of Deformable Attention

no code yet • 25 Jan 2024

This is enabled by a deformable attention mechanism, where the keys and values capturing the memory of a video sequence in the attention module have flexible locations updated across frames.

Explore Synergistic Interaction Across Frames for Interactive Video Object Segmentation

no code yet • 23 Jan 2024

Interactive Video Object Segmentation (iVOS) is a challenging task that requires real-time human-computer interaction.

Understanding Video Transformers via Universal Concept Discovery

no code yet • 19 Jan 2024

Concretely, we seek to explain the decision-making process of video transformers based on high-level, spatiotemporal concepts that are automatically discovered.

No More Shortcuts: Realizing the Potential of Temporal Self-Supervision

no code yet • 20 Dec 2023

To address these issues, we propose 1) a more challenging reformulation of temporal self-supervision as frame-level (rather than clip-level) recognition tasks and 2) an effective augmentation strategy to mitigate shortcuts.

Appearance-based Refinement for Object-Centric Motion Segmentation

no code yet • 18 Dec 2023

The goal of this paper is to discover, segment, and track independently moving objects in complex visual scenes.

Artificial intelligence optical hardware empowers high-resolution hyperspectral video understanding at 1.2 Tb/s

no code yet • 17 Dec 2023

The technology platform combines artificial intelligence hardware, processing information optically, with state-of-the-art machine vision networks, resulting in a data processing speed of 1.2 Tb/s with hundreds of frequency bands and megapixel spatial resolution at video rates.

TAM-VT: Transformation-Aware Multi-scale Video Transformer for Segmentation and Tracking

no code yet • 13 Dec 2023

In this work, we propose a novel, clip-based DETR-style encoder-decoder architecture, which focuses on systematically analyzing and addressing the aforementioned challenges.

GenDeF: Learning Generative Deformation Field for Video Generation

no code yet • 7 Dec 2023

We offer a new perspective on approaching the task of video generation.
