Video Semantic Segmentation

325 papers with code • 5 benchmarks • 8 datasets

This task has no description! Would you like to contribute one?

Libraries

Use these libraries to find Video Semantic Segmentation models and implementations

Latest papers with no code

Point-VOS: Pointing Up Video Object Segmentation

no code yet • 8 Feb 2024

We propose a novel Point-VOS task with a spatio-temporally sparse point-wise annotation scheme that substantially reduces the annotation effort.

Is Two-shot All You Need? A Label-efficient Approach for Video Segmentation in Breast Ultrasound

no code yet • 7 Feb 2024

Breast lesion segmentation from breast ultrasound (BUS) videos could assist in early diagnosis and treatment.

Self-supervised Video Object Segmentation with Distillation Learning of Deformable Attention

no code yet • 25 Jan 2024

This is enabled by deformable attention mechanism, where the keys and values capturing the memory of a video sequence in the attention module have flexible locations updated across frames.

Explore Synergistic Interaction Across Frames for Interactive Video Object Segmentation

no code yet • 23 Jan 2024

Interactive Video Object Segmentation (iVOS) is a challenging task that requires real-time human-computer interaction.

Understanding Video Transformers via Universal Concept Discovery

no code yet • 19 Jan 2024

Concretely, we seek to explain the decision-making process of video transformers based on high-level, spatiotemporal concepts that are automatically discovered.

No More Shortcuts: Realizing the Potential of Temporal Self-Supervision

no code yet • 20 Dec 2023

To address these issues, we propose 1) a more challenging reformulation of temporal self-supervision as frame-level (rather than clip-level) recognition tasks and 2) an effective augmentation strategy to mitigate shortcuts.

Appearance-based Refinement for Object-Centric Motion Segmentation

no code yet • 18 Dec 2023

The goal of this paper is to discover, segment, and track independently moving objects in complex visual scenes.

Artificial intelligence optical hardware empowers high-resolution hyperspectral video understanding at 1.2 Tb/s

no code yet • 17 Dec 2023

The technology platform combines artificial intelligence hardware, processing information optically, with state-of-the-art machine vision networks, resulting in a data processing speed of 1. 2 Tb/s with hundreds of frequency bands and megapixel spatial resolution at video rates.

TAM-VT: Transformation-Aware Multi-scale Video Transformer for Segmentation and Tracking

no code yet • 13 Dec 2023

In this work we propose a novel, clip-based DETR-style encoder-decoder architecture, which focuses on systematically analyzing and addressing aforementioned challenges.

GenDeF: Learning Generative Deformation Field for Video Generation

no code yet • 7 Dec 2023

We offer a new perspective on approaching the task of video generation.