Video Semantic Segmentation

320 papers with code • 5 benchmarks • 8 datasets

This task has no description! Would you like to contribute one?

Libraries

Use these libraries to find Video Semantic Segmentation models and implementations

Latest papers with no code

Moving Object Proposals with Deep Learned Optical Flow for Video Object Segmentation

no code yet • 14 Feb 2024

Then we render the output of optical flow net to a fully convolutional SegNet model.

Point-VOS: Pointing Up Video Object Segmentation

no code yet • 8 Feb 2024

We propose a novel Point-VOS task with a spatio-temporally sparse point-wise annotation scheme that substantially reduces the annotation effort.

Is Two-shot All You Need? A Label-efficient Approach for Video Segmentation in Breast Ultrasound

no code yet • 7 Feb 2024

Breast lesion segmentation from breast ultrasound (BUS) videos could assist in early diagnosis and treatment.

Vanishing-Point-Guided Video Semantic Segmentation of Driving Scenes

no code yet • 27 Jan 2024

The estimation of implicit cross-frame correspondences and the high computational cost have long been major challenges in video semantic segmentation (VSS) for driving scenes.

Self-supervised Video Object Segmentation with Distillation Learning of Deformable Attention

no code yet • 25 Jan 2024

This is enabled by deformable attention mechanism, where the keys and values capturing the memory of a video sequence in the attention module have flexible locations updated across frames.

Explore Synergistic Interaction Across Frames for Interactive Video Object Segmentation

no code yet • 23 Jan 2024

Interactive Video Object Segmentation (iVOS) is a challenging task that requires real-time human-computer interaction.

Understanding Video Transformers via Universal Concept Discovery

no code yet • 19 Jan 2024

Concretely, we seek to explain the decision-making process of video transformers based on high-level, spatiotemporal concepts that are automatically discovered.

No More Shortcuts: Realizing the Potential of Temporal Self-Supervision

no code yet • 20 Dec 2023

To address these issues, we propose 1) a more challenging reformulation of temporal self-supervision as frame-level (rather than clip-level) recognition tasks and 2) an effective augmentation strategy to mitigate shortcuts.

Appearance-based Refinement for Object-Centric Motion Segmentation

no code yet • 18 Dec 2023

The goal of this paper is to discover, segment, and track independently moving objects in complex visual scenes.

Artificial intelligence optical hardware empowers high-resolution hyperspectral video understanding at 1.2 Tb/s

no code yet • 17 Dec 2023

The technology platform combines artificial intelligence hardware, processing information optically, with state-of-the-art machine vision networks, resulting in a data processing speed of 1. 2 Tb/s with hundreds of frequency bands and megapixel spatial resolution at video rates.