Video Saliency Detection

19 papers with code • 5 benchmarks • 2 datasets

This task has no description! Would you like to contribute one?

Benchmarks

Add a Result

These leaderboards are used to track progress in Video Saliency Detection

Dataset	Best Model	Compare
MSU Video Saliency Prediction	ViNet (dave)	See all
DHF1K	ViNet	See all
DIEM	AViNet	See all
Hollywood2	ViNet	See all
UCFSports	ViNet	See all

Datasets

Most implemented papers

Most implemented Social Latest No code

Contextual Encoder-Decoder Network for Visual Saliency Prediction

alexanderkroner/saliency • • 18 Feb 2019

To develop robust representations for this challenging task, high-level visual features at multiple spatial scales must be extracted and augmented with contextual information.

Paper
Code

Simple vs complex temporal recurrences for video saliency prediction

imatge-upc/SalEMA • • 3 Jul 2019

This paper investigates modifying an existing neural network architecture for static saliency prediction using two types of recurrences that integrate information from the temporal domain.

Paper
Code

Unified Image and Video Saliency Modeling

rdroste/unisal • • ECCV 2020

We evaluate our method on the video saliency datasets DHF1K, Hollywood-2 and UCF-Sports, and the image saliency datasets SALICON and MIT300.

Paper
Code

Revisiting Video Saliency: A Large-scale Benchmark and a New Model

wenguanwang/DHF1K • • CVPR 2018

Existing video saliency datasets lack variety and generality of common dynamic scenes and fall short in covering challenging situations in unconstrained environments.

Paper
Code

DeepVS: A Deep Learning Based Video Saliency Prediction Approach

remega/OMCNN_2CLSTM • • ECCV 2018

Hence, an object-to-motion convolutional neural network (OM-CNN) is developed to predict the intra-frame saliency for DeepVS, which is composed of the objectness and motion subnets.

Paper
Code

A Dilated Inception Network for Visual Saliency Prediction

ysyscool/DINet • • 7 Apr 2019

In this work, we proposed an end-to-end dilated inception network (DINet) for visual saliency prediction.

Paper
Code

TASED-Net: Temporally-Aggregating Spatial Encoder-Decoder Network for Video Saliency Detection

kylemin/TASED-Net • • ICCV 2019

It consists of two building blocks: first, the encoder network extracts low-resolution spatiotemporal features from an input clip of several consecutive frames, and then the following prediction network decodes the encoded features spatially while aggregating all the temporal information.

Paper
Code

Video Saliency Prediction Using Enhanced Spatiotemporal Alignment Network

cj4L/ESAN-VSP • • 2 Jan 2020

Due to a variety of motions across different frames, it is highly challenging to learn an effective spatiotemporal representation for accurate video saliency prediction (VSP).

Paper
Code

A Plug-and-play Scheme to Adapt Image Saliency Deep Model for Video Data

YunX17/AdaptSaliency • 2 Aug 2020

With the rapid development of deep learning techniques, image saliency deep models trained solely by spatial information have occasionally achieved detection performance for video data comparable to that of the models trained by both spatial and temporal information.

Paper
Code

Exploring Rich and Efficient Spatial Temporal Interactions for Real Time Video Salient Object Detection

guotaowang/STVS • • 7 Aug 2020

In this way, even though the overall video saliency quality is heavily dependent on its spatial branch, however, the performance of the temporal branch still matter.

Paper
Code

Video Saliency Detection

Benchmarks Add a Result

Datasets

Most implemented papers

Content

Benchmarks

Add a Result