Most implemented papers

TASED-Net: Temporally-Aggregating Spatial Encoder-Decoder Network for Video Saliency Detection

kylemin/TASED-Net ICCV 2019

It consists of two building blocks: first, the encoder network extracts low-resolution spatiotemporal features from an input clip of several consecutive frames, and then the following prediction network decodes the encoded features spatially while aggregating all the temporal information.
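The two-stage data flow described above can be illustrated with a toy sketch. This is not the authors' implementation: the function names `encode` and `predict` are hypothetical, and plain average pooling, temporal averaging, and nearest-neighbour upsampling stand in for the learned layers of a real network. It only shows how a clip of several frames might be reduced to low-resolution features and then collapsed into a single full-resolution map.

```python
# Toy sketch only: pooling and nearest-neighbour upsampling are assumed
# stand-ins for learned layers; names and shapes are illustrative.

def encode(clip, stride=2):
    """Downsample each frame spatially by average pooling; keep all T frames."""
    encoded = []
    for frame in clip:
        h, w = len(frame), len(frame[0])
        pooled = [
            [
                sum(frame[y + dy][x + dx]
                    for dy in range(stride) for dx in range(stride))
                / (stride * stride)
                for x in range(0, w - stride + 1, stride)
            ]
            for y in range(0, h - stride + 1, stride)
        ]
        encoded.append(pooled)
    return encoded  # shape: T x (H / stride) x (W / stride)


def predict(encoded, stride=2):
    """Aggregate over time (mean), then upsample spatially to one map."""
    t = len(encoded)
    h, w = len(encoded[0]), len(encoded[0][0])
    # Temporal aggregation: collapse the T axis into a single low-res map.
    fused = [
        [sum(encoded[k][y][x] for k in range(t)) / t for x in range(w)]
        for y in range(h)
    ]
    # Spatial decoding: nearest-neighbour upsampling back to input resolution.
    return [
        [fused[y // stride][x // stride] for x in range(w * stride)]
        for y in range(h * stride)
    ]


# A clip of 4 frames, each 4x4, with constant intensity equal to the frame index.
clip = [[[float(k)] * 4 for _ in range(4)] for k in range(4)]
saliency = predict(encode(clip))
print(len(saliency), len(saliency[0]))  # a single map at the input resolution
```

In this sketch the time axis survives encoding and is only collapsed in the prediction step, which mirrors the description of decoding spatially while aggregating temporal information.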

007: Democratically Finding The Cause of Packet Drops

behnazak/Vigil-007SourceCode 20 Feb 2018

Network failures continue to plague datacenter operators as their symptoms may not have direct correlation with where or why they occur.

A Plug-and-play Scheme to Adapt Image Saliency Deep Model for Video Data

YunX17/AdaptSaliency 2 Aug 2020

With the rapid development of deep learning techniques, image saliency deep models trained solely by spatial information have occasionally achieved detection performance for video data comparable to that of the models trained by both spatial and temporal information.

Exploring Rich and Efficient Spatial Temporal Interactions for Real Time Video Salient Object Detection

guotaowang/STVS 7 Aug 2020

In this way, even though the overall video saliency quality is heavily dependent on its spatial branch, the performance of the temporal branch still matters.

Hierarchical Domain-Adapted Feature Learning for Video Saliency Prediction

perceivelab/hd2s 2 Oct 2020

When the base hierarchical model is empowered with domain-specific modules, performance improves, outperforming state-of-the-art models on three out of five metrics on the DHF1K benchmark and reaching the second-best results on the other two.

ViNet: Pushing the limits of Visual Modality for Audio-Visual Saliency Prediction

samyak0210/ViNet 11 Dec 2020

We also explore a variation of ViNet architecture by augmenting audio features into the decoder.

Weakly Supervised Video Salient Object Detection

wangbo-zhao/WSVSOD CVPR 2021

Significant performance improvement has been achieved for fully-supervised video salient object detection with the pixel-wise labeled training datasets, which are time-consuming and expensive to obtain.

From Semantic Categories to Fixations: A Novel Weakly-Supervised Visual-Auditory Saliency Detection Approach

guotaowang/STANet CVPR 2021

Thanks to rapid advances in deep learning techniques and the wide availability of large-scale training sets, the performance of video saliency detection models has been improving steadily and significantly.

Weakly Supervised Visual-Auditory Saliency Detection with Multigranularity Perception

guotaowang/STANet 27 Dec 2021

Moreover, we distill knowledge from these regions to obtain complete new spatial-temporal-audio (STA) fixation prediction (FP) networks, enabling broad applications in cases where video tags are not available.