The unsupervised scenario assumes that the user does not interact with the algorithm to obtain the segmentation masks. Methods should provide a set of object candidates with no overlapping pixels that span through the whole video sequence. This set of objects should contain at least the objects that capture human attention when watching the whole video sequence i.e objects that are more likely to be followed by human gaze.
We introduce a novel network, called CO-attention Siamese Network (COSNet), to address the unsupervised video object segmentation task from a holistic view.
Ranked #1 on
Unsupervised Video Object Segmentation
on YouTube
SEMANTIC SEGMENTATION UNSUPERVISED VIDEO OBJECT SEGMENTATION VIDEO SEMANTIC SEGMENTATION
Multiple object video object segmentation is a challenging task, specially for the zero-shot case, when no object mask is given at the initial frame and the model has to find the objects to be segmented along the sequence.
Ranked #1 on
Youtube-VOS
on YouTube-VOS
This paper conducts a systematic study on the role of visual attention in Unsupervised Video Object Segmentation (UVOS) tasks.
EYE TRACKING SEMANTIC SEGMENTATION UNSUPERVISED VIDEO OBJECT SEGMENTATION VIDEO SEGMENTATION VIDEO SEMANTIC SEGMENTATION
This paper proposes an end-to-end trainable network, SegFlow, for simultaneously predicting pixel-wise object segmentation and optical flow in videos.
Ranked #16 on
Unsupervised Video Object Segmentation
on DAVIS 2016
OPTICAL FLOW ESTIMATION SEMANTIC SEGMENTATION SEMI-SUPERVISED VIDEO OBJECT SEGMENTATION UNSUPERVISED VIDEO OBJECT SEGMENTATION VIDEO SEMANTIC SEGMENTATION VISUAL OBJECT TRACKING
To further demonstrate the generalization ability of our spatiotemporal learning framework, we extend MATNet to another relevant task: dynamic visual attention prediction (DVAP).
SEMANTIC SEGMENTATION UNSUPERVISED VIDEO OBJECT SEGMENTATION VIDEO SEMANTIC SEGMENTATION
In this paper, we present a novel Motion-Attentive Transition Network (MATNet) for zero-shot video object segmentation, which provides a new way of leveraging motion information to reinforce spatio-temporal object representation.
Ranked #2 on
Unsupervised Video Object Segmentation
on DAVIS 2016
(using extra training data)
SEMANTIC SEGMENTATION UNSUPERVISED VIDEO OBJECT SEGMENTATION VIDEO SEMANTIC SEGMENTATION
Unsupervised video object segmentation has often been tackled by methods based on recurrent neural networks and optical flow.
Ranked #3 on
Unsupervised Video Object Segmentation
on DAVIS 2016
(using extra training data)
OPTICAL FLOW ESTIMATION SEMANTIC SEGMENTATION UNSUPERVISED VIDEO OBJECT SEGMENTATION VIDEO SEMANTIC SEGMENTATION
This paper proposes a fast video salient object detection model, based on a novel recurrent network architecture, named Pyramid Dilated Bidirectional ConvLSTM (PDB-ConvLSTM).
Ranked #1 on
Video Salient Object Detection
on UVSD
(using extra training data)
SALIENT OBJECT DETECTION SEMANTIC SEGMENTATION UNSUPERVISED VIDEO OBJECT SEGMENTATION VIDEO SALIENT OBJECT DETECTION VIDEO SEMANTIC SEGMENTATION
We propose a new method for video object segmentation (VOS) that addresses object pattern learning from unlabeled videos, unlike most existing methods which rely heavily on extensive annotated data.
REPRESENTATION LEARNING SEMANTIC SEGMENTATION SEMI-SUPERVISED VIDEO OBJECT SEGMENTATION UNSUPERVISED VIDEO OBJECT SEGMENTATION VIDEO SEMANTIC SEGMENTATION
In this paper, we pro-pose a different approach that is well-suited to a variety of tasks involvinginstance segmentation in videos.
INSTANCE SEGMENTATION SEMANTIC SEGMENTATION UNSUPERVISED VIDEO OBJECT SEGMENTATION VIDEO INSTANCE SEGMENTATION