8 dataset results for segmentation AND Unsupervised Video Object Segmentation

MOSE (Complex Video Object Segmentation)

CoMplex video Object SEgmentation (MOSE) is a dataset to study the tracking and segmenting objects in complex environments. MOSE contains 2,149 video clips and 5,200 objects from 36 categories, with 431,725 high-quality object segmentation masks.

21 PAPERS • 1 BENCHMARK

FBMS (Freiburg-Berkeley Motion Segmentation)

The Freiburg-Berkeley Motion Segmentation Dataset (FBMS-59) is an extension of the BMS dataset with 33 additional video sequences. A total of 720 frames is annotated. It has pixel-accurate segmentation annotations of moving objects. FBMS-59 comes with a split into a training set and a test set.

119 PAPERS • 3 BENCHMARKS

FBMS-59 (Freiburg-Berkeley Motion Segmentation)

The Freiburg-Berkeley Motion Segmentation Dataset (FBMS-59) is a dataset for motion segmentation, which extends the BMS-26 dataset with 33 additional video sequences.

17 PAPERS • 2 BENCHMARKS

SegTrack-v2

SegTrack v2 is a video segmentation dataset with full pixel-level annotations on multiple objects at each frame within each video.

102 PAPERS • 4 BENCHMARKS

DAVIS 2017

DAVIS17 is a dataset for video object segmentation. It contains a total of 150 videos - 60 for training, 30 for validation, 60 for testing

273 PAPERS • 11 BENCHMARKS

DAVIS 2016

DAVIS16 is a dataset for video object segmentation which consists of 50 videos in total (30 videos for training and 20 for testing). Per-frame pixel-wise annotations are offered.

217 PAPERS • 4 BENCHMARKS

Referring Expressions for DAVIS 2016 & 2017

…To validate our approach we employ two popular video object segmentation datasets, DAVIS16 [38] and DAVIS17 [42]. For the multiple object video segmentation task we consider DAVIS17. As our goal is to segment objects in videos using language specifications, we augment all objects annotated with mask labels in DAVIS16 and DAVIS17 with non-ambiguous referring expressions. (We actually quantified that only∼ 15% of the collected descriptions become invalid over time and it does not affect strongly segmentation results as temporal consistency step helps to disambiguate some We believe the collected data will be of interest to segmentation as well as vision and language communities, providing an opportunity to explore language as alternative input for video object segmentation

75 PAPERS • 5 BENCHMARKS

BL30K

…We break the dataset into six segments, each with approximately 5K videos.

11 PAPERS • NO BENCHMARKS YET

Datasets

8 dataset results for segmentation AND Unsupervised Video Object Segmentation