2 dataset results for Unsupervised Semantic Segmentation with Language-image Pre-training AND English

MS COCO (Microsoft Common Objects in Context)

The MS COCO (Microsoft Common Objects in Context) dataset is a large-scale object detection, segmentation, key-point detection, and captioning dataset. The dataset consists of 328K images.

10,453 PAPERS • 93 BENCHMARKS

KITTI-STEP

The Segmenting and Tracking Every Pixel (STEP) benchmark consists of 21 training sequences and 29 test sequences. It is based on the KITTI Tracking Evaluation and the Multi-Object Tracking and Segmentation (MOTS) benchmark. This benchmark extends the annotations to the Segmenting and Tracking Every Pixel (STEP) task. [Copy-pasted from http://www.cvlibs.net/datasets/kitti/eval_step.php]

20 PAPERS • 2 BENCHMARKS

Datasets

2 dataset results for Unsupervised Semantic Segmentation with Language-image Pre-training AND English