PhraseCut

PhraseCut is a dataset of 77,262 images and 345,486 phrase-region pairs. It is built on top of the Visual Genome dataset and uses Visual Genome's existing annotations to generate a challenging set of referring phrases, for which the corresponding regions are manually annotated.

Source: PhraseCut: Language-based Image Segmentation in the Wild

License

  • Unknown

Modalities

Languages

Tasks