DME VQA dataset (Diabetic Macular Edema VQA dataset)

Introduced by Tascon-Morales et al. in Consistency-preserving Visual Question Answering in Medical Imaging

Medical VQA dataset built from the IDRiD and eOphta datasets. The dataset contains both healthy and unhealthy fundus images. For each image, a set of pre-defined questions is generated, including questions about regions (e.g. are there hard exudates in this region?), for which an associated mask denotes the location of the region.

The motivation for this dataset include the lack of public medical VQA datasets with related questions. In our dataset, questions are related because there is a high-level question about the DME grade of the image, and associated low-level questions that can lead to the answer of the high-level question. This allows to study the consistency of a VQA model i.e. how often the model produces contradictory answers to questions about a given image. Questions about regions are also a novel feature of this dataset.

The dataset can be used for general VQA purposes, and also for the more specific purpose of consistency improvement.

Number of images : Train: 433 Val: 112 Test: 134

Number of QA pairs: Train: 9779 Val: 2380 Test: 1311

To download the dataset, click here.

For more information, check our paper.

Papers


Paper Code Results Date Stars

Dataset Loaders


No data loaders found. You can submit your data loader here.

Tasks


License


Modalities


Languages