DigestPath

Introduced by Da et al. in DigestPath: a Benchmark Dataset with Challenge Review for the Pathological Detection and Segmentation of Digestive-System

Grand-Challenge Page

1. Signet ring cell dataset

Signet ring cell carcinoma is a type of rare adenocarcinoma with poor prognosis. Early detection of such cells leads to huge improvement of patients' survival rate. However, there is no existing public dataset with annotations for studying the problem of signet ring cell detection.

This dataset has positive samples and negative samples. Training positive samples contain 77 images from 20 WSIs, with cell bounding boxes written in xml. Training negative samples contain 378 images from 79 WSIs.These negative WSIs have no signet ring, but could contain other kinds of tumor cells. Each signet ring cell is labeled by experienced pathologists with a rectangle bounding box tightly surrounding the cell. Each image is of size 2000X2000. The training images are from 2 organs, including gastric mucosa and intestine. Because of the difficulty of manual annotation, there exist some signet ring cells who are missed by pathologists. In other words, this dataset is a noisy dataset with its positive images not fully annotated.

All whole slide images were stained by hematoxylin and eosin and scanned at X40.

2. Colonoscopy tissue segment dataset

Colonoscopy pathology examination can find cells of early-stage colon tumor from small tissue slices. Pathologists need to daily examine hundreds of tissue slices, which is a time consuming and exhausting work. Here we propose a challenge task on automatic colonoscopy tissue segmentation and screening, aiming at automatic lesion segmentation and classification of the whole tissue (benign vs. malignant).

This dataset has positive samples and negative samples. Training positive samples contain 250 images of tissue from 93 WSIs, with pixel-level annotation in jpg format, where 0 means background and 255 for foreground (malignant lesion). You could simply get binary mask by a threshold 128. Training negative samples contain 410 images of tissue from 231 WSI. This negative images have no annotation because they don't have any malignant lesion.

The average size of all images are of 5000x5000 pixels, some of them are extremely huge. We will also provide another 152 patients' 212 tissues as the testing set, in which 90 images from 65 patients contain lesion. All whole slide images were stained by hematoxylin and eosin and scanned at X20.

Sign the DATABASE USE AGREEMENT first and download the dataset at the homepage!

Da Q, Huang X, Li Z, et al. DigestPath: a Benchmark Dataset with Challenge Review for the 
Pathological Detection and Segmentation of Digestive-System[J]. 
Medical Image Analysis, 2022: 102485.

https://doi.org/10.1016/j.media.2022.102485

Papers


Paper Code Results Date Stars

Tasks


Similar Datasets