The BraTS 2015 dataset is a dataset for brain tumor image segmentation. It consists of 220 high grade gliomas (HGG) and 54 low grade gliomas (LGG) MRIs. The four MRI modalities are T1, T1c, T2, and T2FLAIR. Segmented “ground truth” is provide about four intra-tumoral classes, viz. edema, enhancing tumor, non-enhancing tumor, and necrosis.
68 PAPERS • 1 BENCHMARK
LiTS17 is a liver tumor segmentation benchmark. The data and segmentations are provided by various clinical sites around the world. The training data set contains 130 CT scans and the test data set 70 CT scans. Image Source: https://arxiv.org/pdf/1707.07734.pdf
44 PAPERS • 3 BENCHMARKS
Head and Neck Tumor Segmentation
25 PAPERS • NO BENCHMARKS YET
Introduced by Da et al. in DigestPath: a Benchmark Dataset with Challenge Review for the Pathological Detection and Segmentation of Digestive-System
23 PAPERS • 1 BENCHMARK
BRATS 2016 is a brain tumor segmentation dataset. It shares the same training set as BRATS 2015, which consists of 220 HHG and 54 LGG. Its testing dataset consists of 191 cases with unknown grades. Image Source: https://sites.google.com/site/braintumorsegmentation/home/brats_2016
13 PAPERS • NO BENCHMARKS YET
BRATS 2014 is a brain tumor segmentation dataset.
5 PAPERS • 1 BENCHMARK
A whole-body FDG-PET/CT dataset with manually annotated tumor lesions (FDG-PET-CT-Lesions) 1,014 studies (900 patients)
4 PAPERS • NO BENCHMARKS YET
CSAW-S is a dataset of mammography images which includes expert annotations of tumors and non-expert annotations of breast anatomy and artifacts in the image.
The ULS23 test set contains 725 lesions from 284 patients of the Radboudumc and JBZ hospitals in the Netherlands. It is intended to be used to measure the performance of 3D universal lesion segmentation models for Computed Tomography (CT). To prepare the data, radiological reports from both participating institutions where searched using NLP tools identifying patients with measurable target lesions, indicating that these lesions were clinically relevant. A random sample of patients was selected, 56.3% of which were male and with diverse scanner manufacturers. The lesions were annotated in 3D by expert radiologists with over 10 years of experience in reading oncological scans. ULS23 is an open benchmark, and we invite ongoing submissions to advance the development of future ULS models.
2 PAPERS • 1 BENCHMARK
https://drive.google.com/file/d/1X_JTfD8Ch-IxmG5VHtKk_xGZT336Fl1Q/view?usp=drive_link
1 PAPER • NO BENCHMARKS YET
Highlights
A dataset of abdominal CT studies in NifTi format from the open-source medical data repository Medical Decathlon was utilized. To expedite the partitioning process, the MONAILabel plugin of the MONAI framework within the 3D Slicer program was employed. A radiologist with 15 years of experience conducted a validation process, wherein the boundaries of the colon markup were verified on each slice. The existing colorectal cancer markings in the dataset remained unaltered. Validation by a radiologist reduced the size of the validated dataset to 122 studies. In this case, the 122 studies were categorized into three subsets based on the quality of the data: The "good" subset comprises 100 studies, while the "bad" subset contains 17 cropped studies (in which the entire colon is not visible on the image). The "bad" subset comprises five studies. Two of these studies were of poor quality and could not identify the entire colon. Two further studies involved colon stomas following surgery, while
1 PAPER • 1 BENCHMARK
Unique radiogenomic dataset from a Non-Small Cell Lung Cancer (NSCLC) cohort of 211 subjects. The dataset comprises Computed Tomography (CT), Positron Emission Tomography (PET)/CT images, semantic annotations of the tumors as observed on the medical images using a controlled vocabulary, segmentation maps of tumors in the CT scans, and quantitative values obtained from the PET/CT scans. Imaging data are also paired with gene mutation, RNA sequencing data from samples of surgically excised tumor tissue, and clinical data, including survival outcomes.
The ULS23 training dataset contains 38,693 diverse lesions from chest-abdomen-pelvis CT examinations. For the challenge, we introduced two novel 3D annotated datasets targeting lesions in the pancreas and bones, which are traditionally challenging to segment. Additionally, we aggregate 10 publicly available datasets with a lesion segmentation component into a single, easily accessible data repository.