FractureAtlas is a musculoskeletal bone fracture dataset with annotations for deep learning tasks like classification, localization, and segmentation.
1 PAPER • NO BENCHMARKS YET
The DISRPT 2019 workshop introduces the first iteration of a cross-formalism shared task on discourse unit segmentation. Since all major discourse parsing frameworks imply a segmentation of texts into segments, learning segmentations for and from diverse resources is a promising area for converging methods and insights. Because different corpora, languages and frameworks use different guidelines for segmentation, the shared task is meant to promote the design of flexible methods for dealing with various guidelines.
4 PAPERS • NO BENCHMARKS YET
SegPANDA200 is a public pathological H&E image dataset for a segmentation task, derived from the PANDA challenge dataset, with 200-micron regions rendered at 512 px.
2 PAPERS • NO BENCHMARKS YET
5987 high spatial resolution (0.3 m) remote sensing images from Nanjing, Changzhou, and Wuhan, focusing on the different geographical environments of urban and rural areas. The dataset advances both semantic segmentation and domain adaptation tasks and poses three considerable challenges: multi-scale objects, complex background samples, and inconsistent class distributions. Two contests are held on CodaLab, including LoveDA Semantic Segmentation.
45 PAPERS • 1 BENCHMARK
…While deep learning techniques are widely used in medical image segmentation and have been applied to the ICH segmentation task, existing public ICH datasets do not support multi-class segmentation. To address this, we develop the Brain Hemorrhage Segmentation Dataset (BHSD), a 3D multi-class ICH dataset containing 192 volumes with pixel-level annotations and 2200 volumes with slice-level annotations. To demonstrate the utility of the dataset, we formulate a series of supervised and semi-supervised ICH segmentation tasks.
…Segmentation, meant as the partition of MR brain images into multiple anatomical classes, is an essential step in many functional and structural neuroimaging studies. In this work, we design and test CEREBRUM-7T, an optimised end-to-end CNN architecture that segments a whole 7T T1w MRI brain volume at once, without the need for 2D or 3D partitioning. The generated model is able to produce accurate multi-structure segmentation masks on six different classes in only a few seconds. In the experimental part, we show that the proposed solution outperforms the GT it was trained on in segmentation accuracy. For more details, please visit: https://rocknroll87q.github.io/cerebrum7t
MatSeg Dataset for Zero-Shot Material States Segmentation: the dataset contains large-scale synthetic images for training and highly diverse real-world image benchmarks for testing. It focuses on zero-shot, class-agnostic segmentation of materials and their states, i.e., finding the regions of material states without pre-training on the specific material classes or states. It contains both hard segmentation maps and soft, partial similarity annotations for similar but not identical materials.
Large-scale and open-access LiDAR dataset intended for the evaluation of real-time semantic segmentation algorithms.
3 PAPERS • 1 BENCHMARK
…A precise three-dimensional spatial description, i.e. segmentation, of the target volumes as well as OARs is required for optimal radiation dose distribution calculation, which is primarily performed using CT images. Although attempts have been made towards the segmentation of OARs from MR images, so far there has been no evaluation of the impact the combined analysis of CT and MR images has on OAR segmentation. The Head and Neck Organ-at-Risk Multi-Modal Segmentation Challenge aims to promote the development of new, and application of existing, fully automated techniques for OAR segmentation in the HaN region from CT images that exploit the information of multiple imaging modalities so as to improve the accuracy of segmentation results.
5 PAPERS • NO BENCHMARKS YET
UVO is a new benchmark for open-world class-agnostic object segmentation in videos.
22 PAPERS • 3 BENCHMARKS
…The 2021 challenge utilizes multi-institutional pre-operative baseline multi-parametric magnetic resonance imaging (mpMRI) scans, and focuses on the evaluation of state-of-the-art methods for segmentation (Task 1).
10 PAPERS • NO BENCHMARKS YET
The increasing use of deep learning techniques has reduced interpretation time and, ideally, reduced interpreter bias by automatically deriving geological maps from digital outcrop models. However, accurate validation of these automated mapping approaches is a significant challenge due to the subjective nature of geological mapping and the difficulty in collecting quantitative validation data. Additionally, many state-of-the-art deep learning methods are limited to 2D image data, which is insufficient for 3D digital outcrops, such as hyperclouds. To address these challenges, we present Tinto, a multi-sensor benchmark digital outcrop dataset designed to facilitate the development and validation of deep learning approaches for geological mapping, especially for non-structured 3D data like point clouds. Tinto comprises two complementary sets: 1) a real digital outcrop model from Corta Atalaya (Spain), with spectral attributes and ground-truth data, and 2) a synthetic twin.
The DISRPT 2021 shared task, co-located with CODI 2021 at EMNLP, introduces the second iteration of a cross-formalism shared task on discourse unit segmentation and connective detection.
3 PAPERS • NO BENCHMARKS YET
We present the Dayton Annotated LiDAR Earth Scan (DALES) data set, a new large-scale aerial LiDAR data set with over a half-billion hand-labeled points spanning 10 square kilometers of area and eight object categories. Large annotated point cloud data sets have become the standard for evaluating deep learning methods. However, most of the existing data sets focus on data collected from a mobile or terrestrial scanner, with few focusing on aerial data. Point cloud data collected from an Aerial Laser Scanner (ALS) presents a new set of challenges and applications in areas such as 3D urban modeling and large-scale surveillance. DALES is the most extensive publicly available ALS data set, with over 400 times the number of points and six times the resolution of other currently available annotated aerial point cloud data sets. This data set gives a critical number of expert-verified, hand-labeled points for the evaluation of new 3D deep learning algorithms.
23 PAPERS • 2 BENCHMARKS
Projection of RibFrac CT dataset to a 2D plane to imitate X-Ray data for a total of 880 images with multi-label segmentation masks.
0 PAPER • NO BENCHMARKS YET
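A simple way to obtain such X-ray-like 2D images from CT volumes is an intensity projection along one axis. The sketch below is a hypothetical illustration of this technique (the function name and normalization scheme are assumptions, not the dataset authors' actual pipeline):

```python
import numpy as np

def project_volume(volume: np.ndarray, axis: int = 0) -> np.ndarray:
    """Collapse a 3D CT volume to a 2D image by mean-intensity
    projection along one axis (a rough X-ray-like approximation)."""
    projection = volume.mean(axis=axis)
    # Normalize to [0, 1] so the result can be saved as an image.
    lo, hi = projection.min(), projection.max()
    return (projection - lo) / (hi - lo + 1e-8)

# Toy volume: 4 axial slices of 8x8 voxels.
vol = np.random.rand(4, 8, 8)
img = project_volume(vol, axis=0)  # 2D array of shape (8, 8)
```

Maximum-intensity projection (`volume.max(axis=axis)`) is a common alternative when bright structures such as bone should dominate the projection.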
The challenge of accurately segmenting individual trees from laser scanning data hinders the assessment of crucial tree parameters necessary for effective forest management, impacting many downstream applications. While dense laser scanning offers detailed 3D representations, automating the segmentation of trees and their structures from point clouds remains difficult. Addressing these gaps, the FOR-instance data represent a novel benchmarking dataset to enhance forest measurement using dense airborne laser scanning data, aiding researchers in advancing segmentation methods. In this repository, users will find forest laser scanning point clouds from unmanned aerial vehicles (using Riegl sensors) that are manually segmented into individual trees (1130 trees).
…ACCT is a fast and accessible automatic cell counting tool using machine learning for 2D image segmentation.
…created in the frame of the 3DTeethSeg 2022 MICCAI challenge to boost the research field and inspire the 3D vision research community to work on intra-oral 3D scan analysis such as teeth identification and segmentation.
A Sentinel-2 based time-series, multi-country benchmark dataset tailored for agricultural monitoring applications with machine and deep learning. Sen4AgriNet is annotated from farmer declarations collected via the Land Parcel Identification System (LPIS), harmonizing country-wide labels. Sen4AgriNet is the only multi-country, multi-year dataset that includes all spectral information. It is constructed to cover the period 2016-2020 for Catalonia and France, while it can be extended to include additional countries. Currently, it contains 42.5 million parcels, which makes it significantly larger than other available archives.
Human fibrosarcoma HT1080WT (ATCC) cells at low cell densities embedded in 3D collagen type I matrices [1]. The time-lapse videos were recorded every 2 minutes for 16.7 hours and covered a field of view of 1002 pixels × 1004 pixels with a pixel size of 0.802 μm/pixel. The videos were pre-processed to correct frame-to-frame drift artifacts, resulting in a final size of 983 pixels × 985 pixels.
SAMRS is a remote sensing segmentation dataset which provides object category, location, and instance information that can be used for semantic segmentation, instance segmentation, and object detection.
VCSL (Video Copy Segment Localization) is a new comprehensive segment-level annotated video copy dataset. Compared with existing copy detection datasets restricted by either video-level annotation or small scale, VCSL not only has two orders of magnitude more segment-level labelled data, with 160k realistic video copy pairs containing more than 280k localized copied segment pairs, but also covers a variety of video categories and a wide range of video durations. All the copied segments inside each collected video pair are manually extracted and accompanied by precisely annotated starting and ending timestamps.
…In the OCTScenes-A dataset, scenes 0–3099 without segmentation annotation are for training, while scenes 3100–3199 with segmentation annotation can be used for testing. In the OCTScenes-B dataset, scenes 0–4899 without segmentation annotation are for training, while scenes 4900–4999 with segmentation annotation can be used for testing.
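The split described above can be expressed as a small helper that returns the scene-index lists for each variant. This is a hypothetical sketch based only on the index ranges stated in the description; the function name and how scenes are actually loaded are assumptions:

```python
def octscenes_split(variant: str):
    """Return (train_ids, test_ids) scene-index lists for the
    OCTScenes splits described above. Test ranges are inclusive;
    loading the scene data itself is dataset-specific."""
    test_ranges = {
        "A": (3100, 3199),  # scenes 0-3099 train, 3100-3199 test
        "B": (4900, 4999),  # scenes 0-4899 train, 4900-4999 test
    }
    test_start, test_end = test_ranges[variant]
    train_ids = list(range(0, test_start))
    test_ids = list(range(test_start, test_end + 1))
    return train_ids, test_ids

train_a, test_a = octscenes_split("A")  # 3100 train, 100 test scenes
train_b, test_b = octscenes_split("B")  # 4900 train, 100 test scenes
```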
A dataset of 663 deidentified computed tomography (CT) scans acquired in routine clinical practice, with segmentations taken from clinical practice.
IntrA is an open-access 3D intracranial aneurysm dataset that makes the application of point-based and mesh-based classification and segmentation models available. 103 3D models of entire brain vessels are collected by reconstructing scanned 2D MRA images of patients (the raw 2D MRA images are not published due to medical ethics). 1909 blood vessel segments are generated automatically from the complete models, including 1694 healthy vessel segments and 215 aneurysm segments for diagnosis. 116 aneurysm segments are divided and annotated manually by medical experts; the scale of each aneurysm segment is based on the need for a preoperative examination. Geodesic distance matrices are computed and included for each annotated 3D segment, because the geodesic distance is more accurate than the Euclidean distance given the shape of vessels.
Standardized Multi-Channel Dataset for Glaucoma (SMDG-19) is a collection and standardization of 19 public datasets, comprising full-fundus glaucoma images, associated image metadata such as optic disc segmentation, optic cup segmentation, and blood vessel segmentation, and any provided per-instance text metadata such as sex and age.
…It annotates inter-segment relations based on COCO panoptic segmentation.
19 PAPERS • 1 BENCHMARK
Panoptic nuScenes is a benchmark dataset that extends the popular nuScenes dataset with point-wise groundtruth annotations for semantic segmentation, panoptic segmentation, and panoptic tracking tasks.
ZeroWaste is a dataset for automatic waste detection and segmentation. This dataset contains over 1,800 fully segmented video frames collected from a real waste sorting plant, along with waste material labels for training and evaluation of segmentation methods. ZeroWaste also provides frames of the conveyor belt before and after the sorting process, comprising a novel setup that can be used for weakly-supervised segmentation.
The Segmenting and Tracking Every Pixel (STEP) benchmark consists of 21 training sequences and 29 test sequences. It is based on the KITTI Tracking Evaluation and the Multi-Object Tracking and Segmentation (MOTS) benchmark. This benchmark extends the annotations to the Segmenting and Tracking Every Pixel (STEP) task. [Copy-pasted from http://www.cvlibs.net/datasets/kitti/eval_step.php]
20 PAPERS • 2 BENCHMARKS
BURST is a benchmark suite built upon TAO that requires tracking and segmenting multiple objects from camera video. It defines the following tasks:
Class-guided
- Common: track and segment all objects belonging to a set of 78 common classes (based on the COCO class set).
- Long-tail: track and segment all objects belonging to an extended set of 482 object classes (class label predictions are not required).
Exemplar-guided
- Mask: track and segment all objects in the video for which first-frame object masks are given. This task is identical to Video Object Segmentation (VOS).
- Box: track and segment all objects in the video for which first-frame object bounding boxes are given.
- Point: track and segment all objects in the video for which we are only given the (x, y) point coordinates of the mask centroid in the first frame in which the object appears.
14 PAPERS • 5 BENCHMARKS
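The point prompt in BURST's exemplar-guided Point task is stated to be the (x, y) centroid of the first-frame mask. A minimal sketch of deriving such a point from a binary mask (the function name is an assumption for illustration):

```python
import numpy as np

def mask_centroid_xy(mask: np.ndarray) -> tuple:
    """Return the (x, y) centroid of a binary mask, i.e. the mean
    column and row coordinates of the foreground pixels."""
    ys, xs = np.nonzero(mask)          # row and column indices of 1-pixels
    return (float(xs.mean()), float(ys.mean()))

# 5x5 mask with a 3x3 square of foreground pixels centred at (2, 2).
mask = np.zeros((5, 5), dtype=np.uint8)
mask[1:4, 1:4] = 1
x, y = mask_centroid_xy(mask)  # → (2.0, 2.0)
```

Note that for non-convex masks the centroid may fall outside the object, which is one reason point prompts are a harder cue than full masks or boxes.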
Consists of over 50,000 frames and includes high-definition images with full-resolution depth information, semantic segmentation (images), point-wise segmentation (point clouds), and detailed annotations.
12 PAPERS • NO BENCHMARKS YET
An instance segmentation dataset of yeast cells in microstructures. The dataset includes 493 densely annotated microscopy images. For more information see the paper "An Instance Segmentation Dataset of Yeast Cells in Microstructures".
…All frames are available both as RGB images and semantic segmentations. The RGB images are non-photorealistic, being rendered by a game engine, while the semantic segmentations are similar to real-world segmentations.
Audi Autonomous Driving Dataset (A2D2) consists of simultaneously recorded images and 3D point clouds, together with 3D bounding boxes, semantic segmentation, instance segmentation, and data extracted from the automotive bus.
61 PAPERS • NO BENCHMARKS YET
CAMO++ is a dataset for camouflaged object segmentation. This dataset increases the number of images with hierarchical pixel-wise ground-truths. The authors also provide a benchmark suite for the task of camouflaged instance segmentation.
6 PAPERS • NO BENCHMARKS YET
…Also, it provides sky segmentation masks, instance segmentation masks as well as invalid pixel masks.
YouTubeVIS is a new dataset tailored for tasks like simultaneous detection, segmentation and tracking of object instances in videos, and is collected based on the current largest video object segmentation dataset.
145 PAPERS • 2 BENCHMARKS
The AIRS (Aerial Imagery for Roof Segmentation) dataset provides a wide coverage of aerial imagery with 7.5 cm resolution and contains over 220,000 buildings. The task posed for AIRS is defined as roof segmentation.
8 PAPERS • 1 BENCHMARK
The fetoscopy placenta dataset is associated with our MICCAI2020 publication titled “Deep Placental Vessel Segmentation for Fetoscopic Mosaicking”. The dataset contains 483 frames with ground-truth vessel segmentation annotations taken from six different in vivo fetoscopic procedure videos. The dataset also includes six unannotated in vivo continuous fetoscopic video clips (950 frames) with predicted vessel segmentation maps obtained from the leave-one-out cross-validation of our method. We annotate a binary mask for vessel segmentation using the Pixel Annotation Tool.
The goal of the REFUGE2 challenge is to evaluate and compare automated algorithms for glaucoma detection and optic disc/cup segmentation on a standard dataset of retinal fundus images. We invite the medical image analysis community to participate by developing and testing existing and novel automated classification and segmentation methods. The REFUGE2 challenge consists of THREE tasks:
- Classification of clinical glaucoma
- Segmentation of optic disc and cup
- Localization of fovea (macular center)
Dataset for cough segmentation.
We present a large and diverse abdominal CT organ segmentation dataset, termed AbdomenCT-1K, with more than 1000 (1K) CT scans from 12 medical centers, including multi-phase, multi-vendor, and multi-disease cases. Furthermore, we conduct a large-scale study for liver, kidney, spleen, and pancreas segmentation and reveal the unsolved segmentation problems of the SOTA methods, such as limited generalization ability. To advance the unsolved problems, we further build four organ segmentation benchmarks for fully supervised, semi-supervised, weakly supervised, and continual learning, which are currently challenging. We believe the AbdomenCT-1K dataset will promote future in-depth research towards clinically applicable abdominal organ segmentation methods.
23 PAPERS • NO BENCHMARKS YET
The Person In Context (PIC) dataset is a dataset for human-centric relation segmentation (HRS), which contains 17,122 high-resolution images with densely annotated entity segmentation and relations.
LASIESTA (Labeled and Annotated Sequences for Integral Evaluation of SegmenTation Algorithms) is a segmentation and detection dataset composed of many real indoor and outdoor sequences organized into categories.
Dataset for one-shot segmentation.
DanbooRegion is a dataset consisting of 5377 in-the-wild illustrations downloaded from Danbooru2018, paired with region segment map annotations. Illustrations are provided as 1024 px 8-bit RGB images, and region segment maps as int-32 index images.