🔔 Share your dataset with the ML community!

Filter by Modality

Filter by Task

Filter by Language

676 dataset results for segmentation

…Since the dataset is collected ‘in the wild’, the speech segments are corrupted with real world noise including laughter, cross-talk, channel effects, music and other sounds.

496 PAPERS • 5 BENCHMARKS

Virtual KITTI

…synthetic video dataset designed to learn and evaluate computer vision models for several video understanding tasks: object detection and multi-object tracking, scene-level and instance-level semantic segmentation

120 PAPERS • 1 BENCHMARK

Synth-Colon

Synthetic dataset for polyp segmentation. It is the first dataset generated using zero annotations from medical professionals. The dataset is composed of 20 000 images with a resolution of 500×500.

2 PAPERS • NO BENCHMARKS YET

OMD (Oxford Multimotion Dataset)

…The dataset culminates in a complex toy car segment representative of many challenging real-world scenarios.

3 PAPERS • NO BENCHMARKS YET

CBIS-DDSM

CBIS-DDSM (Curated Breast Imaging Subset of Digital Database for Screening Mammography)

…Updated ROI segmentation and bounding boxes, and pathologic diagnosis for training data are also included. Finally, the ROI annotations for the abnormalities in the DDSM were provided to indicate a general position of lesions, but not a precise segmentation for them. Therefore, many researchers must implement segmentation algorithms for accurate feature extraction.

11 PAPERS • 2 BENCHMARKS

USC-GRAD-STDdb

USC-GRAD-STDdb (Small Target Detection database)

USC-GRAD-STDdb comprises 115 video segments containing more than 25,000 annotated frames of HD 720p resolution (≈1280x720) with small objects of interest from 16 (≈4x4) to 256 (≈16x16) as pixel area.

1 PAPER • 1 BENCHMARK

MECCANO

…Video Acquisition: 1920x1080 at 12.00 fps 11 training videos and 9 validation/test videos 8857 video segments temporally annotated indicating the verbs which describe the actions performed 64349 active

14 PAPERS • 3 BENCHMARKS

MLRSNet

…The dataset can be used for multi-label based image classification, multi-label based image retrieval, and image segmentation.

11 PAPERS • 1 BENCHMARK

BU-BIL (Boston University Biomedical Image Library)

…Paper: How to Collect Segmentations for Biomedical Images? A Benchmark Evaluating the Performance of Experts, Crowdsourced Non-Experts, and Algorithms

1 PAPER • NO BENCHMARKS YET

OCID (Object Clutter Indoor Dataset)

…The main purpose of OCID is to allow systematic comparison of existing object segmentation methods in scenes with increasing amount of clutter.

22 PAPERS • 1 BENCHMARK

360+x: A Panoptic Multi-modal Scene Understanding Dataset

…Annotation: Scene class label and Temporal segments label. Metadata: including textual scene descriptions, weather conditions, capture time, and GPS information.

1 PAPER • NO BENCHMARKS YET

PapioVoc

PapioVoc (Guinea baboon vocalizations dataset automatically extracted with a deep neural network from natural audio recordings)

…A convolutional neural network (CNN) was used on these large and noisy audio recordings to automatically extract segments of sound containing a baboon vocal production by following the method of Bonafos The resulting dataset consists of one-second to several-minute wav files of automatically detected vocalizations segments. The dataset consists of the segments predicted by the CNN to contain a baboon vocalization. Windows containing signal less than one second apart were merged into a single vocalization. If the time windows that follow a vocalization also contain a vocalization, then the signal they contain is added to the first segment for which a vocalization has been detected. As soon as a one-second segment no longer contains a signal corresponding to a vocalization, the wav file is closed.

1 PAPER • NO BENCHMARKS YET

DiDeMo (Distinct Describable Moments)

…The videos in the dataset are divided into 5-second segments to reduce the complexity of annotation.

183 PAPERS • 3 BENCHMARKS

RadioGalaxyNET Dataset

…Each instance furnishes details about the extended radio galaxy class, a bounding box covering all components, a pixel-level segmentation mask, and the keypoint position of the corresponding infrared host

1 PAPER • 1 BENCHMARK

Localized Narratives

…This dense visual grounding takes the form of a mouse trace segment per word and is unique to our data.

55 PAPERS • 5 BENCHMARKS

TICaM (Time-of-flight In-car Cabin Monitoring)

…It consists of an exhaustive list of actions performed while driving and multi-modal labeled images (depth, RGB and IR), with complete annotations for 2D and 3D object detection, instance and semantic segmentation

5 PAPERS • NO BENCHMARKS YET

Video2GIF

…IDs and URLs of the GIFs and the videos are provided, along with temporal alignment of GIF segments to their source videos.

11 PAPERS • NO BENCHMARKS YET

CALLHOME American English Speech

…Transcripts: The transcripts cover contiguous 5 or 10-minute segments from recorded conversations. Speaker Awareness: All speakers were aware that they were being recorded.

11 PAPERS • 7 BENCHMARKS

EVICAN

…With 4600 images and ∼26 000 segmented cells, our collection offers an unparalleled heterogeneous training dataset for cell biology deep learning application development.

2 PAPERS • 1 BENCHMARK

ELAI-Dust Storm (ELAI Dust Storm Dataset from MODIS)

…Inspiration Could the MODIS true-colour satellite images be utilized for detecting dust storms with higher accuracy and segmentation capability? The associated notebook simply presents the image data visualization, statistical data augmentation and a U-Net-based model to detect dust storms in a semantic segment fashion. Research Ideas Would the latest state-of-the-art segmentation models increase the performance of detecting dust storms in satellite true-colour images? Few-shot learning for dust storm segmentation and related self-supervised learning techniques What is the role of ensemble learning in improving model performance?

0 PAPER • NO BENCHMARKS YET

Simulated micro-Doppler Signatures

…Dataset can be easily used for supervised classification, out-of-distribution detection (near and far), unsupervised learning and modulation pattern segmentation.

1 PAPER • NO BENCHMARKS YET

MIR-1K

…accompaniment and the singing voice recorded as left and right channels, respectively, Manual annotations of pitch contours in semitone, indices and types for unvoiced frames, lyrics, and vocal/non-vocal segments

20 PAPERS • NO BENCHMARKS YET

Biwi 3D Audiovisual Corpus of Affective Communication - B3D(AC)^2 (BIWI 3D)

…In order to ease automatic speech segmentation, we carried out the recordings in a anechoic room, with walls covered by sound wave-absorbing materials.

5 PAPERS • 1 BENCHMARK

CIP (Complete Inertial Pose)

…It provides data for the analysis of the complete inertial pose pipeline, from raw measurements, to sensor-to-segment calibration, multi-sensor fusion, skeleton kinematics, to the complete human pose.

1 PAPER • NO BENCHMARKS YET

DoPose (Dortmund 6D Pose dataset)

…The dataset includes RGB images, Depth images, 6D Pose of objects, segmentation mask (all and visible), COCO Json annotation, camera transformations, and 3D model of all objects.

1 PAPER • NO BENCHMARKS YET

MARIDA (Marine Debris Archive)

…Although it is designed to be beneficial for several machine learning tasks, it primarily aims to benchmark weakly supervised pixel-level semantic segmentation learning methods.

6 PAPERS • 1 BENCHMARK

GLips (German Lips)

…Additionally, the complete TextGrid files containing the segmentation information of those sessions are also included. The size of the uncompressed dataset is 15GB.

5 PAPERS • NO BENCHMARKS YET

THEODORE (Learning from THEODORE)

…Beside capturing fisheye images from virtual environments we create annotations for semantic segmentation, instance masks and bounding boxes for object detection tasks.

7 PAPERS • NO BENCHMARKS YET

SESYD Dataset (Systems Evaluation SYnthetic Documents)

…This database targets two main research problems in the document image analysis field (i) symbol recognition and spotting in line drawing images (floorplans and electrical diagrams) (ii) character segmentation

0 PAPER • NO BENCHMARKS YET

HA-ViD (HA-ViD: A Human Assembly Video Dataset)

…We benchmark four foundational video understanding tasks: action recognition, action segmentation, object detection and multi-object tracking.

1 PAPER • NO BENCHMARKS YET

Endomapper

…Meta-data and annotations associated to the dataset varies from anatomical landmark and description of the procedure labeling, tools segmentation masks, COLMAP 3D reconstructions, simulated sequences with

12 PAPERS • NO BENCHMARKS YET

XImageNet-12 (XIMAGENET-12: An Explainable AI Benchmark Dataset for Model Robustness Evaluation)

…With the following topics: Blur Background / Segmented Background / AI generated Background/ Bias of tools during annotation/ Color in Background / Dependent Factor in Background/ LatenSpace Distance of

5 PAPERS • 1 BENCHMARK

EPIC-KITCHENS-100

…EPIC-KITCHENS-55), EPIC-KITCHENS-100 has been annotated using a novel pipeline that allows denser (54% more actions per minute) and more complete annotations of fine-grained actions (+128% more action segments

137 PAPERS • 7 BENCHMARKS

HuGaDB

…for analysis and activity recognition consisting of continues recordings of combined activities, such as walking, running, taking stairs up and down, sitting down, and so on; and the data recorded are segmented

6 PAPERS • NO BENCHMARKS YET

VizDoom

…During the game, each player can access various observations, including the first-person view screen pixels, the corresponding depth-map and segmentation-map (pixel-wise object labels), the bird-view maze

151 PAPERS • 3 BENCHMARKS

INSANE Cross-Domain UAV Data Set (Cross-Domain UAV Data Sets with Increased Number of Sensors for developing Advanced and Novel Estimators)

…The cross-domain outdoor to indoor transition segments are especially challenging because of realistic sensor behavior such as GNSS degradation and dropouts, changes in the measured magnetic field, and flight scenario, such as the transition data, which requires sensor switching, or the Mars analog data with higher velocities, multiple touchdowns, challenging ground structures or constant velocity segments

1 PAPER • NO BENCHMARKS YET

FSC-P2 (Fearless Steps Challenge Phase2)

…This (FS-02) edition of the FEARLESS STEPS Challenge includes the following 6 tasks --- TASK 1: Speech Activity Detection (SAD) TASK 2: Speaker Identification (using Speaker Segments Track 2: ASR using Diarized Segments (ASR_track2)

1 PAPER • NO BENCHMARKS YET

Workshop Tools Dataset

…contains 20 common workshop tools, and for each object: - a watertight triangular surface mesh; - a synthetic colored surface point-cloud; - ground truth inertial parameters; - ground truth part-level segmentation by Open3D element vertex 2000 property float32 x property float32 y property float32 z property float32 red property float32 green property float32 blue property uint8 segmentation please cite our paper: @inproceedings{Nadeau_PartSegForInertialIdent_2023, AUTHOR = {Philippe Nadeau AND Matthew Giamou AND Jonathan Kelly}, TITLE = { {The Sum of Its Parts: Visual Part Segmentation

1 PAPER • NO BENCHMARKS YET

WebNLG

…natural language generation challenge which consists of mapping the sets of triplets to text, including referring expression generation, aggregation, lexicalization, surface realization, and sentence segmentation

143 PAPERS • 17 BENCHMARKS

BIMCV COVID-19

…In addition, 23 images were annotated by a team of expert radiologists to include semantic segmentation of radiographic findings.

8 PAPERS • NO BENCHMARKS YET

FLAIR (French Land cover from Aerospace ImageRy)

…Notably, deep learning methods are employed to obtain a semantic segmentation of aerial images.

1 PAPER • 1 BENCHMARK

KITTI

…Despite its popularity, the dataset itself does not contain ground truth for semantic segmentation. However, various researchers have manually annotated parts of the dataset to fit their necessities.

3,244 PAPERS • 141 BENCHMARKS

NERDS 360 (NeRF for Reconstruction, Decomposition and Scene Synthesis of 360° outdoor scenes)

…Tasks our dataset support: Generaliazable Novel view synthesis (Few shot evaluation) Novel view synthesis (Overfitting evaluation) 6D pose estimation Object editing Depth estimation Semantic Segmentation Instance Segmentation

3 PAPERS • 1 BENCHMARK

EPISURG

EPISURG (EPISURG: a dataset of postoperative MRI for quantitative analysis of resection neurosurgery for refractory epilepsy)

…Three human raters segmented the resection cavity on partially overlapping subsets of EPISURG: Rater 1: 133 subjects (researcher in neuroimaging) Rater 2: 34 subjects (clinical research fellow) Rater dataset for your research please cite the following publications: Pérez-García F., Rodionov R., Alim-Marvasti A., Sparks R., Duncan J.S., Ourselin S. (2020) Simulation of Brain Resection for Cavity Segmentation

2 PAPERS • NO BENCHMARKS YET

LabPics (LabPics Dataset for computer vision for autonomous chemistry labs and medical labs)

LabPics Chemistry Dataset Dataset for computer vision for materials segmentation and classification in chemistry labs, medical labs, and any setting where materials are handled inside containers. In addition to instance segmentation maps, the dataset also includes semantic segmentation maps that give each pixel in the image all the classes to which it belongs.

5 PAPERS • NO BENCHMARKS YET

BPCIS (Bacterial Phase Contrast for Instance Segementation)

BPCIS is collection of 364 bacterial phase contrast images and corresponding label matrices for instance segmentation. Labels were made according to fluorescence channels where possible.

1 PAPER • NO BENCHMARKS YET

Smarty4covid

Smarty4covid (The smarty4covid dataset and knowledge base: a framework enabling interpretable analysis of audio signals)

…It has been utilized towards the development of models able to: (i) extract clinically informative respiratory indicators from regular breathing records, and (ii) identify cough, breath and voice segments

1 PAPER • NO BENCHMARKS YET

nuScenes LiDAR only

…Image based benchmark datasets have driven development in computer vision tasks such as object detection, tracking and segmentation of agents in the environment.

9 PAPERS • 2 BENCHMARKS

Datasets

676 dataset results for segmentation