2 dataset results for Multi-Task Learning AND RGB-D

NYU-Depth V2

The NYU-Depth V2 dataset comprises video sequences from a variety of indoor scenes, recorded by both the RGB and depth cameras of the Microsoft Kinect.

845 PAPERS • 20 BENCHMARKS

Hypersim

For many fundamental scene understanding tasks, it is difficult or impossible to obtain per-pixel ground truth labels from real images. Hypersim is a photorealistic synthetic dataset for holistic indoor scene understanding. It contains 77,400 images of 461 indoor scenes with detailed per-pixel labels and corresponding ground truth geometry.

61 PAPERS • 1 BENCHMARK
