Traditionally, 3D indoor scene reconstruction from posed images happens in two phases: per-image depth estimation, followed by depth merging and surface reconstruction.
Per-pixel ground-truth depth data is challenging to acquire at scale.
Ranked #3 on Monocular Depth Estimation on VA (Virtual Apartment)
One strategy for mitigating noise in a low-light situation is to increase the shutter time of the camera, thus allowing each photosite to integrate more light and decrease noise variance.
Learning based methods have shown very promising results for the task of depth estimation in single images.
Ranked #4 on Monocular Depth Estimation on Mid-Air Dataset