IB-MVS: An Iterative Algorithm for Deep Multi-View Stereo based on Binary Decisions

29 Nov 2021  ·  Christian Sormann, Mattia Rossi, Andreas Kuhn, Friedrich Fraundorfer ·

We present a novel deep-learning-based method for Multi-View Stereo. Our method estimates high resolution and highly precise depth maps iteratively, by traversing the continuous space of feasible depth values at each pixel in a binary decision fashion. The decision process leverages a deep-network architecture: this computes a pixelwise binary mask that establishes whether each pixel actual depth is in front or behind its current iteration individual depth hypothesis. Moreover, in order to handle occluded regions, at each iteration the results from different source images are fused using pixelwise weights estimated by a second network. Thanks to the adopted binary decision strategy, which permits an efficient exploration of the depth space, our method can handle high resolution images without trading resolution and precision. This sets it apart from most alternative learning-based Multi-View Stereo methods, where the explicit discretization of the depth space requires the processing of large cost volumes. We compare our method with state-of-the-art Multi-View Stereo methods on the DTU, Tanks and Temples and the challenging ETH3D benchmarks and show competitive results.

PDF Abstract

Results from the Paper

Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
3D Reconstruction DTU IB-MVS Acc 0.334 # 5
Overall 0.321 # 4
Comp 0.309 # 6
Point Clouds Tanks and Temples IB-MVS Mean F1 (Intermediate) 56.02 # 7
Mean F1 (Advanced) 31.96 # 4


No methods listed for this paper. Add relevant methods here