GC-MVSNet: Multi-View, Multi-Scale, Geometrically-Consistent Multi-View Stereo

30 Oct 2023 · Vibhas K. Vats, Sripad Joshi, David J. Crandall, Md. Alimoor Reza, Soon-Heung Jung

Traditional multi-view stereo (MVS) methods rely heavily on photometric and geometric consistency constraints, but newer machine learning-based MVS methods check geometric consistency across multiple source views only as a post-processing step. In this paper, we present a novel approach that explicitly encourages geometric consistency of reference view depth maps across multiple source views at different scales during learning (see Fig. 1). We find that adding this geometric consistency loss significantly accelerates learning by explicitly penalizing geometrically inconsistent pixels, reducing the training iteration requirements to nearly half that of other MVS methods. Our extensive experiments show that our approach achieves a new state-of-the-art on the DTU and BlendedMVS datasets, and competitive results on the Tanks and Temples benchmark. To the best of our knowledge, GC-MVSNet is the first attempt to enforce multi-view, multi-scale geometric consistency during learning.
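To make "geometrically inconsistent pixels" concrete, here is a minimal NumPy sketch of the forward-backward reprojection check that learning-based MVS pipelines commonly run as post-processing, applied here to count how many source views disagree with each reference pixel. It is an illustration under stated assumptions, not the authors' implementation: the function names, the nearest-neighbour sampling, the world-to-camera convention for the 4x4 extrinsics, and the thresholds (1 pixel, 1% relative depth) are all hypothetical choices, and the abstract does not say how the penalty enters the training loss.

```python
import numpy as np


def reproject_through_source(depth_ref, K_ref, E_ref, depth_src, K_src, E_src):
    """Send reference pixels into one source view and back (hypothetical helper).

    K_* are 3x3 intrinsics; E_* are 4x4 world-to-camera extrinsics (assumed convention).
    Returns the back-projected x, y, depth in the reference view and a validity mask.
    """
    h, w = depth_ref.shape
    hs, ws = depth_src.shape
    n = h * w
    x, y = np.meshgrid(np.arange(w, dtype=np.float64), np.arange(h, dtype=np.float64))
    pix_ref = np.stack([x.ravel(), y.ravel(), np.ones(n)])

    # Reference pixels -> 3D points in the reference camera -> source camera.
    xyz_ref = np.linalg.inv(K_ref) @ (pix_ref * depth_ref.ravel())
    ref_to_src = E_src @ np.linalg.inv(E_ref)
    xyz_src = (ref_to_src @ np.vstack([xyz_ref, np.ones(n)]))[:3]
    proj = K_src @ xyz_src
    in_front = proj[2] > 1e-8
    z = np.where(in_front, proj[2], 1.0)
    u, v = proj[0] / z, proj[1] / z

    # Nearest-neighbour lookup of the source depth (bilinear sampling is more usual).
    ui, vi = np.rint(u).astype(int), np.rint(v).astype(int)
    inside = in_front & (ui >= 0) & (ui < ws) & (vi >= 0) & (vi < hs)
    d_src = depth_src[np.clip(vi, 0, hs - 1), np.clip(ui, 0, ws - 1)]

    # Back-project with the source depth and return to the reference camera.
    xyz_hit = np.linalg.inv(K_src) @ (np.stack([u, v, np.ones(n)]) * d_src)
    xyz_back = (np.linalg.inv(ref_to_src) @ np.vstack([xyz_hit, np.ones(n)]))[:3]
    proj_back = K_ref @ xyz_back
    depth_back = proj_back[2]
    zb = np.where(depth_back > 1e-8, depth_back, 1.0)
    x_back, y_back = proj_back[0] / zb, proj_back[1] / zb
    return (x_back.reshape(h, w), y_back.reshape(h, w),
            depth_back.reshape(h, w), inside.reshape(h, w))


def geometric_inconsistency(depth_ref, K_ref, E_ref, src_views,
                            pix_thresh=1.0, depth_thresh=0.01):
    """Per-pixel count of source views that disagree with the reference depth.

    src_views is a list of (depth_src, K_src, E_src) tuples. Thresholds are
    illustrative values, not taken from the paper.
    """
    h, w = depth_ref.shape
    x, y = np.meshgrid(np.arange(w), np.arange(h))
    penalty = np.zeros((h, w))
    for depth_src, K_src, E_src in src_views:
        xb, yb, db, inside = reproject_through_source(
            depth_ref, K_ref, E_ref, depth_src, K_src, E_src)
        pix_err = np.hypot(xb - x, yb - y)
        rel_err = np.abs(db - depth_ref) / np.maximum(depth_ref, 1e-8)
        inconsistent = ~inside | (pix_err > pix_thresh) | (rel_err > depth_thresh)
        penalty += inconsistent
    return penalty
```

A map like this could, for example, weight a per-pixel depth loss so that pixels rejected by more source views cost more. The sketch covers a single scale only; running it on the depth map predicted at each resolution would be one way to approximate the multi-scale behaviour the abstract describes.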


Results from the Paper


Task               Dataset            Model      Metric Name             Metric Value  Global Rank
3D Reconstruction  DTU                GC-MVSNet  Acc                     0.330         # 10
3D Reconstruction  DTU                GC-MVSNet  Overall                 0.295         # 4
3D Reconstruction  DTU                GC-MVSNet  Comp                    0.260         # 5
Point Clouds       Tanks and Temples  GC-MVSNet  Mean F1 (Intermediate)  62.74         # 7
Point Clouds       Tanks and Temples  GC-MVSNet  Mean F1 (Advanced)      38.74         # 7
