TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
3D Reconstruction	DTU	GBi-Net	Acc	0.312	# 4
3D Reconstruction	DTU	GBi-Net	Overall	0.303	# 7
3D Reconstruction	DTU	GBi-Net	Comp	0.293	# 12
Point Clouds	Tanks and Temples	GBi-Net	Mean F1 (Intermediate)	61.42	# 11

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/generalized-binary-search-network-for-highly/3d-reconstruction-on-dtu)](https://paperswithcode.com/sota/3d-reconstruction-on-dtu?p=generalized-binary-search-network-for-highly)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/generalized-binary-search-network-for-highly/point-clouds-on-tanks-and-temples)](https://paperswithcode.com/sota/point-clouds-on-tanks-and-temples?p=generalized-binary-search-network-for-highly)`

Generalized Binary Search Network for Highly-Efficient Multi-View Stereo

CVPR 2022 · Zhenxing Mi, Di Chang, Dan Xu

Multi-view Stereo (MVS) with known camera parameters is essentially a 1D search problem within a valid depth range. Recent deep learning-based MVS methods typically sample depth hypotheses densely in the depth range and then construct prohibitively memory-consuming 3D cost volumes for depth prediction. Although coarse-to-fine sampling strategies alleviate this overhead to a certain extent, the efficiency of MVS is still an open challenge. In this work, we propose a novel method for highly efficient MVS that remarkably decreases the memory footprint while clearly advancing state-of-the-art depth prediction performance. We investigate which search strategy can be reasonably optimal for MVS, taking into account both efficiency and effectiveness. We first formulate MVS as a binary search problem and accordingly propose a generalized binary search network for MVS. Specifically, in each step, the depth range is split into 2 bins, with 1 extra error-tolerance bin on each side. A classification is performed to identify which bin contains the true depth. We also design three mechanisms to handle classification errors, deal with out-of-range samples, and decrease the training memory, respectively. The new formulation makes our method sample only a very small number of depth hypotheses in each step, which is highly memory-efficient and also greatly facilitates quick training convergence. Experiments on competitive benchmarks show that our method achieves state-of-the-art accuracy with much less memory. In particular, our method obtains an overall score of 0.289 on the DTU dataset and ranks first on the challenging Tanks and Temples advanced dataset among all learning-based methods. The trained models and code will be released at https://github.com/MiZhenxing/GBi-Net.
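
The search step described in the abstract can be illustrated with a toy, single-pixel sketch. This is not the authors' implementation: the function names (`make_bins`, `oracle`, `flaky`, `search_depth`) are hypothetical, an oracle stands in for the network's per-step classification, and the update rule (the selected bin becomes the next depth range) is an assumption made for illustration. The three mechanisms mentioned in the abstract are not reproduced here.

```python
import math


def make_bins(lo, hi):
    """Split [lo, hi] into 2 bins and add 1 tolerance bin on each side.

    Returns 5 edges that delimit 4 equal-width bins:
    [lo - w, lo), [lo, mid), [mid, hi), [hi, hi + w), with w = (hi - lo) / 2.
    """
    w = (hi - lo) / 2.0
    return [lo - w + i * w for i in range(5)]


def oracle(edges, true_depth, step):
    """Stand-in for the network: index of the bin containing true_depth."""
    w = edges[1] - edges[0]
    k = math.floor((true_depth - edges[0]) / w)
    return min(max(k, 0), len(edges) - 2)  # clamp to a valid bin index


def flaky(edges, true_depth, step):
    """Like oracle, but picks the neighbouring bin in the first step."""
    k = oracle(edges, true_depth, step)
    if step == 0:
        k = min(k + 1, len(edges) - 2)
    return k


def search_depth(true_depth, d_min, d_max, steps, classify=oracle):
    """Narrow the depth range by repeated 4-way classification.

    Assumption: the selected bin becomes the depth range of the next step.
    Returns the centre of the final range as the depth estimate.
    """
    lo, hi = d_min, d_max
    for step in range(steps):
        edges = make_bins(lo, hi)
        k = classify(edges, true_depth, step)
        lo, hi = edges[k], edges[k + 1]
    return 0.5 * (lo + hi)


if __name__ == "__main__":
    # Arbitrary illustrative values, not taken from any dataset.
    print(search_depth(7.9, 0.0, 16.0, steps=6))
    print(search_depth(7.9, 0.0, 16.0, steps=6, classify=flaky))
```

In this sketch each step evaluates only four bins, and the final range has width `(d_max - d_min) / 2**steps`. The `flaky` classifier shows why the tolerance bins are useful: after a wrong choice in the first step, the true depth lies just outside the new range, and the side bin of the next step can pick it up again.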

Code

mizhenxing/gbi-net official

123

Tasks

3D Reconstruction

Depth Estimation

Depth Prediction

Point Clouds

Datasets

DTU

BlendedMVS

Tanks and Temples

Results from the Paper

Ranked #7 on 3D Reconstruction on DTU
