TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
3D Reconstruction	DTU	Cas-MVSNet	Acc	0.325	# 6
3D Reconstruction	DTU	Cas-MVSNet	Overall	0.355	# 16
3D Reconstruction	DTU	Cas-MVSNet	Comp	0.385	# 18
Point Clouds	Tanks and Temples	Cas-MVSNet	Mean F1 (Intermediate)	56.84	# 14
Point Clouds	Tanks and Temples	Cas-MVSNet	Mean F1 (Advanced)	31.12	# 12

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/cascade-cost-volume-for-high-resolution-multi/point-clouds-on-tanks-and-temples)](https://paperswithcode.com/sota/point-clouds-on-tanks-and-temples?p=cascade-cost-volume-for-high-resolution-multi)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/cascade-cost-volume-for-high-resolution-multi/3d-reconstruction-on-dtu)](https://paperswithcode.com/sota/3d-reconstruction-on-dtu?p=cascade-cost-volume-for-high-resolution-multi)`

Cascade Cost Volume for High-Resolution Multi-View Stereo and Stereo Matching

CVPR 2020 · Xiaodong Gu, Zhiwen Fan, Zuozhuo Dai, Siyu Zhu, Feitong Tan, Ping Tan ·

The deep multi-view stereo (MVS) and stereo matching approaches generally construct 3D cost volumes to regularize and regress the output depth or disparity. These methods are limited when high-resolution outputs are needed since the memory and time costs grow cubically as the volume resolution increases. In this paper, we propose a both memory and time efficient cost volume formulation that is complementary to existing multi-view stereo and stereo matching approaches based on 3D cost volumes. First, the proposed cost volume is built upon a standard feature pyramid encoding geometry and context at gradually finer scales. Then, we can narrow the depth (or disparity) range of each stage by the depth (or disparity) map from the previous stage. With gradually higher cost volume resolution and adaptive adjustment of depth (or disparity) intervals, the output is recovered in a coarser to fine manner. We apply the cascade cost volume to the representative MVS-Net, and obtain a 23.1% improvement on DTU benchmark (1st place), with 50.6% and 74.2% reduction in GPU memory and run-time. It is also the state-of-the-art learning-based method on Tanks and Temples benchmark. The statistics of accuracy, run-time and GPU memory on other representative stereo CNNs also validate the effectiveness of our proposed method.

PDF Abstract CVPR 2020 PDF CVPR 2020 Abstract

Code

Add Remove Mark official

alibaba/cascade-stereo official

440

apchenstu/mvsnerf

667

kwea123/CasMVSNet_pl

263

hz-ants/cascade-mvsnet

Tasks

Add Remove

3D Reconstruction

Point Clouds

Stereo Matching

Datasets

DTU

Middlebury

Tanks and Temples

Results from the Paper

Edit

Ranked #12 on Point Clouds on Tanks and Temples

Get a GitHub badge

Results from Other Papers

Task	Dataset	Model	Metric Name	Metric Value	Rank	Compare
3D Reconstruction	DTU	Cas-MVSNet	Acc	0.325	# 6	See all
			Overall	0.355	# 16	See all
			Comp	0.385	# 18	See all
Point Clouds	Tanks and Temples	Cas-MVSNet	Mean F1 (Intermediate)	56.84	# 14	See all
Point Clouds	Tanks and Temples	Cas-MVSNet	Mean F1 (Advanced)	31.12	# 12	See all

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Cascade Cost Volume for High-Resolution Multi-View Stereo and Stereo Matching

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit