Stereo Magnification with Multi-Layer Images

Representing scenes with multiple semi-transparent colored layers has been a popular and successful choice for real-time novel view synthesis. Existing approaches infer colors and transparency values over regularly-spaced layers of planar or spherical shape. In this work, we introduce a new view synthesis approach based on multiple semi-transparent layers with scene-adapted geometry. Our approach infers such representations from stereo pairs in two stages. The first stage infers the geometry of a small number of data-adaptive layers from a given pair of views. The second stage infers the color and the transparency values for these layers producing the final representation for novel view synthesis. Importantly, both stages are connected through a differentiable renderer and are trained in an end-to-end manner. In the experiments, we demonstrate the advantage of the proposed approach over the use of regularly-spaced layers with no adaptation to scene geometry. Despite being orders of magnitude faster during rendering, our approach also outperforms a recently proposed IBRNet system based on implicit geometry representation. See results at https://samsunglabs.github.io/StereoLayers .

PDF Abstract CVPR 2022 PDF CVPR 2022 Abstract

Datasets


Introduced in the Paper:

SWORD

Used in the Paper:

LLFF RealEstate10K

Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Novel View Synthesis SWORD StereoLayers (8 layers) PSNR 25.54 # 2
SSIM 0.79 # 2
LPIPS 0.113 # 1
Novel View Synthesis SWORD StereoLayers (2 layers) PSNR 25.28 # 3
SSIM 0.78 # 3
LPIPS 0.102 # 2
Novel View Synthesis SWORD StereoLayers PSNR 25.95 # 1
SSIM 0.81 # 1
LPIPS 0.096 # 3

Methods