no code implementations • CVPR 2024 • Jaeho Moon, Juan Luis Gonzalez Bello, Byeongjun Kwon, Munchurl Kim
Subsequently, in the fine training stage, we refine the DE network to learn the detailed depth of the objects from the reprojection loss, while ensuring accurate DE on the moving object regions by employing our regularization loss with a cost-volume-based weighting factor.
no code implementations • 13 Dec 2023 • Juan Luis Gonzalez Bello, Minh-Quan Viet Bui, Munchurl Kim
Recent advances in neural rendering have shown that, albeit slow, implicit compact models can learn a scene's geometries and view-dependent appearances from multiple views.
no code implementations • CVPR 2024 • Juan Luis Gonzalez Bello, Munchurl Kim
In this paper, we firstly consider view-dependent effects into single image-based novel view synthesis (NVS) problems.
no code implementations • 18 May 2022 • Juan Luis Gonzalez Bello, Jaeho Moon, Munchurl Kim
Recently, much attention has been drawn to learning the underlying 3D structures of a scene from monocular videos in a fully self-supervised fashion.
1 code implementation • CVPR 2021 • Juan Luis Gonzalez Bello, Munchurl Kim
Our PLADE-Net is based on a new network architecture with neural positional encoding and a novel loss function that borrows from the closed-form solution of the matting Laplacian to learn pixel-level accurate depth estimation from stereo images.
no code implementations • ICLR 2020 • Juan Luis Gonzalez Bello, Munchurl Kim
Our proposed network architecture, the monster-net, is devised with a novel t-shaped adaptive kernel with globally and locally adaptive dilation, which can efficiently incorporate global camera shift into and handle local 3D geometries of the target image's pixels for the synthesis of naturally looking 3D panned views when a 2-D input image is given.
no code implementations • 2 Oct 2019 • Juan Luis Gonzalez Bello, Munchurl Kim
Our proposed network architecture, the monster-net, is devised with a novel "t-shaped" adaptive kernel with globally and locally adaptive dilation, which can efficiently incorporate global camera shift into and handle local 3D geometries of the target image's pixels for the synthesis of naturally looking 3D panned views when a 2-D input image is given.
no code implementations • 20 Sep 2019 • Juan Luis Gonzalez Bello, Munchurl Kim
The 3D-zoom operation is the positive translation of the camera in the Z-axis, perpendicular to the image plane.
no code implementations • 20 Mar 2019 • Juan Luis Gonzalez Bello, Munchurl Kim
Convolutional neural networks (CNN) have shown state-of-the-art results for low-level computer vision problems such as stereo and monocular disparity estimations, but still, have much room to further improve their performance in terms of accuracy, numbers of parameters, etc.