no code implementations • 12 Apr 2024 • Yuqun Wu, Jae Yong Lee, Chuhang Zou, Shenlong Wang, Derek Hoiem
Our experiments show 4x the performance of RegNeRF and 8x that of FreeNeRF on average F1@2cm for ETH3D MVS benchmark, suggesting a fruitful research direction to improve the geometric accuracy of NeRF-based models, and sheds light on a potential future approach to enable NeRF-based optimization to eventually outperform traditional MVS.
no code implementations • 2 Dec 2022 • Jae Yong Lee, Yuqun Wu, Chuhang Zou, Shenlong Wang, Derek Hoiem
Instead, we propose to encode features in bins of Fourier features that are commonly used for positional encoding.
no code implementations • 14 Oct 2022 • Jae Yong Lee, Chuhang Zou, Derek Hoiem
Recent work in multi-view stereo (MVS) combines learnable photometric scores and regularization with PatchMatch-based optimization to achieve robust pixelwise estimates of depth, normals, and visibility.
no code implementations • 14 Jun 2022 • Rania Briq, Chuhang Zou, Leonid Pishchulin, Chris Broaddus, Juergen Gall
We consider the problem of synthesizing multi-action human motion sequences of arbitrary lengths.
1 code implementation • ICCV 2021 • Jae Yong Lee, Joseph DeGol, Chuhang Zou, Derek Hoiem
To overcome the challenge of the non-differentiable PatchMatch optimization that involves iterative sampling and hard decisions, we use reinforcement learning to minimize expected photometric cost and maximize likelihood of ground truth depth and normals.
no code implementations • 18 Feb 2020 • Donghyun Kim, Tian Lan, Chuhang Zou, Ning Xu, Bryan A. Plummer, Stan Sclaroff, Jayan Eledath, Gerard Medioni
We embed the attention module in a ``slow-fast'' architecture, where the slower network runs on sparsely sampled keyframes and the light-weight shallow network runs on non-keyframes at a high frame rate.
1 code implementation • 21 Oct 2019 • Mao-Chuang Yeh, Shuai Tang, Anand Bhattad, Chuhang Zou, David Forsyth
Style transfer methods produce a transferred image which is a rendering of a content image in the manner of a style image.
3 code implementations • 9 Oct 2019 • Chuhang Zou, Jheng-Wei Su, Chi-Han Peng, Alex Colburn, Qi Shan, Peter Wonka, Hung-Kuo Chu, Derek Hoiem
Recent approaches for predicting layouts from 360 panoramas produce excellent results.
1 code implementation • 3 Sep 2019 • Theerasit Issaranon, Chuhang Zou, David Forsyth
We describe a method that predicts, from a single RGB image, a depth map that describes the scene when a masked object is removed - we call this "counterfactual depth" that models hidden scene geometry together with the observations.
1 code implementation • 29 Jul 2019 • Chuhang Zou, Derek Hoiem
One major challenge in 3D reconstruction is to infer the complete shape geometry from partial foreground occlusions.
2 code implementations • CVPR 2018 • Chuhang Zou, Alex Colburn, Qi Shan, Derek Hoiem
We propose an algorithm to predict room layout from a single image that generalizes across panoramas and perspective images, cuboid layouts and more general layouts (e. g. L-shape room).
1 code implementation • 25 Oct 2017 • Chuhang Zou, Ruiqi Guo, Zhizhong Li, Derek Hoiem
In this paper, we aim to interpret indoor scenes from one RGBD image.
2 code implementations • ICCV 2017 • Chuhang Zou, Ersin Yumer, Jimei Yang, Duygu Ceylan, Derek Hoiem
The success of various applications including robotics, digital content creation, and visualization demand a structured and abstract representation of the 3D world from limited sensor data.
1 code implementation • 9 Apr 2015 • Ruiqi Guo, Chuhang Zou, Derek Hoiem
One major goal of vision is to infer physical models of objects, surfaces, and their layout from sensors.