no code implementations • 17 Nov 2024 • HaoYu Wu, Meher Gitika Karumuri, Chuhang Zou, Seungbae Bang, Yuelong Li, Dimitris Samaras, Sunil Hadap
Current image-to-3D approaches suffer from high computational costs and lack scalability for high-resolution outputs.
no code implementations • 24 Sep 2024 • Jae Yong Lee, Yuqun Wu, Chuhang Zou, Derek Hoiem, Shenlong Wang
The goal of this paper is to encode a 3D scene into an extremely compact representation from 2D images and to enable its transmittance, decoding and rendering in real-time across various platforms.
no code implementations • 12 Apr 2024 • Yuqun Wu, Jae Yong Lee, Chuhang Zou, Shenlong Wang, Derek Hoiem
The latest regularized Neural Radiance Field (NeRF) approaches produce poor geometry and view extrapolation for large scale sparse view scenes, such as ETH3D.
no code implementations • 2 Dec 2022 • Jae Yong Lee, Yuqun Wu, Chuhang Zou, Shenlong Wang, Derek Hoiem
Instead, we propose to encode features in bins of Fourier features that are commonly used for positional encoding.
no code implementations • 14 Oct 2022 • Jae Yong Lee, Chuhang Zou, Derek Hoiem
Recent work in multi-view stereo (MVS) combines learnable photometric scores and regularization with PatchMatch-based optimization to achieve robust pixelwise estimates of depth, normals, and visibility.
no code implementations • 14 Jun 2022 • Rania Briq, Chuhang Zou, Leonid Pishchulin, Chris Broaddus, Juergen Gall
We consider the problem of synthesizing multi-action human motion sequences of arbitrary lengths.
1 code implementation • ICCV 2021 • Jae Yong Lee, Joseph DeGol, Chuhang Zou, Derek Hoiem
To overcome the challenge of the non-differentiable PatchMatch optimization that involves iterative sampling and hard decisions, we use reinforcement learning to minimize expected photometric cost and maximize likelihood of ground truth depth and normals.
no code implementations • 18 Feb 2020 • Donghyun Kim, Tian Lan, Chuhang Zou, Ning Xu, Bryan A. Plummer, Stan Sclaroff, Jayan Eledath, Gerard Medioni
We embed the attention module in a ``slow-fast'' architecture, where the slower network runs on sparsely sampled keyframes and the light-weight shallow network runs on non-keyframes at a high frame rate.
1 code implementation • 21 Oct 2019 • Mao-Chuang Yeh, Shuai Tang, Anand Bhattad, Chuhang Zou, David Forsyth
Style transfer methods produce a transferred image which is a rendering of a content image in the manner of a style image.
3 code implementations • 9 Oct 2019 • Chuhang Zou, Jheng-Wei Su, Chi-Han Peng, Alex Colburn, Qi Shan, Peter Wonka, Hung-Kuo Chu, Derek Hoiem
Recent approaches for predicting layouts from 360 panoramas produce excellent results.
1 code implementation • 3 Sep 2019 • Theerasit Issaranon, Chuhang Zou, David Forsyth
We describe a method that predicts, from a single RGB image, a depth map that describes the scene when a masked object is removed - we call this "counterfactual depth" that models hidden scene geometry together with the observations.
1 code implementation • 29 Jul 2019 • Chuhang Zou, Derek Hoiem
One major challenge in 3D reconstruction is to infer the complete shape geometry from partial foreground occlusions.
2 code implementations • CVPR 2018 • Chuhang Zou, Alex Colburn, Qi Shan, Derek Hoiem
We propose an algorithm to predict room layout from a single image that generalizes across panoramas and perspective images, cuboid layouts and more general layouts (e. g. L-shape room).
1 code implementation • 25 Oct 2017 • Chuhang Zou, Ruiqi Guo, Zhizhong Li, Derek Hoiem
In this paper, we aim to interpret indoor scenes from one RGBD image.
2 code implementations • ICCV 2017 • Chuhang Zou, Ersin Yumer, Jimei Yang, Duygu Ceylan, Derek Hoiem
The success of various applications including robotics, digital content creation, and visualization demand a structured and abstract representation of the 3D world from limited sensor data.
1 code implementation • 9 Apr 2015 • Ruiqi Guo, Chuhang Zou, Derek Hoiem
One major goal of vision is to infer physical models of objects, surfaces, and their layout from sensors.