Search Results for author: Chuhang Zou

Found 14 papers, 9 papers with code

MonoPatchNeRF: Improving Neural Radiance Fields with Patch-based Monocular Guidance

no code implementations • 12 Apr 2024 • Yuqun Wu, Jae Yong Lee, Chuhang Zou, Shenlong Wang, Derek Hoiem

Our experiments show 4x the performance of RegNeRF and 8x that of FreeNeRF on average F1@2cm for ETH3D MVS benchmark, suggesting a fruitful research direction to improve the geometric accuracy of NeRF-based models, and sheds light on a potential future approach to enable NeRF-based optimization to eventually outperform traditional MVS.

Novel View Synthesis SSIM

Paper
Add Code

QFF: Quantized Fourier Features for Neural Field Representations

no code implementations • 2 Dec 2022 • Jae Yong Lee, Yuqun Wu, Chuhang Zou, Shenlong Wang, Derek Hoiem

Instead, we propose to encode features in bins of Fourier features that are commonly used for positional encoding.

Paper
Add Code

Deep PatchMatch MVS with Learned Patch Coplanarity, Geometric Consistency and Adaptive Pixel Sampling

no code implementations • 14 Oct 2022 • Jae Yong Lee, Chuhang Zou, Derek Hoiem

Recent work in multi-view stereo (MVS) combines learnable photometric scores and regularization with PatchMatch-based optimization to achieve robust pixelwise estimates of depth, normals, and visibility.

Paper
Add Code

Recurrent Transformer Variational Autoencoders for Multi-Action Motion Synthesis

no code implementations • 14 Jun 2022 • Rania Briq, Chuhang Zou, Leonid Pishchulin, Chris Broaddus, Juergen Gall

We consider the problem of synthesizing multi-action human motion sequences of arbitrary lengths.

Motion Synthesis

Paper
Add Code

PatchMatch-RL: Deep MVS with Pixelwise Depth, Normal, and Visibility

1 code implementation • ICCV 2021 • Jae Yong Lee, Joseph DeGol, Chuhang Zou, Derek Hoiem

To overcome the challenge of the non-differentiable PatchMatch optimization that involves iterative sampling and hard decisions, we use reinforcement learning to minimize expected photometric cost and maximize likelihood of ground truth depth and normals.

Paper
Code

MILA: Multi-Task Learning from Videos via Efficient Inter-Frame Attention

no code implementations • 18 Feb 2020 • Donghyun Kim, Tian Lan, Chuhang Zou, Ning Xu, Bryan A. Plummer, Stan Sclaroff, Jayan Eledath, Gerard Medioni

We embed the attention module in a ``slow-fast'' architecture, where the slower network runs on sparsely sampled keyframes and the light-weight shallow network runs on non-keyframes at a high frame rate.

Multi-Task Learning

Paper
Add Code

Improving Style Transfer with Calibrated Metrics

1 code implementation • 21 Oct 2019 • Mao-Chuang Yeh, Shuai Tang, Anand Bhattad, Chuhang Zou, David Forsyth

Style transfer methods produce a transferred image which is a rendering of a content image in the manner of a style image.

Style Transfer

Paper
Code

Manhattan Room Layout Reconstruction from a Single 360 image: A Comparative Study of State-of-the-art Methods

3 code implementations • 9 Oct 2019 • Chuhang Zou, Jheng-Wei Su, Chi-Han Peng, Alex Colburn, Qi Shan, Peter Wonka, Hung-Kuo Chu, Derek Hoiem

Recent approaches for predicting layouts from 360 panoramas produce excellent results.

Semantic Segmentation

309

Paper
Code

Counterfactual Depth from a Single RGB Image

1 code implementation • 3 Sep 2019 • Theerasit Issaranon, Chuhang Zou, David Forsyth

We describe a method that predicts, from a single RGB image, a depth map that describes the scene when a masked object is removed - we call this "counterfactual depth" that models hidden scene geometry together with the observations.

counterfactual

Paper
Code

Silhouette Guided Point Cloud Reconstruction beyond Occlusion

1 code implementation • 29 Jul 2019 • Chuhang Zou, Derek Hoiem

One major challenge in 3D reconstruction is to infer the complete shape geometry from partial foreground occlusions.

Point cloud reconstruction

Paper
Code

LayoutNet: Reconstructing the 3D Room Layout from a Single RGB Image

2 code implementations • CVPR 2018 • Chuhang Zou, Alex Colburn, Qi Shan, Derek Hoiem

We propose an algorithm to predict room layout from a single image that generalizes across panoramas and perspective images, cuboid layouts and more general layouts (e. g. L-shape room).

Ranked #2 on 3D Room Layouts From A Single RGB Panorama on Realtor360

3D Room Layouts From A Single RGB Panorama Translation

213

Paper
Code

Complete 3D Scene Parsing from an RGBD Image

1 code implementation • 25 Oct 2017 • Chuhang Zou, Ruiqi Guo, Zhizhong Li, Derek Hoiem

In this paper, we aim to interpret indoor scenes from one RGBD image.

Retrieval Scene Parsing +1

Paper
Code

3D-PRNN: Generating Shape Primitives with Recurrent Neural Networks

2 code implementations • ICCV 2017 • Chuhang Zou, Ersin Yumer, Jimei Yang, Duygu Ceylan, Derek Hoiem

The success of various applications including robotics, digital content creation, and visualization demand a structured and abstract representation of the 3D world from limited sensor data.

Retrieval

118

Paper
Code

Predicting Complete 3D Models of Indoor Scenes

1 code implementation • 9 Apr 2015 • Ruiqi Guo, Chuhang Zou, Derek Hoiem

One major goal of vision is to infer physical models of objects, surfaces, and their layout from sensors.

Visual Reasoning

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.