Search Results for author: Chuhang Zou

Found 16 papers, 9 papers with code

Direct and Explicit 3D Generation from a Single Image

no code implementations17 Nov 2024 HaoYu Wu, Meher Gitika Karumuri, Chuhang Zou, Seungbae Bang, Yuelong Li, Dimitris Samaras, Sunil Hadap

Current image-to-3D approaches suffer from high computational costs and lack scalability for high-resolution outputs.

3D Generation Decoder +2

Plenoptic PNG: Real-Time Neural Radiance Fields in 150 KB

no code implementations24 Sep 2024 Jae Yong Lee, Yuqun Wu, Chuhang Zou, Derek Hoiem, Shenlong Wang

The goal of this paper is to encode a 3D scene into an extremely compact representation from 2D images and to enable its transmittance, decoding and rendering in real-time across various platforms.

MonoPatchNeRF: Improving Neural Radiance Fields with Patch-based Monocular Guidance

no code implementations12 Apr 2024 Yuqun Wu, Jae Yong Lee, Chuhang Zou, Shenlong Wang, Derek Hoiem

The latest regularized Neural Radiance Field (NeRF) approaches produce poor geometry and view extrapolation for large scale sparse view scenes, such as ETH3D.

Novel View Synthesis SSIM

QFF: Quantized Fourier Features for Neural Field Representations

no code implementations2 Dec 2022 Jae Yong Lee, Yuqun Wu, Chuhang Zou, Shenlong Wang, Derek Hoiem

Instead, we propose to encode features in bins of Fourier features that are commonly used for positional encoding.

Deep PatchMatch MVS with Learned Patch Coplanarity, Geometric Consistency and Adaptive Pixel Sampling

no code implementations14 Oct 2022 Jae Yong Lee, Chuhang Zou, Derek Hoiem

Recent work in multi-view stereo (MVS) combines learnable photometric scores and regularization with PatchMatch-based optimization to achieve robust pixelwise estimates of depth, normals, and visibility.

PatchMatch-RL: Deep MVS with Pixelwise Depth, Normal, and Visibility

1 code implementation ICCV 2021 Jae Yong Lee, Joseph DeGol, Chuhang Zou, Derek Hoiem

To overcome the challenge of the non-differentiable PatchMatch optimization that involves iterative sampling and hard decisions, we use reinforcement learning to minimize expected photometric cost and maximize likelihood of ground truth depth and normals.

MILA: Multi-Task Learning from Videos via Efficient Inter-Frame Attention

no code implementations18 Feb 2020 Donghyun Kim, Tian Lan, Chuhang Zou, Ning Xu, Bryan A. Plummer, Stan Sclaroff, Jayan Eledath, Gerard Medioni

We embed the attention module in a ``slow-fast'' architecture, where the slower network runs on sparsely sampled keyframes and the light-weight shallow network runs on non-keyframes at a high frame rate.

Multi-Task Learning

Improving Style Transfer with Calibrated Metrics

1 code implementation21 Oct 2019 Mao-Chuang Yeh, Shuai Tang, Anand Bhattad, Chuhang Zou, David Forsyth

Style transfer methods produce a transferred image which is a rendering of a content image in the manner of a style image.

Style Transfer

Counterfactual Depth from a Single RGB Image

1 code implementation3 Sep 2019 Theerasit Issaranon, Chuhang Zou, David Forsyth

We describe a method that predicts, from a single RGB image, a depth map that describes the scene when a masked object is removed - we call this "counterfactual depth" that models hidden scene geometry together with the observations.

counterfactual Decoder

Silhouette Guided Point Cloud Reconstruction beyond Occlusion

1 code implementation29 Jul 2019 Chuhang Zou, Derek Hoiem

One major challenge in 3D reconstruction is to infer the complete shape geometry from partial foreground occlusions.

Point cloud reconstruction

LayoutNet: Reconstructing the 3D Room Layout from a Single RGB Image

2 code implementations CVPR 2018 Chuhang Zou, Alex Colburn, Qi Shan, Derek Hoiem

We propose an algorithm to predict room layout from a single image that generalizes across panoramas and perspective images, cuboid layouts and more general layouts (e. g. L-shape room).

3D Room Layouts From A Single RGB Panorama Translation

3D-PRNN: Generating Shape Primitives with Recurrent Neural Networks

2 code implementations ICCV 2017 Chuhang Zou, Ersin Yumer, Jimei Yang, Duygu Ceylan, Derek Hoiem

The success of various applications including robotics, digital content creation, and visualization demand a structured and abstract representation of the 3D world from limited sensor data.

Retrieval

Predicting Complete 3D Models of Indoor Scenes

1 code implementation9 Apr 2015 Ruiqi Guo, Chuhang Zou, Derek Hoiem

One major goal of vision is to infer physical models of objects, surfaces, and their layout from sensors.

Diversity Visual Reasoning

Cannot find the paper you are looking for? You can Submit a new open access paper.