no code implementations • 21 Mar 2024 • Haiwen Feng, Zheng Ding, Zhihao Xia, Simon Niklaus, Victoria Abrevaya, Michael J. Black, Xuaner Zhang
We introduce bounded generation as a generalized task to control video generation to synthesize arbitrary camera and subject motion based only on a given start and end frame.
no code implementations • 19 Mar 2024 • Hadi AlZayer, Zhihao Xia, Xuaner Zhang, Eli Shechtman, Jia-Bin Huang, Michael Gharbi
We show that by using simple segmentations and coarse 2D manipulations, we can synthesize a photorealistic edit faithful to the user's input while addressing second-order effects like harmonizing the lighting and physical interactions between edited objects.
no code implementations • CVPR 2024 • Zheng Ding, Xuaner Zhang, Zhuowen Tu, Zhihao Xia
We propose a method to adapt a pretrained diffusion model for image restoration by simply adding noise to the input image to be restored and then denoise.
1 code implementation • CVPR 2023 • Zheng Ding, Xuaner Zhang, Zhihao Xia, Lars Jebe, Zhuowen Tu, Xiuming Zhang
On a high level, DiffusionRig learns to map simplistic renderings of 3D face models to realistic photos of a given person.
no code implementations • CVPR 2023 • Ke Wang, Michaël Gharbi, He Zhang, Zhihao Xia, Eli Shechtman
Learning-based image harmonization techniques are usually trained to undo synthetic random global transformations applied to a masked foreground in a single ground truth photo.
no code implementations • ICCV 2023 • Goutam Bhat, Michaël Gharbi, Jiawen Chen, Luc van Gool, Zhihao Xia
Extensive experiments on real and synthetic data show that, despite only using noisy bursts during training, models trained with our self-supervised strategy match, and sometimes surpass, the quality of fully-supervised baselines trained with synthetic data or weakly-paired ground-truth.
1 code implementation • CVPR 2022 • Ilya Chugunov, Yuxuan Zhang, Zhihao Xia, Xuaner, Zhang, Jiawen Chen, Felix Heide
Modern smartphones can continuously stream multi-megapixel RGB images at 60Hz, synchronized with high-quality 3D pose information and low-resolution LiDAR-driven depth estimates.
no code implementations • ICCV 2021 • Zhihao Xia, Jason Lawrence, Supreeth Achar
Casual photography is often performed in uncontrolled lighting that can result in low quality images and degrade the performance of downstream processing.
no code implementations • CVPR 2021 • Zhihao Xia, Michaël Gharbi, Federico Perazzi, Kalyan Sunkavalli, Ayan Chakrabarti
We introduce a neural network-based method to denoise pairs of images taken in quick succession, with and without a flash, in low-light environments.
no code implementations • CVPR 2020 • Zhihao Xia, Federico Perazzi, Michaël Gharbi, Kalyan Sunkavalli, Ayan Chakrabarti
Bursts of images exhibit significant self-similarity across both time and space.
1 code implementation • NeurIPS 2019 • Zhihao Xia, Ayan Chakrabarti
Deep neural networks have been very successful in compressive-sensing and image restoration applications, as a means to estimate images from partial, blurry, or otherwise degraded measurements.
1 code implementation • CVPR 2020 • Zhihao Xia, Patrick Sullivan, Ayan Chakrabarti
Beyond depth estimation from a single image, the monocular cue is useful in a broader range of depth inference applications and settings---such as when one can leverage other available depth cues for improved accuracy.
Ranked #65 on Monocular Depth Estimation on NYU-Depth V2 (RMSE metric)
1 code implementation • 13 Jun 2019 • Zhihao Xia, Ayan Chakrabarti
We evaluate our method for training networks for compressive-sensing and blind deconvolution, considering both non-blind and blind training for the latter.
1 code implementation • 13 Jun 2018 • Zhihao Xia, Ayan Chakrabarti
In this work, we propose a new method for natural image denoising that trains a deep neural network to determine whether patches in a noisy image input share common underlying patterns.
4 code implementations • 18 Mar 2018 • Yu Sun, Zhihao Xia, Ulugbek S. Kamilov
Image reconstruction under multiple light scattering is crucial in a number of applications such as diffraction tomography.