Search Results for author: Tianfan Xue

Found 32 papers, 13 papers with code

HDRFlow: Real-Time HDR Video Reconstruction with Large Motions

no code implementations 6 Mar 2024 Gangwei Xu, Yujin Wang, Jinwei Gu, Tianfan Xue, Xin Yang

HDRFlow has three novel designs: an HDR-domain alignment loss (HALoss), an efficient flow network with a multi-size large kernel (MLK), and a new HDR flow training scheme.

Optical Flow Estimation • Video Reconstruction

GenNBV: Generalizable Next-Best-View Policy for Active 3D Reconstruction

no code implementations 25 Feb 2024 Xiao Chen, Quanyi Li, Tai Wang, Tianfan Xue, Jiangmiao Pang

Previous works attempt to automate this process using the Next-Best-View (NBV) policy for active 3D reconstruction.

3D Reconstruction • Reinforcement Learning (RL)

Event-Based Motion Magnification

1 code implementation 19 Feb 2024 Yutian Chen, Shi Guo, Fangzheng Yu, Feng Zhang, Jinwei Gu, Tianfan Xue

Detecting and magnifying imperceptible high-frequency motions in real-world scenarios has substantial implications for industrial and medical applications.

Motion Detection • Motion Magnification

EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI

1 code implementation 26 Dec 2023 Tai Wang, Xiaohan Mao, Chenming Zhu, Runsen Xu, Ruiyuan Lyu, Peisen Li, Xiao Chen, Wenwei Zhang, Kai Chen, Tianfan Xue, Xihui Liu, Cewu Lu, Dahua Lin, Jiangmiao Pang

In the realm of computer vision and robotics, embodied agents are expected to explore their environment and carry out human instructions.

Scene Understanding

Depicting Beyond Scores: Advancing Image Quality Assessment through Multi-modal Language Models

1 code implementation 14 Dec 2023 Zhiyuan You, Zheyuan Li, Jinjin Gu, Zhenfei Yin, Tianfan Xue, Chao Dong

We introduce a Depicted image Quality Assessment method (DepictQA), overcoming the constraints of traditional score-based methods.

Descriptive Image Quality Assessment +1

Text-to-3D Generation with Bidirectional Diffusion using both 2D and 3D priors

no code implementations 7 Dec 2023 Lihe Ding, Shaocong Dong, Zhanpeng Huang, Zibin Wang, Yiyuan Zhang, Kaixiong Gong, Dan Xu, Tianfan Xue

Recently, researchers have attempted to improve the genuineness of 3D objects by directly training on 3D datasets, albeit at the cost of low-quality texture generation due to the limited texture diversity in 3D datasets.

3D Generation • Text to 3D +1

Obj-NeRF: Extract Object NeRFs from Multi-view Images

no code implementations 26 Nov 2023 Zhiyi Li, Lihe Ding, Tianfan Xue

To solve this problem, we propose Obj-NeRF, a comprehensive pipeline that recovers the 3D geometry of a specific object from multi-view images using a single prompt.

3D Reconstruction • Novel View Synthesis +2

AutoDIR: Automatic All-in-One Image Restoration with Latent Diffusion

1 code implementation 16 Oct 2023 Yitong Jiang, Zhaoyang Zhang, Tianfan Xue, Jinwei Gu

To this end, we propose an all-in-one image restoration framework with latent diffusion (AutoDIR), which can automatically detect and address multiple unknown degradations.

Blind Image Quality Assessment • Image Restoration
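The detect-then-restore idea can be sketched as a small dispatcher: a detector guesses the degradation type, then the matching restoration operator runs. Everything below (the variance/brightness heuristics, the toy restorers, the thresholds) is an illustrative placeholder, not AutoDIR's actual latent-diffusion pipeline:

```python
import numpy as np

def detect_degradation(img):
    """Crude heuristic detector: high variance -> noise, low mean -> low light."""
    if np.var(img) > 0.05:
        return "noise"
    if img.mean() < 0.25:
        return "low_light"
    return "clean"

# Placeholder restorers: a 3-tap average for noise, gain for low light.
RESTORERS = {
    "noise": lambda im: np.clip((im + np.roll(im, 1, 0) + np.roll(im, 1, 1)) / 3, 0, 1),
    "low_light": lambda im: np.clip(im * 2.0, 0, 1),
    "clean": lambda im: im,
}

def auto_restore(img):
    """Detect the (single) dominant degradation, then dispatch to its restorer."""
    kind = detect_degradation(img)
    return kind, RESTORERS[kind](img)

# Usage: a uniformly dark image is flagged as "low_light" and brightened.
dark = np.full((4, 4), 0.1)
kind, fixed = auto_restore(dark)
```

A real all-in-one model would of course learn both stages jointly and handle unknown, compound degradations; the dispatcher above only illustrates the control flow.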

Reconstruct-and-Generate Diffusion Model for Detail-Preserving Image Denoising

no code implementations 19 Sep 2023 Yujin Wang, Lingen Li, Tianfan Xue, Jinwei Gu

To address the trade-off between visual appeal and fidelity of high-frequency details in denoising tasks, we propose a novel approach called the Reconstruct-and-Generate Diffusion Model (RnG).

Image Denoising • Image Restoration
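The fidelity/appeal split can be illustrated with a toy reconstruct-then-generate pipeline: a reconstruction branch keeps the low-frequency content faithful to the input, and a separate generative branch synthesizes only the high-frequency residual. The box blur below stands in for the reconstruction network and `generate_residual` for the diffusion branch; both are placeholders, not the paper's models:

```python
import numpy as np

def box_blur(img, k=3):
    """Fidelity-oriented stand-in for the reconstruction branch: a k x k box filter."""
    pad = k // 2
    p = np.pad(img, pad, mode="reflect")
    out = np.zeros_like(img, dtype=float)
    for dy in range(k):
        for dx in range(k):
            out += p[dy:dy + img.shape[0], dx:dx + img.shape[1]]
    return out / (k * k)

def reconstruct_and_generate(noisy, generate_residual):
    """Faithful low-frequency base plus a synthesized high-frequency residual."""
    base = box_blur(noisy)            # reconstruction: low frequencies, high fidelity
    detail = generate_residual(base)  # generation: high-frequency detail (placeholder)
    return base + detail

# Usage: with a zero residual generator, the output is just the smooth base.
noisy = np.random.rand(8, 8)
out = reconstruct_and_generate(noisy, lambda b: np.zeros_like(b))
```

The design point is that fidelity is controlled by the reconstruction branch alone, so the generative branch can hallucinate texture without corrupting structure.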

Fast and High-Quality Image Denoising via Malleable Convolutions

no code implementations 2 Jan 2022 Yifan Jiang, Bartlomiej Wronski, Ben Mildenhall, Jonathan T. Barron, Zhangyang Wang, Tianfan Xue

These spatially-varying kernels are produced by an efficient predictor network running on a downsampled input, making them much cheaper to compute than per-pixel kernels predicted from a full-resolution image, while also enlarging the network's receptive field compared with static kernels.

Image Denoising • Image Restoration +1

Defocus Map Estimation and Deblurring from a Single Dual-Pixel Image

no code implementations ICCV 2021 Shumian Xin, Neal Wadhwa, Tianfan Xue, Jonathan T. Barron, Pratul P. Srinivasan, Jiawen Chen, Ioannis Gkioulekas, Rahul Garg

We use data captured with a consumer smartphone camera to demonstrate that, after a one-time calibration step, our approach improves upon prior works for both defocus map estimation and blur removal, despite being entirely unsupervised.

Deblurring

How to Train Neural Networks for Flare Removal

1 code implementation ICCV 2021 Yicheng Wu, Qiurui He, Tianfan Xue, Rahul Garg, Jiawen Chen, Ashok Veeraraghavan, Jonathan T. Barron

When a camera is pointed at a strong light source, the resulting photograph may contain lens flare artifacts.

Flare Removal

Real-time Localized Photorealistic Video Style Transfer

no code implementations 20 Oct 2020 Xide Xia, Tianfan Xue, Wei-Sheng Lai, Zheng Sun, Abby Chang, Brian Kulis, Jiawen Chen

We present a novel algorithm for transferring artistic styles of semantically meaningful local regions of an image onto local regions of a target video while preserving its photorealism.

Style Transfer • Video Segmentation +2

Learned Dual-View Reflection Removal

no code implementations 1 Oct 2020 Simon Niklaus, Xuaner Cecilia Zhang, Jonathan T. Barron, Neal Wadhwa, Rahul Garg, Feng Liu, Tianfan Xue

Traditional reflection removal algorithms either use a single image as input, which suffers from intrinsic ambiguities, or use multiple images from a moving camera, which is inconvenient for users.

Reflection Removal

Neural Light Transport for Relighting and View Synthesis

1 code implementation 9 Aug 2020 Xiuming Zhang, Sean Fanello, Yun-Ta Tsai, Tiancheng Sun, Tianfan Xue, Rohit Pandey, Sergio Orts-Escolano, Philip Davidson, Christoph Rhemann, Paul Debevec, Jonathan T. Barron, Ravi Ramamoorthi, William T. Freeman

In particular, we show how to fuse previously seen observations of illuminants and views to synthesize a new image of the same scene under a desired lighting condition from a chosen viewpoint.

Joint Bilateral Learning for Real-time Universal Photorealistic Style Transfer

3 code implementations ECCV 2020 Xide Xia, Meng Zhang, Tianfan Xue, Zheng Sun, Hui Fang, Brian Kulis, Jiawen Chen

Photorealistic style transfer is the task of transferring the artistic style of an image onto a content target, producing a result that looks as if it could plausibly have been captured with a camera.

4k • Style Transfer

Handheld Mobile Photography in Very Low Light

no code implementations 24 Oct 2019 Orly Liba, Kiran Murthy, Yun-Ta Tsai, Tim Brooks, Tianfan Xue, Nikhil Karnad, Qiurui He, Jonathan T. Barron, Dillon Sharlet, Ryan Geiss, Samuel W. Hasinoff, Yael Pritch, Marc Levoy

Aside from the physical limits imposed by read noise and photon shot noise, these cameras are typically handheld, have small apertures and sensors, use mass-produced analog electronics that cannot easily be cooled, and are commonly used to photograph subjects that move, like children and pets.

Tone Mapping

Stereoscopic Dark Flash for Low-light Photography

no code implementations 5 Jan 2019 Jian Wang, Tianfan Xue, Jonathan T. Barron, Jiawen Chen

In this work, we present a camera configuration for acquiring "stereoscopic dark flash" images: a simultaneous stereo pair in which one camera is a conventional RGB sensor, but the other camera is sensitive to near-infrared and near-ultraviolet instead of R and B.

MoSculp: Interactive Visualization of Shape and Time

no code implementations 14 Sep 2018 Xiuming Zhang, Tali Dekel, Tianfan Xue, Andrew Owens, Qiurui He, Jiajun Wu, Stefanie Mueller, William T. Freeman

We present a system that allows users to visualize complex human motion via 3D motion sculptures: a representation that conveys the 3D structure swept by a human body as it moves through space.

Seeing Tree Structure from Vibration

no code implementations ECCV 2018 Tianfan Xue, Jiajun Wu, Zhoutong Zhang, Chengkai Zhang, Joshua B. Tenenbaum, William T. Freeman

Humans recognize object structure from both their appearance and motion; often, motion helps to resolve ambiguities in object structure that arise when we observe object appearance only.

Bayesian Inference • Object

Visual Dynamics: Stochastic Future Generation via Layered Cross Convolutional Networks

no code implementations 24 Jul 2018 Tianfan Xue, Jiajun Wu, Katherine L. Bouman, William T. Freeman

We study the problem of synthesizing a number of likely future frames from a single input image.

3D Interpreter Networks for Viewer-Centered Wireframe Modeling

no code implementations 3 Apr 2018 Jiajun Wu, Tianfan Xue, Joseph J. Lim, Yuandong Tian, Joshua B. Tenenbaum, Antonio Torralba, William T. Freeman

3D-INN is trained on real images to estimate 2D keypoint heatmaps from an input image; it then predicts 3D object structure from heatmaps using knowledge learned from synthetic 3D shapes.

Image Retrieval • Keypoint Estimation +2

MarrNet: 3D Shape Reconstruction via 2.5D Sketches

no code implementations NeurIPS 2017 Jiajun Wu, Yifan Wang, Tianfan Xue, Xingyuan Sun, William T. Freeman, Joshua B. Tenenbaum

First, compared to full 3D shape, 2.5D sketches are much easier to recover from a 2D image; models that recover 2.5D sketches are also more likely to transfer from synthetic to real data.

3D Object Reconstruction From A Single Image • 3D Reconstruction +3

Single Image 3D Interpreter Network

1 code implementation 29 Apr 2016 Jiajun Wu, Tianfan Xue, Joseph J. Lim, Yuandong Tian, Joshua B. Tenenbaum, Antonio Torralba, William T. Freeman

In this work, we propose 3D INterpreter Network (3D-INN), an end-to-end framework which sequentially estimates 2D keypoint heatmaps and 3D object structure, trained on both real 2D-annotated images and synthetic 3D data.

Image Retrieval • Keypoint Estimation +2
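The two-stage pipeline (keypoint heatmaps, then 3D lifting) can be sketched as follows. Stage 1 reads off each keypoint as its heatmap's argmax; stage 2 assembles the 3D structure as a weighted sum of basis shapes, loosely in the spirit of using knowledge learned from synthetic 3D data. The basis and weights below are toy placeholders, not the paper's actual modules:

```python
import numpy as np

def heatmaps_to_keypoints(heatmaps):
    """Stage 1 output: each keypoint is the argmax of its heatmap, as (row, col)."""
    n, h, w = heatmaps.shape
    flat = heatmaps.reshape(n, -1).argmax(axis=1)
    return np.stack([flat // w, flat % w], axis=1)

def lift_to_3d(basis, weights):
    """Stage 2 sketch: 3D structure as a weighted sum of basis skeletons.

    basis:   (B, n, 3) candidate 3D skeletons (synthetic shape knowledge)
    weights: (B,) coefficients, e.g. regressed from the heatmaps
    Returns (n, 3).
    """
    return np.tensordot(weights, basis, axes=1)

# Usage: two 5x6 heatmaps with unit peaks at (2, 3) and (4, 1).
hm = np.zeros((2, 5, 6))
hm[0, 2, 3] = 1.0
hm[1, 4, 1] = 1.0
kps = heatmaps_to_keypoints(hm)   # [[2, 3], [4, 1]]
```

Factoring the pipeline through keypoints is what lets the 3D stage train on synthetic shapes while the heatmap stage trains on real 2D-annotated images.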
