Search Results for author: Haitian Zheng

Found 25 papers, 11 papers with code

Jurassic World Remake: Bringing Ancient Fossils Back to Life via Zero-Shot Long Image-to-Image Translation

1 code implementation14 Aug 2023 Alexander Martin, Haitian Zheng, Jie An, Jiebo Luo

In this work, we use text-guided latent diffusion models for zero-shot image-to-image translation (I2I) across large domain gaps (longI2I), where large amounts of new visual features and new geometry need to be generated to enter the target domain.

Image-to-Image Translation

Improving Video Colorization by Test-Time Tuning

1 code implementation25 Jun 2023 Yaping Zhao, Haitian Zheng, Jiebo Luo, Edmund Y. Lam

With the advancements in deep learning, video colorization by propagating color information from a colorized reference frame to a monochrome video sequence has been well explored.

Colorization

Cross-Camera Deep Colorization

1 code implementation26 Aug 2022 Yaping Zhao, Haitian Zheng, Mengqi Ji, Ruqi Huang

Our method takes cross-domain and cross-scale images as input, and consequently synthesizes HR colorization results to facilitate the trade-off between spatial-temporal resolution and color depth in the single-camera imaging system.

Colorization

CM-GAN: Image Inpainting with Cascaded Modulation GAN and Object-Aware Training

1 code implementation22 Mar 2022 Haitian Zheng, Zhe Lin, Jingwan Lu, Scott Cohen, Eli Shechtman, Connelly Barnes, Jianming Zhang, Ning Xu, Sohrab Amirghodsi, Jiebo Luo

We propose cascaded modulation GAN (CM-GAN), a new network design consisting of an encoder with Fourier convolution blocks that extract multi-scale feature representations from the input image with holes and a dual-stream decoder with a novel cascaded global-spatial modulation block at each scale level.

Image Inpainting

Point Cloud Denoising via Momentum Ascent in Gradient Fields

1 code implementation21 Feb 2022 Yaping Zhao, Haitian Zheng, Zhongrui Wang, Jiebo Luo, Edmund Y. Lam

To achieve point cloud denoising, traditional methods heavily rely on geometric priors, and most learning-based approaches suffer from outliers and loss of details.

Denoising Position

MANet: Improving Video Denoising with a Multi-Alignment Network

1 code implementation20 Feb 2022 Yaping Zhao, Haitian Zheng, Zhongrui Wang, Jiebo Luo, Edmund Y. Lam

In video denoising, the adjacent frames often provide very useful information, but accurate alignment is needed before such information can be harnassed.

Denoising Video Denoising

SpaceEdit: Learning a Unified Editing Space for Open-Domain Image Color Editing

no code implementations CVPR 2022 Jing Shi, Ning Xu, Haitian Zheng, Alex Smith, Jiebo Luo, Chenliang Xu

Recently, large pretrained models (e. g., BERT, StyleGAN, CLIP) show great knowledge transfer and generalization capability on various downstream tasks within their domains.

Image-to-Image Translation Retrieval +1

SpaceEdit: Learning a Unified Editing Space for Open-Domain Image Editing

no code implementations30 Nov 2021 Jing Shi, Ning Xu, Haitian Zheng, Alex Smith, Jiebo Luo, Chenliang Xu

Recently, large pretrained models (e. g., BERT, StyleGAN, CLIP) have shown great knowledge transfer and generalization capability on various downstream tasks within their domains.

Image-to-Image Translation Retrieval +1

Learning to Aggregate and Refine Noisy Labels for Visual Sentiment Analysis

no code implementations15 Sep 2021 Wei Zhu, Zihe Zheng, Haitian Zheng, Hanjia Lyu, Jiebo Luo

The learned prototypes and their labels can be regarded as denoising features and labels for the local regions and can guide the training process to prevent the model from overfitting the noisy cases.

Denoising Learning with noisy labels +1

Semantic Layout Manipulation with High-Resolution Sparse Attention

1 code implementation14 Dec 2020 Haitian Zheng, Zhe Lin, Jingwan Lu, Scott Cohen, Jianming Zhang, Ning Xu, Jiebo Luo

A core problem of this task is how to transfer visual details from the input images to the new semantic layout while making the resulting image visually realistic.

Vocal Bursts Intensity Prediction

Image Sentiment Transfer

no code implementations19 Jun 2020 Tianlang Chen, Wei Xiong, Haitian Zheng, Jiebo Luo

In this paper, we propose an effective and flexible framework that performs image sentiment transfer at the object level.

Disentanglement Image-to-Image Translation +2

Personalized Fashion Recommendation from Personal Social Media Data: An Item-to-Set Metric Learning Approach

no code implementations25 May 2020 Haitian Zheng, Kefei Wu, Jong-Hwi Park, Wei Zhu, Jiebo Luo

In this work, we study the problem of personalized fashion recommendation from social media data, i. e. recommending new outfits to social media users that fit their fashion preferences.

Metric Learning

What comprises a good talking-head video generation?: A Survey and Benchmark

1 code implementation7 May 2020 Lele Chen, Guofeng Cui, Ziyi Kou, Haitian Zheng, Chenliang Xu

In this work, we present a carefully-designed benchmark for evaluating talking-head video generation with standardized dataset pre-processing strategies.

Talking Head Generation Video Generation

Unsupervised Pose Flow Learning for Pose Guided Synthesis

no code implementations30 Sep 2019 Haitian Zheng, Lele Chen, Chenliang Xu, Jiebo Luo

Pose guided synthesis aims to generate a new image in an arbitrary target pose while preserving the appearance details from the source image.

CrossNet: An End-to-end Reference-based Super Resolution Network using Cross-scale Warping

1 code implementation ECCV 2018 Haitian Zheng, Mengqi Ji, Haoqian Wang, Yebin Liu, Lu Fang

The Reference-based Super-resolution (RefSR) super-resolves a low-resolution (LR) image given an external high-resolution (HR) reference image, where the reference image and LR image share similar viewpoint but with significant resolution gap x8.

Patch Matching Reference-based Super-Resolution

SurfaceNet: An End-to-end 3D Neural Network for Multiview Stereopsis

3 code implementations ICCV 2017 Mengqi Ji, Juergen Gall, Haitian Zheng, Yebin Liu, Lu Fang

It takes a set of images and their corresponding camera parameters as input and directly infers the 3D model.

Utilizing High-level Visual Feature for Indoor Shopping Mall Navigation

no code implementations6 Oct 2016 Ziwei Xu, Haitian Zheng, Minjian Pang, Yangchun Zhu, Xiongfei Su, Guyue Zhou, Lu Fang

Towards robust and convenient indoor shopping mall navigation, we propose a novel learning-based scheme to utilize the high-level visual information from the storefront images captured by personal devices of users.

Visual Navigation Vocal Bursts Intensity Prediction

Deep Learning for Surface Material Classification Using Haptic And Visual Information

no code implementations21 Dec 2015 Haitian Zheng, Lu Fang, Mengqi Ji, Matti Strese, Yigitcan Ozer, Eckehard Steinbach

When a user scratches a hand-held rigid tool across an object surface, an acceleration signal can be captured, which carries relevant information about the surface.

Classification General Classification +1

Learning High-level Prior with Convolutional Neural Networks for Semantic Segmentation

no code implementations22 Nov 2015 Haitian Zheng, Yebin Liu, Mengqi Ji, Feng Wu, Lu Fang

Finally, the optimization problem enables us to take advantage of state-of-the-art fully convolutional network structure for the implementation of the above encoders and decoder.

Image Segmentation Segmentation +2

Cannot find the paper you are looking for? You can Submit a new open access paper.