Search Results for author: Shangchen Zhou

Found 47 papers, 30 papers with code

Learning Inclusion Matching for Animation Paint Bucket Colorization

1 code implementation27 Mar 2024 Yuekun Dai, Shangchen Zhou, Qinyue Li, Chongyi Li, Chen Change Loy

In this work, we introduce a new learning-based inclusion matching pipeline, which directs the network to comprehend the inclusion relationships between segments rather than relying solely on direct visual correspondences.

Colorization

Control Color: Multimodal Diffusion-based Interactive Image Colorization

no code implementations16 Feb 2024 Zhexin Liang, Zhaochen Li, Shangchen Zhou, Chongyi Li, Chen Change Loy

We also introduce a novel module based on self-attention and a content-guided deformable autoencoder to address the long-standing issues of color overflow and inaccurate coloring.

Colorization Color Manipulation +1

Towards Language-Driven Video Inpainting via Multimodal Large Language Models

no code implementations18 Jan 2024 Jianzong Wu, Xiangtai Li, Chenyang Si, Shangchen Zhou, Jingkang Yang, Jiangning Zhang, Yining Li, Kai Chen, Yunhai Tong, Ziwei Liu, Chen Change Loy

We introduce a new task -- language-driven video inpainting, which uses natural language instructions to guide the inpainting process.

Video Inpainting

Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution

no code implementations11 Dec 2023 Shangchen Zhou, Peiqing Yang, Jianyi Wang, Yihang Luo, Chen Change Loy

Text-based diffusion models have exhibited remarkable success in generation and editing, showing great promise for enhancing visual content with their generative prior.

Video Super-Resolution

Iterative Token Evaluation and Refinement for Real-World Super-Resolution

1 code implementation9 Dec 2023 Chaofeng Chen, Shangchen Zhou, Liang Liao, HaoNing Wu, Wenxiu Sun, Qiong Yan, Weisi Lin

Distortion removal involves simple HQ token prediction with LQ images, while texture generation uses a discrete diffusion model to iteratively refine the distortion removal output with a token refinement network.

Image Super-Resolution Texture Synthesis

LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models

2 code implementations26 Sep 2023 Yaohui Wang, Xinyuan Chen, Xin Ma, Shangchen Zhou, Ziqi Huang, Yi Wang, Ceyuan Yang, Yinan He, Jiashuo Yu, Peiqing Yang, Yuwei Guo, Tianxing Wu, Chenyang Si, Yuming Jiang, Cunjian Chen, Chen Change Loy, Bo Dai, Dahua Lin, Yu Qiao, Ziwei Liu

To this end, we propose LaVie, an integrated video generation framework that operates on cascaded video latent diffusion models, comprising a base T2V model, a temporal interpolation model, and a video super-resolution model.

Text-to-Video Generation Video Generation +1

PGDiff: Guiding Diffusion Models for Versatile Face Restoration via Partial Guidance

1 code implementation NeurIPS 2023 Peiqing Yang, Shangchen Zhou, Qingyi Tao, Chen Change Loy

When combined with a diffusion prior, this partial guidance can deliver appealing results across a range of restoration tasks.

Adaptive Window Pruning for Efficient Local Motion Deblurring

no code implementations25 Jun 2023 Haoying Li, Jixin Zhao, Shangchen Zhou, Huajun Feng, Chongyi Li, Chen Change Loy

Existing image deblurring methods predominantly focus on global deblurring, inadvertently affecting the sharpness of backgrounds in locally blurred images and wasting unnecessary computation on sharp pixels, especially for high-resolution images.

Deblurring Image Deblurring

Flare7K++: Mixing Synthetic and Real Datasets for Nighttime Flare Removal and Beyond

1 code implementation7 Jun 2023 Yuekun Dai, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Yihang Luo, Chen Change Loy

To address this issue, we additionally provide the annotations of light sources in Flare7K++ and propose a new end-to-end pipeline to preserve the light source while removing lens flares.

Flare Removal

Exploiting Diffusion Prior for Real-World Image Super-Resolution

3 code implementations11 May 2023 Jianyi Wang, Zongsheng Yue, Shangchen Zhou, Kelvin C. K. Chan, Chen Change Loy

We present a novel approach to leverage prior knowledge encapsulated in pre-trained text-to-image diffusion models for blind super-resolution (SR).

Blind Super-Resolution Image Super-Resolution

MIPI 2023 Challenge on RGBW Remosaic: Methods and Results

no code implementations20 Apr 2023 Qianhui Sun, Qingyu Yang, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Yuekun Dai, Wenxiu Sun, Qingpeng Zhu, Chen Change Loy, Jinwei Gu

Developing and integrating advanced image sensors with novel algorithms in camera systems are prevalent with the increasing demand for computational photography and imaging on mobile platforms.

SSIM

MIPI 2023 Challenge on RGBW Fusion: Methods and Results

no code implementations20 Apr 2023 Qianhui Sun, Qingyu Yang, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Yuekun Dai, Wenxiu Sun, Qingpeng Zhu, Chen Change Loy, Jinwei Gu

Developing and integrating advanced image sensors with novel algorithms in camera systems are prevalent with the increasing demand for computational photography and imaging on mobile platforms.

SSIM

Iterative Prompt Learning for Unsupervised Backlit Image Enhancement

no code implementations ICCV 2023 Zhexin Liang, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Chen Change Loy

To solve this issue, we devise a prompt learning framework that first learns an initial prompt pair by constraining the text-image similarity between the prompt (negative/positive sample) and the corresponding image (backlit image/well-lit image) in the CLIP latent space.

Image Enhancement Image Manipulation

Embedding Fourier for Ultra-High-Definition Low-Light Image Enhancement

no code implementations23 Feb 2023 Chongyi Li, Chun-Le Guo, Man Zhou, Zhexin Liang, Shangchen Zhou, Ruicheng Feng, Chen Change Loy

Our approach is motivated by a few unique characteristics in the Fourier domain: 1) most luminance information concentrates on amplitudes while noise is closely related to phases, and 2) a high-resolution image and its low-resolution version share similar amplitude patterns. Through embedding Fourier into our network, the amplitude and phase of a low-light image are separately processed to avoid amplifying noise when enhancing luminance.

4k Low-Light Image Enhancement +1

Deep Dynamic Scene Deblurring from Optical Flow

no code implementations18 Jan 2023 Jiawei Zhang, Jinshan Pan, Daoye Wang, Shangchen Zhou, Xing Wei, Furong Zhao, Jianbo Liu, Jimmy Ren

In this paper, we explore optical flow to remove dynamic scene blur by using the multi-scale spatially variant recurrent neural network (RNN).

Deblurring Optical Flow Estimation

Learning Dual Memory Dictionaries for Blind Face Restoration

1 code implementation15 Oct 2022 Xiaoming Li, Shiguang Zhang, Shangchen Zhou, Lei Zhang, WangMeng Zuo

Generally, it is a challenging and intractable task to improve the photo-realistic performance of blind restoration and adaptively handle the generic and specific restoration scenarios with a single unified model.

Blind Face Restoration

Flare7K: A Phenomenological Nighttime Flare Removal Dataset

1 code implementation12 Oct 2022 Yuekun Dai, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Chen Change Loy

In this paper, we introduce, Flare7K, the first nighttime flare removal dataset, which is generated based on the observation and statistics of real-world nighttime lens flares.

Flare Removal

CuDi: Curve Distillation for Efficient and Controllable Exposure Adjustment

no code implementations28 Jul 2022 Chongyi Li, Chunle Guo, Ruicheng Feng, Shangchen Zhou, Chen Change Loy

Our method inherits the zero-reference learning and curve-based framework from an effective low-light image enhancement method, Zero-DCE, with further speed up in its inference speed, reduction in its model size, and extension to controllable exposure adjustment.

Low-Light Image Enhancement

Towards Robust Blind Face Restoration with Codebook Lookup Transformer

1 code implementation22 Jun 2022 Shangchen Zhou, Kelvin C. K. Chan, Chongyi Li, Chen Change Loy

In this paper, we demonstrate that a learned discrete codebook prior in a small proxy space largely reduces the uncertainty and ambiguity of restoration mapping by casting blind face restoration as a code prediction task, while providing rich visual atoms for generating high-quality faces.

Blind Face Restoration

On the Generalization of BasicVSR++ to Video Deblurring and Denoising

1 code implementation11 Apr 2022 Kelvin C. K. Chan, Shangchen Zhou, Xiangyu Xu, Chen Change Loy

The exploitation of long-term information has been a long-standing problem in video restoration.

Deblurring Denoising +2

BasicVSR++: Improving Video Super-Resolution with Enhanced Propagation and Alignment

3 code implementations CVPR 2022 Kelvin C. K. Chan, Shangchen Zhou, Xiangyu Xu, Chen Change Loy

We show that by empowering the recurrent framework with the enhanced propagation and alignment, one can exploit spatiotemporal information across misaligned video frames more effectively.

Analog Video Restoration Video Enhancement +1

Flexible Piecewise Curves Estimation for Photo Enhancement

no code implementations26 Oct 2020 Chongyi Li, Chunle Guo, Qiming Ai, Shangchen Zhou, Chen Change Loy

This paper presents a new method, called FlexiCurve, for photo enhancement.

Blind Face Restoration via Deep Multi-scale Component Dictionaries

1 code implementation ECCV 2020 Xiaoming Li, Chaofeng Chen, Shangchen Zhou, Xianhui Lin, WangMeng Zuo, Lei Zhang

Next, with the degraded input, we match and select the most similar component features from their corresponding dictionaries and transfer the high-quality details to the input via the proposed dictionary feature transfer (DFT) block.

Blind Face Restoration Video Super-Resolution

Cross-Scale Internal Graph Neural Network for Image Super-Resolution

1 code implementation NeurIPS 2020 Shangchen Zhou, Jiawei Zhang, WangMeng Zuo, Chen Change Loy

Specifically, we dynamically construct a cross-scale graph by searching k-nearest neighboring patches in the downsampled LR image for each query patch in the LR image.

Image Restoration Image Super-Resolution

Pix2Vox++: Multi-scale Context-aware 3D Object Reconstruction from Single and Multiple Images

3 code implementations22 Jun 2020 Haozhe Xie, Hongxun Yao, Shengping Zhang, Shangchen Zhou, Wenxiu Sun

A multi-scale context-aware fusion module is then introduced to adaptively select high-quality reconstructions for different parts from all coarse 3D volumes to obtain a fused 3D volume.

3D Object Reconstruction

GRNet: Gridding Residual Network for Dense Point Cloud Completion

1 code implementation ECCV 2020 Haozhe Xie, Hongxun Yao, Shangchen Zhou, Jiageng Mao, Shengping Zhang, Wenxiu Sun

In particular, we devise two novel differentiable layers, named Gridding and Gridding Reverse, to convert between point clouds and 3D grids without losing structural information.

Point Cloud Completion

Hybrid Graph Neural Networks for Crowd Counting

no code implementations31 Jan 2020 Ao Luo, Fan Yang, Xin Li, Dong Nie, Zhicheng Jiao, Shangchen Zhou, Hong Cheng

In this paper, we present a novel network structure called Hybrid Graph Neural Network (HyGnn) which targets to relieve the problem by interweaving the multi-scale features for crowd density as well as its auxiliary task (localization) together and performing joint reasoning over a graph.

Crowd Counting

SSAH: Semi-supervised Adversarial Deep Hashing with Self-paced Hard Sample Generation

no code implementations20 Nov 2019 Sheng Jin, Shangchen Zhou, Yao Liu, Chao Chen, Xiaoshuai Sun, Hongxun Yao, Xian-Sheng Hua

In this paper, we propose a novel Semi-supervised Self-pace Adversarial Hashing method, named SSAH to solve the above problems in a unified framework.

Deep Hashing Generative Adversarial Network

Toward 3D Object Reconstruction from Stereo Images

1 code implementation18 Oct 2019 Haozhe Xie, Hongxun Yao, Shangchen Zhou, Shengping Zhang, Xiaoshuai Sun, Wenxiu Sun

Inferring the 3D shape of an object from an RGB image has shown impressive results, however, existing methods rely primarily on recognizing the most similar 3D model from the training set to solve the problem.

3D Object Reconstruction Benchmarking +1

Spatio-Temporal Filter Adaptive Network for Video Deblurring

1 code implementation ICCV 2019 Shangchen Zhou, Jiawei Zhang, Jinshan Pan, Haozhe Xie, WangMeng Zuo, Jimmy Ren

To overcome the limitation of separate optical flow estimation, we propose a Spatio-Temporal Filter Adaptive Network (STFAN) for the alignment and deblurring in a unified framework.

Ranked #3 on Deblurring on DVD (using extra training data)

Deblurring Image Deblurring +1

DAVANet: Stereo Deblurring with View Aggregation

1 code implementation CVPR 2019 Shangchen Zhou, Jiawei Zhang, WangMeng Zuo, Haozhe Xie, Jinshan Pan, Jimmy Ren

Nowadays stereo cameras are more commonly adopted in emerging devices such as dual-lens smartphones and unmanned aerial vehicles.

Deblurring Image Deblurring

Pix2Vox: Context-aware 3D Reconstruction from Single and Multi-view Images

5 code implementations ICCV 2019 Haozhe Xie, Hongxun Yao, Xiaoshuai Sun, Shangchen Zhou, Shengping Zhang

Then, a context-aware fusion module is introduced to adaptively select high-quality reconstructions for each part (e. g., table legs) from different coarse 3D volumes to obtain a fused 3D volume.

3D Object Reconstruction 3D Reconstruction +1

Deep Saliency Hashing

no code implementations4 Jul 2018 Sheng Jin, Hongxun Yao, Xiaoshuai Sun, Shangchen Zhou, Lei Zhang, Xian-Sheng Hua

As the core of DSaH, the saliency loss guides the attention network to mine discriminative regions from pairs of images.

Deep Hashing Quantization

Cannot find the paper you are looking for? You can Submit a new open access paper.