Search Results for author: Shangchen Zhou

Found 47 papers, 30 papers with code

Learning Inclusion Matching for Animation Paint Bucket Colorization

1 code implementation • 27 Mar 2024 • Yuekun Dai, Shangchen Zhou, Qinyue Li, Chongyi Li, Chen Change Loy

In this work, we introduce a new learning-based inclusion matching pipeline, which directs the network to comprehend the inclusion relationships between segments rather than relying solely on direct visual correspondences.

Colorization

175

Paper
Code

LN3Diff: Scalable Latent Neural Fields Diffusion for Speedy 3D Generation

no code implementations • 18 Mar 2024 • Yushi Lan, Fangzhou Hong, Shuai Yang, Shangchen Zhou, Xuyi Meng, Bo Dai, Xingang Pan, Chen Change Loy

The latent is decoded by a transformer-based decoder into a high-capacity 3D neural field.

3D Generation 3D Reconstruction +1

Paper
Add Code

Control Color: Multimodal Diffusion-based Interactive Image Colorization

no code implementations • 16 Feb 2024 • Zhexin Liang, Zhaochen Li, Shangchen Zhou, Chongyi Li, Chen Change Loy

We also introduce a novel module based on self-attention and a content-guided deformable autoencoder to address the long-standing issues of color overflow and inaccurate coloring.

Colorization Color Manipulation +1

Paper
Add Code

Towards Language-Driven Video Inpainting via Multimodal Large Language Models

no code implementations • 18 Jan 2024 • Jianzong Wu, Xiangtai Li, Chenyang Si, Shangchen Zhou, Jingkang Yang, Jiangning Zhang, Yining Li, Kai Chen, Yunhai Tong, Ziwei Liu, Chen Change Loy

We introduce a new task -- language-driven video inpainting, which uses natural language instructions to guide the inpainting process.

Video Inpainting

Paper
Add Code

Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution

no code implementations • 11 Dec 2023 • Shangchen Zhou, Peiqing Yang, Jianyi Wang, Yihang Luo, Chen Change Loy

Text-based diffusion models have exhibited remarkable success in generation and editing, showing great promise for enhancing visual content with their generative prior.

Video Super-Resolution

Paper
Add Code

Iterative Token Evaluation and Refinement for Real-World Super-Resolution

1 code implementation • 9 Dec 2023 • Chaofeng Chen, Shangchen Zhou, Liang Liao, HaoNing Wu, Wenxiu Sun, Qiong Yan, Weisi Lin

Distortion removal involves simple HQ token prediction with LQ images, while texture generation uses a discrete diffusion model to iteratively refine the distortion removal output with a token refinement network.

Image Super-Resolution Texture Synthesis

Paper
Code

LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models

2 code implementations • 26 Sep 2023 • Yaohui Wang, Xinyuan Chen, Xin Ma, Shangchen Zhou, Ziqi Huang, Yi Wang, Ceyuan Yang, Yinan He, Jiashuo Yu, Peiqing Yang, Yuwei Guo, Tianxing Wu, Chenyang Si, Yuming Jiang, Cunjian Chen, Chen Change Loy, Bo Dai, Dahua Lin, Yu Qiao, Ziwei Liu

To this end, we propose LaVie, an integrated video generation framework that operates on cascaded video latent diffusion models, comprising a base T2V model, a temporal interpolation model, and a video super-resolution model.

Ranked #4 on Text-to-Video Generation on EvalCrafter Text-to-Video (ECTV) Dataset (using extra training data)

Text-to-Video Generation Video Generation +1

724

Paper
Code

PGDiff: Guiding Diffusion Models for Versatile Face Restoration via Partial Guidance

1 code implementation • NeurIPS 2023 • Peiqing Yang, Shangchen Zhou, Qingyi Tao, Chen Change Loy

When combined with a diffusion prior, this partial guidance can deliver appealing results across a range of restoration tasks.

118

Paper
Code

ProPainter: Improving Propagation and Transformer for Video Inpainting

1 code implementation • ICCV 2023 • Shangchen Zhou, Chongyi Li, Kelvin C. K. Chan, Chen Change Loy

We also propose a mask-guided sparse video Transformer, which achieves high efficiency by discarding unnecessary and redundant tokens.

Ranked #1 on Video Inpainting on YouTube-VOS 2018

Optical Flow Estimation Video Inpainting

4,627

Paper
Code

Adaptive Window Pruning for Efficient Local Motion Deblurring

no code implementations • 25 Jun 2023 • Haoying Li, Jixin Zhao, Shangchen Zhou, Huajun Feng, Chongyi Li, Chen Change Loy

Existing image deblurring methods predominantly focus on global deblurring, inadvertently affecting the sharpness of backgrounds in locally blurred images and wasting unnecessary computation on sharp pixels, especially for high-resolution images.

Deblurring Image Deblurring

Paper
Add Code

Flare7K++: Mixing Synthetic and Real Datasets for Nighttime Flare Removal and Beyond

1 code implementation • 7 Jun 2023 • Yuekun Dai, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Yihang Luo, Chen Change Loy

To address this issue, we additionally provide the annotations of light sources in Flare7K++ and propose a new end-to-end pipeline to preserve the light source while removing lens flares.

Flare Removal

110

Paper
Code

MIPI 2023 Challenge on Nighttime Flare Removal: Methods and Results

no code implementations • 23 May 2023 • Yuekun Dai, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Qingpeng Zhu, Qianhui Sun, Wenxiu Sun, Chen Change Loy, Jinwei Gu

In this paper, we summarize and review the Nighttime Flare Removal track on MIPI 2023.

Flare Removal

Paper
Add Code

Exploiting Diffusion Prior for Real-World Image Super-Resolution

3 code implementations • 11 May 2023 • Jianyi Wang, Zongsheng Yue, Shangchen Zhou, Kelvin C. K. Chan, Chen Change Loy

We present a novel approach to leverage prior knowledge encapsulated in pre-trained text-to-image diffusion models for blind super-resolution (SR).

Blind Super-Resolution Image Super-Resolution

1,823

Paper
Code

MIPI 2023 Challenge on RGB+ToF Depth Completion: Methods and Results

no code implementations • 27 Apr 2023 • Qingpeng Zhu, Wenxiu Sun, Yuekun Dai, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Qianhui Sun, Chen Change Loy, Jinwei Gu, Yi Yu, Yangke Huang, Kang Zhang, Meiya Chen, Yu Wang, Yongchao Li, Hao Jiang, Amrit Kumar Muduli, Vikash Kumar, Kunal Swami, Pankaj Kumar Bajpai, Yunchao Ma, Jiajun Xiao, Zhi Ling

To evaluate the performance of different depth completion methods, we organized an RGB+sparse ToF depth completion competition.

Depth Completion

Paper
Add Code

MIPI 2023 Challenge on RGBW Remosaic: Methods and Results

no code implementations • 20 Apr 2023 • Qianhui Sun, Qingyu Yang, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Yuekun Dai, Wenxiu Sun, Qingpeng Zhu, Chen Change Loy, Jinwei Gu

Developing and integrating advanced image sensors with novel algorithms in camera systems are prevalent with the increasing demand for computational photography and imaging on mobile platforms.

SSIM

Paper
Add Code

MIPI 2023 Challenge on RGBW Fusion: Methods and Results

no code implementations • 20 Apr 2023 • Qianhui Sun, Qingyu Yang, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Yuekun Dai, Wenxiu Sun, Qingpeng Zhu, Chen Change Loy, Jinwei Gu

Developing and integrating advanced image sensors with novel algorithms in camera systems are prevalent with the increasing demand for computational photography and imaging on mobile platforms.

SSIM

Paper
Add Code

Iterative Prompt Learning for Unsupervised Backlit Image Enhancement

no code implementations • ICCV 2023 • Zhexin Liang, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Chen Change Loy

To solve this issue, we devise a prompt learning framework that first learns an initial prompt pair by constraining the text-image similarity between the prompt (negative/positive sample) and the corresponding image (backlit image/well-lit image) in the CLIP latent space.

Image Enhancement Image Manipulation

Paper
Add Code

Nighttime Smartphone Reflective Flare Removal Using Optical Center Symmetry Prior

1 code implementation • CVPR 2023 • Yuekun Dai, Yihang Luo, Shangchen Zhou, Chongyi Li, Chen Change Loy

With the dataset, neural networks can be trained to remove the reflective flares effectively.

Flare Removal

Paper
Code

Embedding Fourier for Ultra-High-Definition Low-Light Image Enhancement

no code implementations • 23 Feb 2023 • Chongyi Li, Chun-Le Guo, Man Zhou, Zhexin Liang, Shangchen Zhou, Ruicheng Feng, Chen Change Loy

Our approach is motivated by a few unique characteristics in the Fourier domain: 1) most luminance information concentrates on amplitudes while noise is closely related to phases, and 2) a high-resolution image and its low-resolution version share similar amplitude patterns. Through embedding Fourier into our network, the amplitude and phase of a low-light image are separately processed to avoid amplifying noise when enhancing luminance.

4k Low-Light Image Enhancement +1

Paper
Add Code

Deep Dynamic Scene Deblurring from Optical Flow

no code implementations • 18 Jan 2023 • Jiawei Zhang, Jinshan Pan, Daoye Wang, Shangchen Zhou, Xing Wei, Furong Zhao, Jianbo Liu, Jimmy Ren

In this paper, we explore optical flow to remove dynamic scene blur by using the multi-scale spatially variant recurrent neural network (RNN).

Deblurring Optical Flow Estimation

Paper
Add Code

Learning Dual Memory Dictionaries for Blind Face Restoration

1 code implementation • 15 Oct 2022 • Xiaoming Li, Shiguang Zhang, Shangchen Zhou, Lei Zhang, WangMeng Zuo

Generally, it is a challenging and intractable task to improve the photo-realistic performance of blind restoration and adaptively handle the generic and specific restoration scenarios with a single unified model.

Blind Face Restoration

112

Paper
Code

Flare7K: A Phenomenological Nighttime Flare Removal Dataset

1 code implementation • 12 Oct 2022 • Yuekun Dai, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Chen Change Loy

In this paper, we introduce, Flare7K, the first nighttime flare removal dataset, which is generated based on the observation and statistics of real-world nighttime lens flares.

Ranked #2 on Flare Removal on Flare7K

Flare Removal

110

Paper
Code

MIPI 2022 Challenge on RGBW Sensor Re-mosaic: Dataset and Report

1 code implementation • 15 Sep 2022 • Qingyu Yang, Guang Yang, Jun Jiang, Chongyi Li, Ruicheng Feng, Shangchen Zhou, Wenxiu Sun, Qingpeng Zhu, Chen Change Loy, Jinwei Gu

A detailed description of all models developed in this challenge is provided in this paper.

SSIM

Paper
Code

MIPI 2022 Challenge on RGBW Sensor Fusion: Dataset and Report

1 code implementation • 15 Sep 2022 • Qingyu Yang, Guang Yang, Jun Jiang, Chongyi Li, Ruicheng Feng, Shangchen Zhou, Wenxiu Sun, Qingpeng Zhu, Chen Change Loy, Jinwei Gu

A detailed description of all models developed in this challenge is provided in this paper.

Sensor Fusion SSIM

Paper
Code

MIPI 2022 Challenge on RGB+ToF Depth Completion: Dataset and Report

1 code implementation • 15 Sep 2022 • Wenxiu Sun, Qingpeng Zhu, Chongyi Li, Ruicheng Feng, Shangchen Zhou, Jun Jiang, Qingyu Yang, Chen Change Loy, Jinwei Gu

A detailed description of all models developed in this challenge is provided in this paper.

Depth Completion

Paper
Code

MIPI 2022 Challenge on Quad-Bayer Re-mosaic: Dataset and Report

1 code implementation • 15 Sep 2022 • Qingyu Yang, Guang Yang, Jun Jiang, Chongyi Li, Ruicheng Feng, Shangchen Zhou, Wenxiu Sun, Qingpeng Zhu, Chen Change Loy, Jinwei Gu

A detailed description of all models developed in this challenge is provided in this paper.

SSIM

Paper
Code

MIPI 2022 Challenge on Under-Display Camera Image Restoration: Methods and Results

1 code implementation • 15 Sep 2022 • Ruicheng Feng, Chongyi Li, Shangchen Zhou, Wenxiu Sun, Qingpeng Zhu, Jun Jiang, Qingyu Yang, Chen Change Loy, Jinwei Gu

In this paper, we summarize and review the Under-Display Camera (UDC) Image Restoration track on MIPI 2022.

Image Restoration

Paper
Code

CuDi: Curve Distillation for Efficient and Controllable Exposure Adjustment

no code implementations • 28 Jul 2022 • Chongyi Li, Chunle Guo, Ruicheng Feng, Shangchen Zhou, Chen Change Loy

Our method inherits the zero-reference learning and curve-based framework from an effective low-light image enhancement method, Zero-DCE, with further speed up in its inference speed, reduction in its model size, and extension to controllable exposure adjustment.

Low-Light Image Enhancement

Paper
Add Code

Towards Robust Blind Face Restoration with Codebook Lookup Transformer

1 code implementation • 22 Jun 2022 • Shangchen Zhou, Kelvin C. K. Chan, Chongyi Li, Chen Change Loy

In this paper, we demonstrate that a learned discrete codebook prior in a small proxy space largely reduces the uncertainty and ambiguity of restoration mapping by casting blind face restoration as a code prediction task, while providing rich visual atoms for generating high-quality faces.

Ranked #1 on Blind Face Restoration on CelebA-Test

Blind Face Restoration

13,381

Paper
Code

On the Generalization of BasicVSR++ to Video Deblurring and Denoising

1 code implementation • 11 Apr 2022 • Kelvin C. K. Chan, Shangchen Zhou, Xiangyu Xu, Chen Change Loy

The exploitation of long-term information has been a long-standing problem in video restoration.

Deblurring Denoising +2

532

Paper
Code

LEDNet: Joint Low-light Enhancement and Deblurring in the Dark

1 code implementation • 7 Feb 2022 • Shangchen Zhou, Chongyi Li, Chen Change Loy

With the pipeline, we present the first large-scale dataset for joint low-light enhancement and deblurring.

Ranked #2 on Low-Light Image Enhancement on Sony-Total-Dark

Deblurring Low-light Image Deblurring and Enhancement +1

187

Paper
Code

Investigating Tradeoffs in Real-World Video Super-Resolution

1 code implementation • CVPR 2022 • Kelvin C. K. Chan, Shangchen Zhou, Xiangyu Xu, Chen Change Loy

The diversity and complexity of degradations in real-world video super-resolution (VSR) pose non-trivial challenges in inference and training.

Ranked #9 on Video Super-Resolution on MSU Video Upscalers: Quality Enhancement

Benchmarking Video Super-Resolution

838

Paper
Code

BasicVSR++: Improving Video Super-Resolution with Enhanced Propagation and Alignment

3 code implementations • CVPR 2022 • Kelvin C. K. Chan, Shangchen Zhou, Xiangyu Xu, Chen Change Loy

We show that by empowering the recurrent framework with the enhanced propagation and alignment, one can exploit spatiotemporal information across misaligned video frames more effectively.

Ranked #1 on Video Enhancement on MFQE v2

Analog Video Restoration Video Enhancement +1

6,572

Paper
Code

NTIRE 2021 Challenge on Quality Enhancement of Compressed Video: Methods and Results

1 code implementation • 21 Apr 2021 • Ren Yang, Radu Timofte, Jing Liu, Yi Xu, Xinjian Zhang, Minyi Zhao, Shuigeng Zhou, Kelvin C. K. Chan, Shangchen Zhou, Xiangyu Xu, Chen Change Loy, Xin Li, Fanglong Liu, He Zheng, Lielin Jiang, Qi Zhang, Dongliang He, Fu Li, Qingqing Dang, Yibin Huang, Matteo Maggioni, Zhongqian Fu, Shuai Xiao, Cheng Li, Thomas Tanay, Fenglong Song, Wentao Chao, Qiang Guo, Yan Liu, Jiang Li, Xiaochao Qu, Dewang Hou, Jiayu Yang, Lyn Jiang, Di You, Zhenyu Zhang, Chong Mou, Iaroslav Koshelev, Pavel Ostyakov, Andrey Somov, Jia Hao, Xueyi Zou, Shijie Zhao, Xiaopeng Sun, Yiting Liao, Yuanzhi Zhang, Qing Wang, Gen Zhan, Mengxi Guo, Junlin Li, Ming Lu, Zhan Ma, Pablo Navarrete Michelini, Hai Wang, Yiyun Chen, Jingyu Guo, Liliang Zhang, Wenming Yang, Sijung Kim, Syehoon Oh, Yucong Wang, Minjie Cai, Wei Hao, Kangdi Shi, Liangyan Li, Jun Chen, Wei Gao, Wang Liu, XiaoYu Zhang, Linjie Zhou, Sixin Lin, Ru Wang

This paper reviews the first NTIRE challenge on quality enhancement of compressed video, with a focus on the proposed methods and results.

Paper
Code

Efficient Regional Memory Network for Video Object Segmentation

1 code implementation • CVPR 2021 • Haozhe Xie, Hongxun Yao, Shangchen Zhou, Shengping Zhang, Wenxiu Sun

For the current query frame, the query regions are tracked and predicted based on the optical flow estimated from the previous frame.

Ranked #9 on Semi-Supervised Video Object Segmentation on DAVIS (no YouTube-VOS training)

Object One-shot visual object segmentation +3

Paper
Code

Flexible Piecewise Curves Estimation for Photo Enhancement

no code implementations • 26 Oct 2020 • Chongyi Li, Chunle Guo, Qiming Ai, Shangchen Zhou, Chen Change Loy

This paper presents a new method, called FlexiCurve, for photo enhancement.

Paper
Add Code

Blind Face Restoration via Deep Multi-scale Component Dictionaries

1 code implementation • ECCV 2020 • Xiaoming Li, Chaofeng Chen, Shangchen Zhou, Xianhui Lin, WangMeng Zuo, Lei Zhang

Next, with the degraded input, we match and select the most similar component features from their corresponding dictionaries and transfer the high-quality details to the input via the proposed dictionary feature transfer (DFT) block.

Ranked #32 on Video Super-Resolution on MSU Video Super Resolution Benchmark: Detail Restoration

Blind Face Restoration Video Super-Resolution

908

Paper
Code

Cross-Scale Internal Graph Neural Network for Image Super-Resolution

1 code implementation • NeurIPS 2020 • Shangchen Zhou, Jiawei Zhang, WangMeng Zuo, Chen Change Loy

Specifically, we dynamically construct a cross-scale graph by searching k-nearest neighboring patches in the downsampled LR image for each query patch in the LR image.

Image Restoration Image Super-Resolution

304

Paper
Code

Pix2Vox++: Multi-scale Context-aware 3D Object Reconstruction from Single and Multiple Images

3 code implementations • 22 Jun 2020 • Haozhe Xie, Hongxun Yao, Shengping Zhang, Shangchen Zhou, Wenxiu Sun

A multi-scale context-aware fusion module is then introduced to adaptively select high-quality reconstructions for different parts from all coarse 3D volumes to obtain a fused 3D volume.

Ranked #3 on 3D Object Reconstruction on Data3D−R2N2

3D Object Reconstruction

439

Paper
Code

GRNet: Gridding Residual Network for Dense Point Cloud Completion

1 code implementation • ECCV 2020 • Haozhe Xie, Hongxun Yao, Shangchen Zhou, Jiageng Mao, Shengping Zhang, Wenxiu Sun

In particular, we devise two novel differentiable layers, named Gridding and Gridding Reverse, to convert between point clouds and 3D grids without losing structural information.

Ranked #3 on Point Cloud Completion on Completion3D

Point Cloud Completion

285

Paper
Code

Hybrid Graph Neural Networks for Crowd Counting

no code implementations • 31 Jan 2020 • Ao Luo, Fan Yang, Xin Li, Dong Nie, Zhicheng Jiao, Shangchen Zhou, Hong Cheng

In this paper, we present a novel network structure called Hybrid Graph Neural Network (HyGnn) which targets to relieve the problem by interweaving the multi-scale features for crowd density as well as its auxiliary task (localization) together and performing joint reasoning over a graph.

Crowd Counting

Paper
Add Code

SSAH: Semi-supervised Adversarial Deep Hashing with Self-paced Hard Sample Generation

no code implementations • 20 Nov 2019 • Sheng Jin, Shangchen Zhou, Yao Liu, Chao Chen, Xiaoshuai Sun, Hongxun Yao, Xian-Sheng Hua

In this paper, we propose a novel Semi-supervised Self-pace Adversarial Hashing method, named SSAH to solve the above problems in a unified framework.

Deep Hashing Generative Adversarial Network

Paper
Add Code

Toward 3D Object Reconstruction from Stereo Images

1 code implementation • 18 Oct 2019 • Haozhe Xie, Hongxun Yao, Shangchen Zhou, Shengping Zhang, Xiaoshuai Sun, Wenxiu Sun

Inferring the 3D shape of an object from an RGB image has shown impressive results, however, existing methods rely primarily on recognizing the most similar 3D model from the training set to solve the problem.

3D Object Reconstruction Benchmarking +1

Paper
Code

Spatio-Temporal Filter Adaptive Network for Video Deblurring

1 code implementation • ICCV 2019 • Shangchen Zhou, Jiawei Zhang, Jinshan Pan, Haozhe Xie, WangMeng Zuo, Jimmy Ren

To overcome the limitation of separate optical flow estimation, we propose a Spatio-Temporal Filter Adaptive Network (STFAN) for the alignment and deblurring in a unified framework.

Ranked #3 on Deblurring on DVD (using extra training data)

Deblurring Image Deblurring +1

167

Paper
Code

DAVANet: Stereo Deblurring with View Aggregation

1 code implementation • CVPR 2019 • Shangchen Zhou, Jiawei Zhang, WangMeng Zuo, Haozhe Xie, Jinshan Pan, Jimmy Ren

Nowadays stereo cameras are more commonly adopted in emerging devices such as dual-lens smartphones and unmanned aerial vehicles.

Deblurring Image Deblurring

136

Paper
Code

Pix2Vox: Context-aware 3D Reconstruction from Single and Multi-view Images

5 code implementations • ICCV 2019 • Haozhe Xie, Hongxun Yao, Xiaoshuai Sun, Shangchen Zhou, Shengping Zhang

Then, a context-aware fusion module is introduced to adaptively select high-quality reconstructions for each part (e. g., table legs) from different coarse 3D volumes to obtain a fused 3D volume.

Ranked #4 on 3D Object Reconstruction on Data3D−R2N2

3D Object Reconstruction 3D Reconstruction +1

439

Paper
Code

Deep Saliency Hashing

no code implementations • 4 Jul 2018 • Sheng Jin, Hongxun Yao, Xiaoshuai Sun, Shangchen Zhou, Lei Zhang, Xian-Sheng Hua

As the core of DSaH, the saliency loss guides the attention network to mine discriminative regions from pairs of images.

Deep Hashing Quantization

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.