Search Results for author: Chen Change Loy

Found 271 papers, 180 papers with code

Video Object Segmentation with Joint Re-identification and Attention-Aware Mask Propagation

no code implementations • ECCV 2018 • Xiaoxiao Li, Chen Change Loy

The problem of video object segmentation can become extremely challenging when multiple instances co-exist.

Semantic Segmentation Video Object Segmentation +1

Paper
Add Code

Mix-and-Match Tuning for Self-Supervised Semantic Segmentation

no code implementations • 2 Dec 2017 • Xiaohang Zhan, Ziwei Liu, Ping Luo, Xiaoou Tang, Chen Change Loy

The key of this new form of learning is to design a proxy task (e. g. image colorization), from which a discriminative loss can be formulated on unlabeled data.

Colorization Image Colorization +3

Paper
Add Code

From Facial Expression Recognition to Interpersonal Relation Prediction

no code implementations • 21 Sep 2016 • Zhanpeng Zhang, Ping Luo, Chen Change Loy, Xiaoou Tang

Unlike existing models that typically learn from facial expression labels alone, we devise an effective multitask network that is capable of learning from rich auxiliary attributes such as gender, age, and head pose, beyond just facial expression data.

Attribute Facial Expression Recognition +2

Paper
Add Code

Be Your Own Prada: Fashion Synthesis with Structural Coherence

no code implementations • ICCV 2017 • Shizhan Zhu, Sanja Fidler, Raquel Urtasun, Dahua Lin, Chen Change Loy

In the second stage, a generative model with a newly proposed compositional mapping layer is used to render the final image with precise regions and textures conditioned on this map.

Fashion Synthesis Semantic Segmentation +1

Paper
Add Code

Faceness-Net: Face Detection through Deep Facial Part Responses

no code implementations • 29 Jan 2017 • Shuo Yang, Ping Luo, Chen Change Loy, Xiaoou Tang

We propose a deep convolutional neural network (CNN) for face detection leveraging on facial attributes based supervision.

Face Detection

Paper
Add Code

Deep Learning Markov Random Field for Semantic Segmentation

no code implementations • 23 Jun 2016 • Ziwei Liu, Xiaoxiao Li, Ping Luo, Chen Change Loy, Xiaoou Tang

Semantic segmentation tasks can be well modeled by Markov Random Field (MRF).

Segmentation Semantic Segmentation +2

Paper
Add Code

Face Detection through Scale-Friendly Deep Convolutional Networks

no code implementations • 9 Jun 2017 • Shuo Yang, Yuanjun Xiong, Chen Change Loy, Xiaoou Tang

Specifically, our method achieves 76. 4 average precision on the challenging WIDER FACE dataset and 96% recall rate on the FDDB dataset with 7 frames per second (fps) for 900 * 1300 input image.

Face Detection

Paper
Add Code

Local Similarity-Aware Deep Feature Embedding

no code implementations • NeurIPS 2016 • Chen Huang, Chen Change Loy, Xiaoou Tang

Existing deep embedding methods in vision tasks are capable of learning a compact Euclidean space from images, where Euclidean distances correspond to a similarity metric.

Ranked #27 on Metric Learning on CUB-200-2011

Image Retrieval Retrieval +2

Paper
Add Code

Deep Cascaded Bi-Network for Face Hallucination

no code implementations • 18 Jul 2016 • Shizhan Zhu, Sifei Liu, Chen Change Loy, Xiaoou Tang

We present a novel framework for hallucinating faces of unconstrained poses and with very low resolution (face size as small as 5pxIOD).

Ranked #5 on Image Super-Resolution on VggFace2 - 8x upscaling

Face Hallucination Hallucination

Paper
Add Code

Discriminative Sparse Neighbor Approximation for Imbalanced Learning

no code implementations • 3 Feb 2016 • Chen Huang, Chen Change Loy, Xiaoou Tang

These methods further deteriorate on small, imbalanced data that has a large degree of class overlap.

Paper
Add Code

Reading Scene Text in Deep Convolutional Sequences

1 code implementation • 14 Jun 2015 • Pan He, Weilin Huang, Yu Qiao, Chen Change Loy, Xiaoou Tang

We develop a Deep-Text Recurrent Network (DTRN) that regards scene text reading as a sequence labelling problem.

Scene Text Recognition

Paper
Code

Towards Arbitrary-View Face Alignment by Recommendation Trees

no code implementations • 20 Nov 2015 • Shizhan Zhu, Cheng Li, Chen Change Loy, Xiaoou Tang

The unified framework seamlessly handles different viewpoints and landmark protocols, and it is trained by optimising directly on landmark locations, thus yielding superior results on arbitrary-view face alignment.

Face Alignment Head Pose Estimation +1

Paper
Add Code

An Empirical Study of Recent Face Alignment Methods

no code implementations • 16 Nov 2015 • Heng Yang, Xuhui Jia, Chen Change Loy, Peter Robinson

In this paper, we carry out a rigorous evaluation of these methods by making the following contributions: 1) we proposes a new evaluation metric for face alignment on a set of images, i. e., area under error distribution curve within a threshold, AUC$_\alpha$, given the fact that the traditional evaluation measure (mean error) is very sensitive to big alignment error.

Face Alignment Face Detection

Paper
Add Code

Semantic Image Segmentation via Deep Parsing Network

no code implementations • ICCV 2015 • Ziwei Liu, Xiaoxiao Li, Ping Luo, Chen Change Loy, Xiaoou Tang

This paper addresses semantic image segmentation by incorporating rich information into Markov Random Field (MRF), including high-order relations and mixture of label contexts.

Ranked #89 on Semantic Segmentation on Cityscapes test

Image Segmentation Semantic Segmentation

Paper
Add Code

From Facial Parts Responses to Face Detection: A Deep Learning Approach

1 code implementation • ICCV 2015 • Shuo Yang, Ping Luo, Chen Change Loy, Xiaoou Tang

In this paper, we propose a novel deep convolutional network (DCN) that achieves outstanding performance on FDDB, PASCAL Face, and AFW.

Face Detection

Paper
Code

Learning Social Relation Traits from Face Images

no code implementations • ICCV 2015 • Zhanpeng Zhang, Ping Luo, Chen Change Loy, Xiaoou Tang

Social relation defines the association, e. g, warm, friendliness, and dominance, between two or more people.

Attribute Relation

Paper
Add Code

Learning Deep Representation for Face Alignment with Auxiliary Attributes

no code implementations • 18 Aug 2014 • Zhanpeng Zhang, Ping Luo, Chen Change Loy, Xiaoou Tang

In this study, we show that landmark detection or face alignment task is not a single and independent problem.

Ranked #13 on Unsupervised Facial Landmark Detection on MAFL

Attribute Face Alignment

Paper
Add Code

Boosting Optical Character Recognition: A Super-Resolution Approach

no code implementations • 7 Jun 2015 • Chao Dong, Ximei Zhu, Yubin Deng, Chen Change Loy, Yu Qiao

Text image super-resolution is a challenging yet open research problem in the computer vision community.

Image Super-Resolution Optical Character Recognition +1

Paper
Add Code

Learning to Recognize Pedestrian Attribute

no code implementations • 5 Jan 2015 • Yubin Deng, Ping Luo, Chen Change Loy, Xiaoou Tang

Learning to recognize pedestrian attributes at far distance is a challenging problem in visual surveillance since face and body close-shots are hardly available; instead, only far-view image frames of pedestrian are given.

Attribute Informativeness

Paper
Add Code

Learning from Multiple Sources for Video Summarisation

no code implementations • 13 Jan 2015 • Xiatian Zhu, Chen Change Loy, Shaogang Gong

Many visual surveillance tasks, e. g. video summarisation, is conventionally accomplished through analysing imagerybased features.

Clustering Video Understanding

Paper
Add Code

Crowd Saliency Detection via Global Similarity Structure

no code implementations • 14 Oct 2014 • Mei Kuan Lim, Ven Jyn Kok, Chen Change Loy, Chee Seng Chan

This paper proposes a novel framework to identify and localize salient regions in a crowd scene, by transforming low-level features extracted from crowd motion field into a global similarity structure.

Saliency Detection

Paper
Add Code

Transferring Landmark Annotations for Cross-Dataset Face Alignment

no code implementations • 2 Sep 2014 • Shizhan Zhu, Cheng Li, Chen Change Loy, Xiaoou Tang

We show extensive results on combining various popular databases (LFW, AFLW, LFPW, HELEN) for improved cross-dataset and unseen data alignment.

Face Alignment Object Recognition

Paper
Add Code

Zoom-Net: Mining Deep Feature Interactions for Visual Relationship Recognition

no code implementations • ECCV 2018 • Guojun Yin, Lu Sheng, Bin Liu, Nenghai Yu, Xiaogang Wang, Jing Shao, Chen Change Loy

We show that by encouraging deep message propagation and interactions between local object features and global predicate features, one can achieve compelling performance in recognizing complex relationships without using any linguistic priors.

Object

Paper
Add Code

Improving On-policy Learning with Statistical Reward Accumulation

no code implementations • 7 Sep 2018 • Yubin Deng, Ke Yu, Dahua Lin, Xiaoou Tang, Chen Change Loy

Most methods in deep-RL achieve good results via the maximization of the reward signal provided by the environment, typically in the form of discounted cumulative returns.

Atari Games

Paper
Add Code

Instance-level Facial Attributes Transfer with Geometry-Aware Flow

no code implementations • 30 Nov 2018 • Weidong Yin, Ziwei Liu, Chen Change Loy

Geometry-aware flow is able to warp the source face attribute into the target face context and generate a warp-and-blend result.

Attribute Hallucination

Paper
Add Code

Lifelong Learning via Progressive Distillation and Retrospection

no code implementations • ECCV 2018 • Saihui Hou, Xinyu Pan, Chen Change Loy, Zilei Wang, Dahua Lin

Lifelong learning aims at adapting a learned model to new tasks while retaining the knowledge gained earlier.

Knowledge Distillation

Paper
Add Code

Cumulative Attribute Space for Age and Crowd Density Estimation

no code implementations • CVPR 2013 • Ke Chen, Shaogang Gong, Tao Xiang, Chen Change Loy

A number of computer vision problems such as human age estimation, crowd density estimation and body/face pose (view angle) estimation can be formulated as a regression problem by learning a mapping function between a high dimensional vector-formed feature input and a scalarvalued output.

Age Estimation Attribute +3

Paper
Add Code

Constructing Robust Affinity Graphs for Spectral Clustering

no code implementations • CVPR 2014 • Xiatian Zhu, Chen Change Loy, Shaogang Gong

Spectral clustering requires robust and meaningful affinity graphs as input in order to form clusters with desired structures that can well support human intuition.

Clustering

Paper
Add Code

Scene-Independent Group Profiling in Crowd

no code implementations • CVPR 2014 • Jing Shao, Chen Change Loy, Xiaogang Wang

Groups are the primary entities that make up a crowd.

Scene Understanding

Paper
Add Code

Deeply Learned Attributes for Crowded Scene Understanding

no code implementations • CVPR 2015 • Jing Shao, Kai Kang, Chen Change Loy, Xiaogang Wang

We further measure user study performance on WWW and compare this with the proposed deep models.

Attribute Multi-Task Learning +1

Paper
Add Code

Unsupervised Learning of Discriminative Attributes and Visual Representations

no code implementations • CVPR 2016 • Chen Huang, Chen Change Loy, Xiaoou Tang

Attributes offer useful mid-level features to interpret visual data.

Attribute Clustering +5

Paper
Add Code

Learning Deep Representation for Imbalanced Classification

no code implementations • CVPR 2016 • Chen Huang, Yining Li, Chen Change Loy, Xiaoou Tang

We further demonstrate that more discriminative deep representation can be learned by enforcing a deep network to maintain both inter-cluster and inter-class margins.

Classification General Classification +2

Paper
Add Code

WIDER Face and Pedestrian Challenge 2018: Methods and Results

no code implementations • 19 Feb 2019 • Chen Change Loy, Dahua Lin, Wanli Ouyang, Yuanjun Xiong, Shuo Yang, Qingqiu Huang, Dongzhan Zhou, Wei Xia, Quanquan Li, Ping Luo, Junjie Yan, Jian-Feng Wang, Zuoxin Li, Ye Yuan, Boxun Li, Shuai Shao, Gang Yu, Fangyun Wei, Xiang Ming, Dong Chen, Shifeng Zhang, Cheng Chi, Zhen Lei, Stan Z. Li, Hongkai Zhang, Bingpeng Ma, Hong Chang, Shiguang Shan, Xilin Chen, Wu Liu, Boyan Zhou, Huaxiong Li, Peng Cheng, Tao Mei, Artem Kukharenko, Artem Vasenin, Nikolay Sergievskiy, Hua Yang, Liangqi Li, Qiling Xu, Yuan Hong, Lin Chen, Mingjun Sun, Yirong Mao, Shiying Luo, Yongjun Li, Ruiping Wang, Qiaokang Xie, Ziyang Wu, Lei Lu, Yiheng Liu, Wengang Zhou

This paper presents a review of the 2018 WIDER Challenge on Face and Pedestrian.

Face Detection Pedestrian Detection +2

Paper
Add Code

Unsupervised Bi-directional Flow-based Video Generation from one Snapshot

no code implementations • 3 Mar 2019 • Lu Sheng, Junting Pan, Jiaming Guo, Jing Shao, Xiaogang Wang, Chen Change Loy

Imagining multiple consecutive frames given one single snapshot is challenging, since it is difficult to simultaneously predict diverse motions from a single image and faithfully generate novel frames without visual distortions.

Video Generation

Paper
Add Code

TransGaGa: Geometry-Aware Unsupervised Image-to-Image Translation

no code implementations • CVPR 2019 • Wayne Wu, Kaidi Cao, Cheng Li, Chen Qian, Chen Change Loy

Extensive experiments demonstrate the superior performance of our method to other state-of-the-art approaches, especially in the challenging near-rigid and non-rigid objects translation tasks.

Translation Unsupervised Image-To-Image Translation

Paper
Add Code

Learning to Synthesize Fashion Textures

no code implementations • 18 Nov 2019 • Wu Shi, Tak-Wai Hui, Ziwei Liu, Dahua Lin, Chen Change Loy

Another important observation is that fashion textures are multi-modal.

Paper
Add Code

EcoNAS: Finding Proxies for Economical Neural Architecture Search

no code implementations • CVPR 2020 • Dongzhan Zhou, Xinchi Zhou, Wenwei Zhang, Chen Change Loy, Shuai Yi, Xuesen Zhang, Wanli Ouyang

While many methods have been proposed to improve the efficiency of NAS, the search progress is still laborious because training and evaluating plausible architectures over large search space is time-consuming.

Neural Architecture Search

Paper
Add Code

Everybody's Talkin': Let Me Talk as You Want

no code implementations • 15 Jan 2020 • Linsen Song, Wayne Wu, Chen Qian, Ran He, Chen Change Loy

The audio-translated expression parameters are then used to synthesize a photo-realistic human subject in each video frame, with the movement of the mouth regions precisely mapped to the source audio.

3D Face Reconstruction

Paper
Add Code

Residual Knowledge Distillation

no code implementations • 21 Feb 2020 • Mengya Gao, Yujun Shen, Quanquan Li, Chen Change Loy

Knowledge distillation (KD) is one of the most potent ways for model compression.

Knowledge Distillation Model Compression

Paper
Add Code

TransMoMo: Invariance-Driven Unsupervised Video Motion Retargeting

no code implementations • CVPR 2020 • Zhuoqian Yang, Wentao Zhu, Wayne Wu, Chen Qian, Qiang Zhou, Bolei Zhou, Chen Change Loy

We present a lightweight video motion retargeting approach TransMoMo that is capable of transferring motion of a person in a source video realistically to another video of a target person.

motion retargeting

Paper
Add Code

MessyTable: Instance Association in Multiple Camera Views

no code implementations • ECCV 2020 • Zhongang Cai, Junzhe Zhang, Daxuan Ren, Cunjun Yu, Haiyu Zhao, Shuai Yi, Chai Kiat Yeo, Chen Change Loy

We present an interesting and challenging dataset that features a large number of scenes with messy tables captured from multiple camera views.

Paper
Add Code

Understanding Deformable Alignment in Video Super-Resolution

no code implementations • 15 Sep 2020 • Kelvin C. K. Chan, Xintao Wang, Ke Yu, Chao Dong, Chen Change Loy

Aside from the contributions to deformable alignment, our formulation inspires a more flexible approach to introduce offset diversity to flow-based alignment, improving its performance.

Optical Flow Estimation Video Super-Resolution

Paper
Add Code

Flexible Piecewise Curves Estimation for Photo Enhancement

no code implementations • 26 Oct 2020 • Chongyi Li, Chunle Guo, Qiming Ai, Shangchen Zhou, Chen Change Loy

This paper presents a new method, called FlexiCurve, for photo enhancement.

Paper
Add Code

GLEAN: Generative Latent Bank for Large-Factor Image Super-Resolution

no code implementations • CVPR 2021 • Kelvin C. K. Chan, Xintao Wang, Xiangyu Xu, Jinwei Gu, Chen Change Loy

We show that pre-trained Generative Adversarial Networks (GANs), e. g., StyleGAN, can be used as a latent bank to improve the restoration quality of large-factor image super-resolution (SR).

Decoder Image Super-Resolution

Paper
Add Code

Positional Encoding as Spatial Inductive Bias in GANs

no code implementations • CVPR 2021 • Rui Xu, Xintao Wang, Kai Chen, Bolei Zhou, Chen Change Loy

In this work, taking SinGAN and StyleGAN2 as examples, we show that such capability, to a large extent, is brought by the implicit positional encoding when using zero padding in the generators.

Image Manipulation Inductive Bias +1

Paper
Add Code

CARAFE++: Unified Content-Aware ReAssembly of FEatures

no code implementations • 7 Dec 2020 • Jiaqi Wang, Kai Chen, Rui Xu, Ziwei Liu, Chen Change Loy, Dahua Lin

Feature reassembly, i. e. feature downsampling and upsampling, is a key operation in a number of modern convolutional network architectures, e. g., residual networks and feature pyramids.

Image Inpainting Instance Segmentation +3

Paper
Add Code

Chasing the Tail in Monocular 3D Human Reconstruction with Prototype Memory

no code implementations • 29 Dec 2020 • Yu Rong, Ziwei Liu, Chen Change Loy

The reason is that most of the current models perform regression based on a single human prototype, which is similar to common poses while far from the rare poses.

3D Human Reconstruction regression

Paper
Add Code

Unsupervised 3D Shape Completion through GAN Inversion

no code implementations • CVPR 2021 • Junzhe Zhang, Xinyi Chen, Zhongang Cai, Liang Pan, Haiyu Zhao, Shuai Yi, Chai Kiat Yeo, Bo Dai, Chen Change Loy

In contrast to previous fully supervised approaches, in this paper we present ShapeInversion, which introduces Generative Adversarial Network (GAN) inversion to shape completion for the first time.

Generative Adversarial Network valid

Paper
Add Code

Pareidolia Face Reenactment

no code implementations • CVPR 2021 • Linsen Song, Wayne Wu, Chaoyou Fu, Chen Qian, Chen Change Loy, Ran He

We present a new application direction named Pareidolia Face Reenactment, which is defined as animating a static illusory face to move in tandem with a human face in the video.

Face Reenactment Texture Synthesis

Paper
Add Code

MeshInversion: 3D textured mesh reconstruction with generative prior

no code implementations • 29 Sep 2021 • Junzhe Zhang, Daxuan Ren, Zhongang Cai, Chai Kiat Yeo, Bo Dai, Chen Change Loy

Reconstruction is achieved by searching for a latent space in the 3D GAN that best resembles the target mesh in accordance with the single view observation.

Paper
Add Code

SiT: Simulation Transformer for Particle-based Physics Simulation

no code implementations • 29 Sep 2021 • Yidi Shao, Chen Change Loy, Bo Dai

However, they force particles to interact with all neighbors without selection, and they fall short in capturing material semantics for different particles, leading to unsatisfactory performance, especially in generalization.

Paper
Add Code

A Comprehensive Overhaul of Distilling Unconditional GANs

no code implementations • 29 Sep 2021 • Guodong Xu, Yuenan Hou, Ziwei Liu, Chen Change Loy

To further enhance the semantic consistency between the teacher and student model, we present another latent-direction-based distillation loss that preserves the semantic relations in latent space.

Knowledge Distillation

Paper
Add Code

Playing for 3D Human Recovery

no code implementations • 14 Oct 2021 • Zhongang Cai, Mingyuan Zhang, Jiawei Ren, Chen Wei, Daxuan Ren, Zhengyu Lin, Haiyu Zhao, Lei Yang, Chen Change Loy, Ziwei Liu

Specifically, we contribute GTA-Human, a large-scale 3D human dataset generated with the GTA-V game engine, featuring a highly diverse set of subjects, actions, and scenarios.

Paper
Add Code

Self-Supervised Representation Learning: Introduction, Advances and Challenges

no code implementations • 18 Oct 2021 • Linus Ericsson, Henry Gouk, Chen Change Loy, Timothy M. Hospedales

Self-supervised representation learning methods aim to provide powerful deep feature learning without the requirement of large annotated datasets, thus alleviating the annotation bottleneck that is one of the main barriers to practical deployment of deep learning today.

Representation Learning

Paper
Add Code

The Nuts and Bolts of Adopting Transformer in GANs

no code implementations • 25 Oct 2021 • Rui Xu, Xiangyu Xu, Kai Chen, Bolei Zhou, Chen Change Loy

Transformer becomes prevalent in computer vision, especially for high-level vision tasks.

Generative Adversarial Network Image Generation

Paper
Add Code

Monocular 3D Reconstruction of Interacting Hands via Collision-Aware Factorized Refinements

no code implementations • 1 Nov 2021 • Yu Rong, Jingbo Wang, Ziwei Liu, Chen Change Loy

In this paper, we make the first attempt to reconstruct 3D interacting hands from monocular single RGB images.

3D Reconstruction

Paper
Add Code

Unsupervised Disentangling Structure and Appearance

no code implementations • 27 Sep 2018 • Wayne Wu, Kaidi Cao, Cheng Li, Chen Qian, Chen Change Loy

It is challenging to disentangle an object into two orthogonal spaces of structure and appearance since each can influence the visual observation in a different and unpredictable way.

Disentanglement

Paper
Add Code

MoCaNet: Motion Retargeting in-the-wild via Canonicalization Networks

no code implementations • 19 Dec 2021 • Wentao Zhu, Zhuoqian Yang, Ziang Di, Wayne Wu, Yizhou Wang, Chen Change Loy

Trained with the canonicalization operations and the derived regularizations, our method learns to factorize a skeleton sequence into three independent semantic subspaces, i. e., motion, structure, and view angle.

3D Reconstruction Action Analysis +2

Paper
Add Code

HuMMan: Multi-Modal 4D Human Dataset for Versatile Sensing and Modeling

no code implementations • 28 Apr 2022 • Zhongang Cai, Daxuan Ren, Ailing Zeng, Zhengyu Lin, Tao Yu, Wenjia Wang, Xiangyu Fan, Yang Gao, Yifan Yu, Liang Pan, Fangzhou Hong, Mingyuan Zhang, Chen Change Loy, Lei Yang, Ziwei Liu

4D human sensing and modeling are fundamental tasks in vision and graphics with numerous applications.

Fine-grained Action Recognition Pose Estimation

Paper
Add Code

Point-to-Voxel Knowledge Distillation for LiDAR Semantic Segmentation

no code implementations • CVPR 2022 • Yuenan Hou, Xinge Zhu, Yuexin Ma, Chen Change Loy, Yikang Li

This article addresses the problem of distilling knowledge from a large teacher model to a slim student network for LiDAR semantic segmentation.

Ranked #8 on LIDAR Semantic Segmentation on nuScenes (val mIoU metric)

3D Semantic Segmentation Knowledge Distillation +1

Paper
Add Code

CuDi: Curve Distillation for Efficient and Controllable Exposure Adjustment

no code implementations • 28 Jul 2022 • Chongyi Li, Chunle Guo, Ruicheng Feng, Shangchen Zhou, Chen Change Loy

Our method inherits the zero-reference learning and curve-based framework from an effective low-light image enhancement method, Zero-DCE, with further speed up in its inference speed, reduction in its model size, and extension to controllable exposure adjustment.

Low-Light Image Enhancement

Paper
Add Code

BeautyREC: Robust, Efficient, and Content-preserving Makeup Transfer

no code implementations • 12 Dec 2022 • Qixin Yan, Chunle Guo, Jixin Zhao, Yuekun Dai, Chen Change Loy, Chongyi Li

The key insights of this study are modeling component-specific correspondence for local makeup transfer, capturing long-range dependencies for global makeup transfer, and enabling efficient makeup transfer via a single-path structure.

Paper
Add Code

Self-Supervised Geometry-Aware Encoder for Style-Based 3D GAN Inversion

no code implementations • CVPR 2023 • Yushi Lan, Xuyi Meng, Shuai Yang, Chen Change Loy, Bo Dai

In this paper, we study the challenging problem of 3D GAN inversion where a latent code is predicted given a single face image to faithfully recover its 3D shapes and detailed textures.

3D Face Reconstruction

Paper
Add Code

Correspondence Distillation from NeRF-based GAN

no code implementations • 19 Dec 2022 • Yushi Lan, Chen Change Loy, Bo Dai

The neural radiance field (NeRF) has shown promising results in preserving the fine details of objects and scenes.

Paper
Add Code

Embedding Fourier for Ultra-High-Definition Low-Light Image Enhancement

no code implementations • 23 Feb 2023 • Chongyi Li, Chun-Le Guo, Man Zhou, Zhexin Liang, Shangchen Zhou, Ruicheng Feng, Chen Change Loy

Our approach is motivated by a few unique characteristics in the Fourier domain: 1) most luminance information concentrates on amplitudes while noise is closely related to phases, and 2) a high-resolution image and its low-resolution version share similar amplitude patterns. Through embedding Fourier into our network, the amplitude and phase of a low-light image are separately processed to avoid amplifying noise when enhancing luminance.

4k Low-Light Image Enhancement +1

Paper
Add Code

Task-Oriented Human-Object Interactions Generation with Implicit Neural Representations

no code implementations • 23 Mar 2023 • Quanzhou Li, Jingbo Wang, Chen Change Loy, Bo Dai

Generating task-oriented human-object interaction motions in simulation is challenging.

Human-Object Interaction Detection Motion Estimation +2

Paper
Add Code

SparseNeRF: Distilling Depth Ranking for Few-shot Novel View Synthesis

no code implementations • ICCV 2023 • Guangcong Wang, Zhaoxi Chen, Chen Change Loy, Ziwei Liu

Since coarse depth maps are not strictly scaled to the ground-truth depth maps, we propose a simple yet effective constraint, a local depth ranking method, on NeRFs such that the expected depth ranking of the NeRF is consistent with that of the coarse depth maps in local patches.

Novel View Synthesis

Paper
Add Code

Iterative Prompt Learning for Unsupervised Backlit Image Enhancement

no code implementations • ICCV 2023 • Zhexin Liang, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Chen Change Loy

To solve this issue, we devise a prompt learning framework that first learns an initial prompt pair by constraining the text-image similarity between the prompt (negative/positive sample) and the corresponding image (backlit image/well-lit image) in the CLIP latent space.

Image Enhancement Image Manipulation

Paper
Add Code

MIPI 2023 Challenge on RGBW Fusion: Methods and Results

no code implementations • 20 Apr 2023 • Qianhui Sun, Qingyu Yang, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Yuekun Dai, Wenxiu Sun, Qingpeng Zhu, Chen Change Loy, Jinwei Gu

Developing and integrating advanced image sensors with novel algorithms in camera systems are prevalent with the increasing demand for computational photography and imaging on mobile platforms.

SSIM

Paper
Add Code

MIPI 2023 Challenge on RGBW Remosaic: Methods and Results

no code implementations • 20 Apr 2023 • Qianhui Sun, Qingyu Yang, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Yuekun Dai, Wenxiu Sun, Qingpeng Zhu, Chen Change Loy, Jinwei Gu

Developing and integrating advanced image sensors with novel algorithms in camera systems are prevalent with the increasing demand for computational photography and imaging on mobile platforms.

SSIM

Paper
Add Code

MIPI 2023 Challenge on RGB+ToF Depth Completion: Methods and Results

no code implementations • 27 Apr 2023 • Qingpeng Zhu, Wenxiu Sun, Yuekun Dai, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Qianhui Sun, Chen Change Loy, Jinwei Gu, Yi Yu, Yangke Huang, Kang Zhang, Meiya Chen, Yu Wang, Yongchao Li, Hao Jiang, Amrit Kumar Muduli, Vikash Kumar, Kunal Swami, Pankaj Kumar Bajpai, Yunchao Ma, Jiajun Xiao, Zhi Ling

To evaluate the performance of different depth completion methods, we organized an RGB+sparse ToF depth completion competition.

Depth Completion

Paper
Add Code

Towards Multi-Layered 3D Garments Animation

no code implementations • ICCV 2023 • Yidi Shao, Chen Change Loy, Bo Dai

In this paper, we propose a novel data-driven method, called LayersNet, to model garment-level animations as particle-wise interactions in a micro physics system.

Paper
Add Code

MIPI 2023 Challenge on Nighttime Flare Removal: Methods and Results

no code implementations • 23 May 2023 • Yuekun Dai, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Qingpeng Zhu, Qianhui Sun, Wenxiu Sun, Chen Change Loy, Jinwei Gu

In this paper, we summarize and review the Nighttime Flare Removal track on MIPI 2023.

Flare Removal

Paper
Add Code

Semi-Supervised and Long-Tailed Object Detection with CascadeMatch

no code implementations • 24 May 2023 • Yuhang Zang, Kaiyang Zhou, Chen Huang, Chen Change Loy

This paper focuses on long-tailed object detection in the semi-supervised learning setting, which poses realistic challenges, but has rarely been studied in the literature.

Long-tailed Object Detection Object +3

Paper
Add Code

Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation

no code implementations • 13 Jun 2023 • Shuai Yang, Yifan Zhou, Ziwei Liu, Chen Change Loy

The framework includes two parts: key frame translation and full video translation.

Patch Matching Translation

Paper
Add Code

Adaptive Window Pruning for Efficient Local Motion Deblurring

no code implementations • 25 Jun 2023 • Haoying Li, Jixin Zhao, Shangchen Zhou, Huajun Feng, Chongyi Li, Chen Change Loy

Existing image deblurring methods predominantly focus on global deblurring, inadvertently affecting the sharpness of backgrounds in locally blurred images and wasting unnecessary computation on sharp pixels, especially for high-resolution images.

Deblurring Image Deblurring

Paper
Add Code

PointHPS: Cascaded 3D Human Pose and Shape Estimation from Point Clouds

no code implementations • 28 Aug 2023 • Zhongang Cai, Liang Pan, Chen Wei, Wanqi Yin, Fangzhou Hong, Mingyuan Zhang, Chen Change Loy, Lei Yang, Ziwei Liu

To tackle these challenges, we propose a principled framework, PointHPS, for accurate 3D HPS from point clouds captured in real-world settings, which iteratively refines point features through a cascaded architecture.

3D human pose and shape estimation

Paper
Add Code

Audio-Driven Dubbing for User Generated Contents via Style-Aware Semi-Parametric Synthesis

no code implementations • 31 Aug 2023 • Linsen Song, Wayne Wu, Chaoyou Fu, Chen Change Loy, Ran He

Existing automated dubbing methods are usually designed for Professionally Generated Content (PGC) production, which requires massive training data and training time to learn a person-specific audio-video mapping.

Paper
Add Code

Interpret Vision Transformers as ConvNets with Dynamic Convolutions

no code implementations • 19 Sep 2023 • Chong Zhou, Chen Change Loy, Bo Dai

There has been a debate about the superiority between vision Transformers and ConvNets, serving as the backbone of computer vision models.

Paper
Add Code

DeformToon3D: Deformable Neural Radiance Fields for 3D Toonification

no code implementations • ICCV 2023 • Junzhe Zhang, Yushi Lan, Shuai Yang, Fangzhou Hong, Quan Wang, Chai Kiat Yeo, Ziwei Liu, Chen Change Loy

In this paper, we address the challenging problem of 3D toonification, which involves transferring the style of an artistic domain onto a target 3D face with stylized geometry and texture.

Decoder

Paper
Add Code

PaintHuman: Towards High-fidelity Text-to-3D Human Texturing via Denoised Score Distillation

no code implementations • 14 Oct 2023 • Jianhui Yu, Hao Zhu, Liming Jiang, Chen Change Loy, Weidong Cai, Wayne Wu

We first propose a novel score function, Denoised Score Distillation (DSD), which directly modifies the SDS by introducing negative gradient components to iteratively correct the gradient direction and generate high-quality textures.

Text to 3D text-to-3d-human +1

Paper
Add Code

VideoBooth: Diffusion-based Video Generation with Image Prompts

no code implementations • 1 Dec 2023 • Yuming Jiang, Tianxing Wu, Shuai Yang, Chenyang Si, Dahua Lin, Yu Qiao, Chen Change Loy, Ziwei Liu

In this paper, we study the task of video generation with image prompts, which provide more accurate and direct content control beyond the text prompts.

Video Generation

Paper
Add Code

Digital Life Project: Autonomous 3D Characters with Social Intelligence

no code implementations • 7 Dec 2023 • Zhongang Cai, Jianping Jiang, Zhongfei Qing, Xinying Guo, Mingyuan Zhang, Zhengyu Lin, Haiyi Mei, Chen Wei, Ruisi Wang, Wanqi Yin, Xiangyu Fan, Han Du, Liang Pan, Peng Gao, Zhitao Yang, Yang Gao, Jiaqi Li, Tianxiang Ren, Yukun Wei, Xiaogang Wang, Chen Change Loy, Lei Yang, Ziwei Liu

In this work, we present Digital Life Project, a framework utilizing language as the universal medium to build autonomous 3D characters, who are capable of engaging in social interactions and expressing with articulated body motions, thereby simulating life in a digital environment.

Ranked #2 on Motion Synthesis on InterHuman

Motion Captioning Motion Synthesis

Paper
Add Code

Gaussian3Diff: 3D Gaussian Diffusion for 3D Full Head Synthesis and Editing

no code implementations • 5 Dec 2023 • Yushi Lan, Feitong Tan, Di Qiu, Qiangeng Xu, Kyle Genova, Zeng Huang, Sean Fanello, Rohit Pandey, Thomas Funkhouser, Chen Change Loy, yinda zhang

We present a novel framework for generating photorealistic 3D human head and subsequently manipulating and reposing them with remarkable flexibility.

Face Model

Paper
Add Code

Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution

no code implementations • 11 Dec 2023 • Shangchen Zhou, Peiqing Yang, Jianyi Wang, Yihang Luo, Chen Change Loy

Text-based diffusion models have exhibited remarkable success in generation and editing, showing great promise for enhancing visual content with their generative prior.

Decoder Video Super-Resolution

Paper
Add Code

Towards Language-Driven Video Inpainting via Multimodal Large Language Models

no code implementations • 18 Jan 2024 • Jianzong Wu, Xiangtai Li, Chenyang Si, Shangchen Zhou, Jingkang Yang, Jiangning Zhang, Yining Li, Kai Chen, Yunhai Tong, Ziwei Liu, Chen Change Loy

We introduce a new task -- language-driven video inpainting, which uses natural language instructions to guide the inpainting process.

Video Inpainting

Paper
Add Code

Control Color: Multimodal Diffusion-based Interactive Image Colorization

no code implementations • 16 Feb 2024 • Zhexin Liang, Zhaochen Li, Shangchen Zhou, Chongyi Li, Chen Change Loy

We also introduce a novel module based on self-attention and a content-guided deformable autoencoder to address the long-standing issues of color overflow and inaccurate coloring.

Colorization Color Manipulation +1

Paper
Add Code

Explore In-Context Segmentation via Latent Diffusion Models

no code implementations • 14 Mar 2024 • Chaoyang Wang, Xiangtai Li, Henghui Ding, Lu Qi, Jiangning Zhang, Yunhai Tong, Chen Change Loy, Shuicheng Yan

In-context segmentation has drawn more attention with the introduction of vision foundation models.

Metric Learning Segmentation

Paper
Add Code

LN3Diff: Scalable Latent Neural Fields Diffusion for Speedy 3D Generation

no code implementations • 18 Mar 2024 • Yushi Lan, Fangzhou Hong, Shuai Yang, Shangchen Zhou, Xuyi Meng, Bo Dai, Xingang Pan, Chen Change Loy

The latent is decoded by a transformer-based decoder into a high-capacity 3D neural field.

3D Generation 3D Reconstruction +2

Paper
Add Code

Duolando: Follower GPT with Off-Policy Reinforcement Learning for Dance Accompaniment

no code implementations • 27 Mar 2024 • Li SiYao, Tianpei Gu, Zhitao Yang, Zhengyu Lin, Ziwei Liu, Henghui Ding, Lei Yang, Chen Change Loy

We introduce a novel task within the field of 3D dance generation, termed dance accompaniment, which necessitates the generation of responsive movements from a dance partner, the "follower", synchronized with the lead dancer's movements and the underlying musical rhythm.

Paper
Add Code

MOWA: Multiple-in-One Image Warping Model

no code implementations • 16 Apr 2024 • Kang Liao, Zongsheng Yue, Zhonghua Wu, Chen Change Loy

To our knowledge, this is the first work that solves multiple practical warping tasks in one single model.

Motion Estimation Multi-Task Learning

Paper
Add Code

MIPI 2024 Challenge on Nighttime Flare Removal: Methods and Results

no code implementations • 30 Apr 2024 • Yuekun Dai, Dafeng Zhang, Xiaoming Li, Zongsheng Yue, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Peiqing Yang, Zhezhu Jin, Guanqun Liu, Chen Change Loy, Lize Zhang, Shuai Liu, Chaoyu Feng, Luyang Wang, Shuan Chen, Guangqi Shao, Xiaotao Wang, Lei Lei, Qirui Yang, Qihua Cheng, Zhiqiang Xu, Yihao Liu, Huanjing Yue, Jingyu Yang, Florin-Alexandru Vasluianu, Zongwei Wu, George Ciubotariu, Radu Timofte, Zhao Zhang, Suiyi Zhao, Bo wang, Zhichao Zuo, Yanyan Wei, Kuppa Sai Sri Teja, Jayakar Reddy A, Girish Rongali, Kaushik Mitra, Zhihao Ma, Yongxu Liu, Wanying Zhang, Wei Shang, Yuhong He, Long Peng, Zhongxin Yu, Shaofei Luo, Jian Wang, Yuqi Miao, Baiang Li, Gang Wei, Rakshank Verma, Ritik Maheshwari, Rahul Tekchandani, Praful Hambarde, Satya Narayan Tazi, Santosh Kumar Vipparthi, Subrahmanyam Murala, Haopeng Zhang, Yingli Hou, Mingde Yao, Levin M S, Aniruth Sundararajan, Hari Kumar A

The increasing demand for computational photography and imaging on mobile platforms has led to the widespread development and integration of advanced image sensors with novel algorithms in camera systems.

Flare Removal

Paper
Add Code

MVIP-NeRF: Multi-view 3D Inpainting on NeRF Scenes via Diffusion Prior

no code implementations • 5 May 2024 • Honghua Chen, Chen Change Loy, Xingang Pan

Despite the emergence of successful NeRF inpainting methods built upon explicit RGB and depth 2D inpainting supervisions, these methods are inherently constrained by the capabilities of their underlying 2D inpainters.

Paper
Add Code

Deep Imbalanced Learning for Face Recognition and Attribute Prediction

1 code implementation • 1 Jun 2018 • Chen Huang, Yining Li, Chen Change Loy, Xiaoou Tang

Data for face analysis often exhibit highly-skewed class distribution, i. e., most data belong to a few majority classes, while the minority classes only contain a scarce amount of instances.

Attribute Face Recognition +1

Paper
Code

WIDER FACE: A Face Detection Benchmark

1 code implementation • CVPR 2016 • Shuo Yang, Ping Luo, Chen Change Loy, Xiaoou Tang

Face detection is one of the most studied topics in the computer vision community.

Ranked #34 on Face Detection on WIDER Face (Medium)

Face Detection

Paper
Code

Deep Network Interpolation for Continuous Imagery Effect Transition

2 code implementations • CVPR 2019 • Xintao Wang, Ke Yu, Chao Dong, Xiaoou Tang, Chen Change Loy

Deep convolutional neural network has demonstrated its capability of learning a deterministic mapping for the desired imagery effect.

Image Restoration Image-to-Image Translation +2

Paper
Code

Deep Fourier Up-Sampling

1 code implementation • 11 Oct 2022 • Man Zhou, Hu Yu, Jie Huang, Feng Zhao, Jinwei Gu, Chen Change Loy, Deyu Meng, Chongyi Li

Existing convolutional neural networks widely adopt spatial down-/up-sampling for multi-scale modeling.

Image Dehazing Image Segmentation +4

Paper
Code

An Embarrassingly Simple Approach for Knowledge Distillation

1 code implementation • 5 Dec 2018 • Mengya Gao, Yujun Shen, Quanquan Li, Junjie Yan, Liang Wan, Dahua Lin, Chen Change Loy, Xiaoou Tang

Knowledge Distillation (KD) aims at improving the performance of a low-capacity student model by inheriting knowledge from a high-capacity teacher model.

Face Recognition Knowledge Distillation +3

Paper
Code

Discover and Learn New Objects from Documentaries

1 code implementation • CVPR 2017 • Kai Chen, Hang Song, Chen Change Loy, Dahua Lin

Despite the remarkable progress in recent years, detecting objects in a new context remains a challenging task.

Object Weakly-supervised Learning

Paper
Code

Disentangling Content and Style via Unsupervised Geometry Distillation

1 code implementation • ICLR Workshop DeepGenStruct 2019 • Wayne Wu, Kaidi Cao, Cheng Li, Chen Qian, Chen Change Loy

It is challenging to disentangle an object into two orthogonal spaces of content and style since each can influence the visual observation differently and unpredictably.

Disentanglement

Paper
Code

Deep Convolution Networks for Compression Artifacts Reduction

2 code implementations • 9 Aug 2016 • Ke Yu, Chao Dong, Chen Change Loy, Xiaoou Tang

Lossy compression introduces complex compression artifacts, particularly blocking artifacts, ringing effects and blurring.

Blocking Transfer Learning

Paper
Code

A Large-Scale Car Dataset for Fine-Grained Categorization and Verification

3 code implementations • CVPR 2015 • Linjie Yang, Ping Luo, Chen Change Loy, Xiaoou Tang

Updated on 24/09/2015: This update provides preliminary experiment results for fine-grained classification on the surveillance data of CompCars.

Ranked #5 on Fine-Grained Image Classification on CompCars

Fine-Grained Image Classification General Classification

Paper
Code

Rethinking CLIP-based Video Learners in Cross-Domain Open-Vocabulary Action Recognition

1 code implementation • 3 Mar 2024 • Kun-Yu Lin, Henghui Ding, Jiaming Zhou, Yi-Xing Peng, Zhilin Zhao, Chen Change Loy, Wei-Shi Zheng

To answer this, we establish a CROSS-domain Open-Vocabulary Action recognition benchmark named XOV-Action, and conduct a comprehensive evaluation of five state-of-the-art CLIP-based video learners under various types of domain gaps.

Open Vocabulary Action Recognition

Paper
Code

RGB-D Salient Object Detection with Cross-Modality Modulation and Selection

1 code implementation • ECCV 2020 • Chongyi Li, Runmin Cong, Yongri Piao, Qianqian Xu, Chen Change Loy

Second, we propose an adaptive feature selection (AFS) module to select saliency-related features and suppress the inferior ones.

Ranked #8 on RGB-D Salient Object Detection on NJU2K

feature selection object-detection +3

Paper
Code

Merge or Not? Learning to Group Faces via Imitation Learning

1 code implementation • 13 Jul 2017 • Yue He, Kaidi Cao, Cheng Li, Chen Change Loy

Given a large number of unlabeled face images, face grouping aims at clustering the images into individual identities present in the data.

Clustering Imitation Learning

Paper
Code

DST-Det: Simple Dynamic Self-Training for Open-Vocabulary Object Detection

1 code implementation • 2 Oct 2023 • Shilin Xu, Xiangtai Li, Size Wu, Wenwei Zhang, Yunhai Tong, Chen Change Loy

We refer to this approach as the self-training strategy, which enhances recall and accuracy for novel classes without requiring extra annotations, datasets, and re-training.

Novel Object Detection Object +5

Paper
Code

CLIM: Contrastive Language-Image Mosaic for Region Representation

1 code implementation • 18 Dec 2023 • Size Wu, Wenwei Zhang, Lumin Xu, Sheng Jin, Wentao Liu, Chen Change Loy

Our experimental results demonstrate that CLIM improves different baseline open-vocabulary object detectors by a large margin on both OV-COCO and OV-LVIS benchmarks.

Ranked #6 on Open Vocabulary Object Detection on LVIS v1.0

Object object-detection +1

Paper
Code

ReconfigISP: Reconfigurable Camera Image Processing Pipeline

1 code implementation • ICCV 2021 • Ke Yu, Zexian Li, Yue Peng, Chen Change Loy, Jinwei Gu

Image Signal Processor (ISP) is a crucial component in digital cameras that transforms sensor signals into images for us to perceive and understand.

Image Restoration Neural Architecture Search +2

Paper
Code

StyleInV: A Temporal Style Modulated Inversion Network for Unconditional Video Generation

1 code implementation • ICCV 2023 • YuHan Wang, Liming Jiang, Chen Change Loy

In this paper, we introduce a novel motion generator design that uses a learning-based inversion network for GAN.

Style Transfer Unconditional Video Generation

Paper
Code

Computation-Efficient Knowledge Distillation via Uncertainty-Aware Mixup

1 code implementation • 17 Dec 2020 • Guodong Xu, Ziwei Liu, Chen Change Loy

Our goal is to achieve a performance comparable to conventional knowledge distillation with a lower computation cost during training.

Informativeness Knowledge Distillation +2

Paper
Code

Correlational Image Modeling for Self-Supervised Visual Pre-Training

1 code implementation • CVPR 2023 • Wei Li, Jiahao Xie, Chen Change Loy

We introduce Correlational Image Modeling (CIM), a novel and surprisingly effective approach to self-supervised visual pre-training.

Paper
Code

Mind the Gap in Distilling StyleGANs

1 code implementation • 18 Aug 2022 • Guodong Xu, Yuenan Hou, Ziwei Liu, Chen Change Loy

To further enhance the semantic consistency between the teacher and student model, we present a latent-direction-based distillation loss that preserves the semantic relations in latent space.

Knowledge Distillation

Paper
Code

FASA: Feature Augmentation and Sampling Adaptation for Long-Tailed Instance Segmentation

1 code implementation • ICCV 2021 • Yuhang Zang, Chen Huang, Chen Change Loy

We propose a simple yet effective method, Feature Augmentation and Sampling Adaptation (FASA), that addresses the data scarcity issue by augmenting the feature space especially for rare classes.

Instance Segmentation Segmentation +2

Paper
Code

Dense Siamese Network for Dense Unsupervised Learning

1 code implementation • 21 Mar 2022 • Wenwei Zhang, Jiangmiao Pang, Kai Chen, Chen Change Loy

It also extracts a batch of region embeddings that correspond to some sub-regions in the overlapped area to be contrasted for region consistency.

Ranked #2 on Unsupervised Semantic Segmentation on COCO-All (mIoU metric)

Self-Supervised Learning Unsupervised Semantic Segmentation

Paper
Code

Transformer with Implicit Edges for Particle-based Physics Simulation

1 code implementation • 22 Jul 2022 • Yidi Shao, Chen Change Loy, Bo Dai

Consequently, in this paper we propose a novel Transformer-based method, dubbed as Transformer with Implicit Edges (TIE), to capture the rich semantics of particle interactions in an edge-free manner.

Paper
Code

Quantifying Facial Age by Posterior of Age Comparisons

1 code implementation • 31 Aug 2017 • Yunxuan Zhang, Li Liu, Cheng Li, Chen Change Loy

We introduce a novel approach for annotating large quantity of in-the-wild facial images with high-quality posterior age distribution as labels.

Ranked #7 on Age Estimation on MORPH Album2 (using extra training data)

Age And Gender Classification Age Estimation

Paper
Code

Siamese DETR

1 code implementation • CVPR 2023 • Zeren Chen, Gengshi Huang, Wei Li, Jianing Teng, Kun Wang, Jing Shao, Chen Change Loy, Lu Sheng

In this work, we present Siamese DETR, a Siamese self-supervised pretraining approach for the Transformer architecture in DETR.

MULTI-VIEW LEARNING Representation Learning

Paper
Code

Temporally Consistent Video Colorization with Deep Feature Propagation and Self-regularization Learning

1 code implementation • 9 Oct 2021 • Yihao Liu, Hengyuan Zhao, Kelvin C. K. Chan, Xintao Wang, Chen Change Loy, Yu Qiao, Chao Dong

We address this problem from a new perspective, by jointly considering colorization and temporal consistency in a unified framework.

Colorization Image Colorization

Paper
Code

MIPI 2022 Challenge on Quad-Bayer Re-mosaic: Dataset and Report

1 code implementation • 15 Sep 2022 • Qingyu Yang, Guang Yang, Jun Jiang, Chongyi Li, Ruicheng Feng, Shangchen Zhou, Wenxiu Sun, Qingpeng Zhu, Chen Change Loy, Jinwei Gu

A detailed description of all models developed in this challenge is provided in this paper.

SSIM

Paper
Code

MIPI 2022 Challenge on RGB+ToF Depth Completion: Dataset and Report

1 code implementation • 15 Sep 2022 • Wenxiu Sun, Qingpeng Zhu, Chongyi Li, Ruicheng Feng, Shangchen Zhou, Jun Jiang, Qingyu Yang, Chen Change Loy, Jinwei Gu

A detailed description of all models developed in this challenge is provided in this paper.

Depth Completion

Paper
Code

MIPI 2022 Challenge on Under-Display Camera Image Restoration: Methods and Results

1 code implementation • 15 Sep 2022 • Ruicheng Feng, Chongyi Li, Shangchen Zhou, Wenxiu Sun, Qingpeng Zhu, Jun Jiang, Qingyu Yang, Chen Change Loy, Jinwei Gu

In this paper, we summarize and review the Under-Display Camera (UDC) Image Restoration track on MIPI 2022.

Image Restoration

Paper
Code

MIPI 2022 Challenge on RGBW Sensor Fusion: Dataset and Report

1 code implementation • 15 Sep 2022 • Qingyu Yang, Guang Yang, Jun Jiang, Chongyi Li, Ruicheng Feng, Shangchen Zhou, Wenxiu Sun, Qingpeng Zhu, Chen Change Loy, Jinwei Gu

A detailed description of all models developed in this challenge is provided in this paper.

Sensor Fusion SSIM

Paper
Code

MIPI 2022 Challenge on RGBW Sensor Re-mosaic: Dataset and Report

1 code implementation • 15 Sep 2022 • Qingyu Yang, Guang Yang, Jun Jiang, Chongyi Li, Ruicheng Feng, Shangchen Zhou, Wenxiu Sun, Qingpeng Zhu, Chen Change Loy, Jinwei Gu

A detailed description of all models developed in this challenge is provided in this paper.

SSIM

Paper
Code

Path-Restore: Learning Network Path Selection for Image Restoration

1 code implementation • 23 Apr 2019 • Ke Yu, Xintao Wang, Chao Dong, Xiaoou Tang, Chen Change Loy

To leverage this, we propose Path-Restore, a multi-path CNN with a pathfinder that can dynamically select an appropriate route for each image region.

Denoising Image Restoration +1

Paper
Code

Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation

2 code implementations • ICCV 2023 • Jianzong Wu, Xiangtai Li, Henghui Ding, Xia Li, Guangliang Cheng, Yunhai Tong, Chen Change Loy

Experiments on the COCO dataset with two settings: Open Vocabulary Instance Segmentation (OVIS) and Open Set Panoptic Segmentation (OSPS) demonstrate the superiority of the CGG.

Caption Generation Instance Segmentation +2

Paper
Code

Compression Artifacts Reduction by a Deep Convolutional Network

4 code implementations • ICCV 2015 • Chao Dong, Yubin Deng, Chen Change Loy, Xiaoou Tang

Lossy compression introduces complex compression artifacts, particularly the blocking artifacts, ringing effects and blurring.

Ranked #4 on JPEG Artifact Correction on ICB (Quality 20 Grayscale)

Blocking Denoising +2

Paper
Code

Generating Aligned Pseudo-Supervision from Non-Aligned Data for Image Restoration in Under-Display Camera

1 code implementation • CVPR 2023 • Ruicheng Feng, Chongyi Li, Huaijin Chen, Shuai Li, Jinwei Gu, Chen Change Loy

Due to the difficulty in collecting large-scale and perfectly aligned paired training data for Under-Display Camera (UDC) image restoration, previous methods resort to monitor-based image systems or simulation-based methods, sacrificing the realness of the data and introducing domain gaps.

Image Restoration

Paper
Code

Aesthetic-Driven Image Enhancement by Adversarial Learning

1 code implementation • 17 Jul 2017 • Yubin Deng, Chen Change Loy, Xiaoou Tang

We introduce EnhanceGAN, an adversarial learning based model that performs automatic image enhancement.

Image Cropping Image Enhancement

Paper
Code

DeformToon3D: Deformable 3D Toonification from Neural Radiance Fields

1 code implementation • 8 Sep 2023 • Junzhe Zhang, Yushi Lan, Shuai Yang, Fangzhou Hong, Quan Wang, Chai Kiat Yeo, Ziwei Liu, Chen Change Loy

In this paper, we address the challenging problem of 3D toonification, which involves transferring the style of an artistic domain onto a target 3D face with stylized geometry and texture.

Decoder

Paper
Code

Robust and Fast Decoding of High-Capacity Color QR Codes for Mobile Applications

1 code implementation • 21 Apr 2017 • Zhibo Yang, Huanle Xu, Jianyuan Deng, Chen Change Loy, Wing Cheong Lau

Particularly, we further discover a new type of chromatic distortion in high-density color QR codes, cross-module color interference, caused by the high density which also makes the geometric distortion correction more challenging.

Paper
Code

Learning to Steer by Mimicking Features from Heterogeneous Auxiliary Networks

2 code implementations • 7 Nov 2018 • Yuenan Hou, Zheng Ma, Chunxiao Liu, Chen Change Loy

In this paper, we considerably improve the accuracy and robustness of predictions through heterogeneous auxiliary networks feature mimicking, a new and effective training method that provides us with much richer contextual signals apart from steering direction.

Ranked #1 on Steering Control on BDD100K val

Image Segmentation Multi-Task Learning +3

Paper
Code

Monocular 3D Object Reconstruction with GAN Inversion

1 code implementation • 20 Jul 2022 • Junzhe Zhang, Daxuan Ren, Zhongang Cai, Chai Kiat Yeo, Bo Dai, Chen Change Loy

Reconstruction is achieved by searching for a latent space in the 3D GAN that best resembles the target mesh in accordance with the single view observation.

3D Object Reconstruction Object

Paper
Code

Unified Vision and Language Prompt Learning

1 code implementation • 13 Oct 2022 • Yuhang Zang, Wei Li, Kaiyang Zhou, Chen Huang, Chen Change Loy

Prompt tuning, a parameter- and data-efficient transfer learning paradigm that tunes only a small number of parameters in a model's input space, has become a trend in the vision community since the emergence of large vision-language models like CLIP.

Domain Generalization Few-Shot Learning +2

Paper
Code

Panoptic Video Scene Graph Generation

3 code implementations • CVPR 2023 • Jingkang Yang, Wenxuan Peng, Xiangtai Li, Zujin Guo, Liangyu Chen, Bo Li, Zheng Ma, Kaiyang Zhou, Wayne Zhang, Chen Change Loy, Ziwei Liu

PVSG relates to the existing video scene graph generation (VidSGG) problem, which focuses on temporal interactions between humans and objects grounded with bounding boxes in videos.

Graph Generation Panoptic Scene Graph Generation +5

Paper
Code

Explore In-Context Learning for 3D Point Cloud Understanding

2 code implementations • NeurIPS 2023 • Zhongbin Fang, Xiangtai Li, Xia Li, Joachim M. Buhmann, Chen Change Loy, Mengyuan Liu

With the rise of large-scale models trained on broad data, in-context learning has become a new learning paradigm that has demonstrated significant potential in natural language processing and computer vision tasks.

In-Context Learning

Paper
Code

Point-In-Context: Understanding Point Cloud via In-Context Learning

1 code implementation • 18 Apr 2024 • Mengyuan Liu, Zhongbin Fang, Xia Li, Joachim M. Buhmann, Xiangtai Li, Chen Change Loy

With the emergence of large-scale models trained on diverse datasets, in-context learning has emerged as a promising paradigm for multitasking, notably in natural language processing and image processing.

In-Context Learning

Paper
Code

Unsupervised Object-Level Representation Learning from Scene Images

1 code implementation • NeurIPS 2021 • Jiahao Xie, Xiaohang Zhan, Ziwei Liu, Yew Soon Ong, Chen Change Loy

Extensive experiments on COCO show that ORL significantly improves the performance of self-supervised learning on scene images, even surpassing supervised ImageNet pre-training on several downstream tasks.

Object Representation Learning +2

Paper
Code

Masked Frequency Modeling for Self-Supervised Visual Pre-Training

3 code implementations • 15 Jun 2022 • Jiahao Xie, Wei Li, Xiaohang Zhan, Ziwei Liu, Yew Soon Ong, Chen Change Loy

We present Masked Frequency Modeling (MFM), a unified frequency-domain-based approach for self-supervised pre-training of visual models.

Image Classification Image Restoration +2

Paper
Code

Nighttime Smartphone Reflective Flare Removal Using Optical Center Symmetry Prior

1 code implementation • CVPR 2023 • Yuekun Dai, Yihang Luo, Shangchen Zhou, Chongyi Li, Chen Change Loy

With the dataset, neural networks can be trained to remove the reflective flares effectively.

Flare Removal

Paper
Code

BRACE: The Breakdancing Competition Dataset for Dance Motion Synthesis

1 code implementation • 20 Jul 2022 • Davide Moltisanti, Jinyi Wu, Bo Dai, Chen Change Loy

Estimating human keypoints from these videos is difficult due to the complexity of the dance, as well as the multiple moving cameras recording setup.

Motion Synthesis Pose Estimation

Paper
Code

Delving Deep Into Hybrid Annotations for 3D Human Recovery in the Wild

1 code implementation • ICCV 2019 • Yu Rong, Ziwei Liu, Cheng Li, Kaidi Cao, Chen Change Loy

Specifically, we focus on the challenging task of in-the-wild 3D human recovery from single images when paired 3D annotations are not fully available.

Paper
Code

NTIRE 2021 Challenge on Quality Enhancement of Compressed Video: Methods and Results

1 code implementation • 21 Apr 2021 • Ren Yang, Radu Timofte, Jing Liu, Yi Xu, Xinjian Zhang, Minyi Zhao, Shuigeng Zhou, Kelvin C. K. Chan, Shangchen Zhou, Xiangyu Xu, Chen Change Loy, Xin Li, Fanglong Liu, He Zheng, Lielin Jiang, Qi Zhang, Dongliang He, Fu Li, Qingqing Dang, Yibin Huang, Matteo Maggioni, Zhongqian Fu, Shuai Xiao, Cheng Li, Thomas Tanay, Fenglong Song, Wentao Chao, Qiang Guo, Yan Liu, Jiang Li, Xiaochao Qu, Dewang Hou, Jiayu Yang, Lyn Jiang, Di You, Zhenyu Zhang, Chong Mou, Iaroslav Koshelev, Pavel Ostyakov, Andrey Somov, Jia Hao, Xueyi Zou, Shijie Zhao, Xiaopeng Sun, Yiting Liao, Yuanzhi Zhang, Qing Wang, Gen Zhan, Mengxi Guo, Junlin Li, Ming Lu, Zhan Ma, Pablo Navarrete Michelini, Hai Wang, Yiyun Chen, Jingyu Guo, Liliang Zhang, Wenming Yang, Sijung Kim, Syehoon Oh, Yucong Wang, Minjie Cai, Wei Hao, Kangdi Shi, Liangyan Li, Jun Chen, Wei Gao, Wang Liu, XiaoYu Zhang, Linjie Zhou, Sixin Lin, Ru Wang

This paper reviews the first NTIRE challenge on quality enhancement of compressed video, with a focus on the proposed methods and results.

Paper
Code

Position-Guided Point Cloud Panoptic Segmentation Transformer

1 code implementation • 23 Mar 2023 • Zeqi Xiao, Wenwei Zhang, Tai Wang, Chen Change Loy, Dahua Lin, Jiangmiao Pang

DEtection TRansformer (DETR) started a trend that uses a group of learnable queries for unified visual perception.

Ranked #1 on Panoptic Segmentation on SemanticKITTI

Instance Segmentation Panoptic Segmentation +3

Paper
Code

AnimeRun: 2D Animation Visual Correspondence from Open Source 3D Movies

1 code implementation • 10 Nov 2022 • Li SiYao, Yuhang Li, Bo Li, Chao Dong, Ziwei Liu, Chen Change Loy

Existing correspondence datasets for two-dimensional (2D) cartoon suffer from simple frame composition and monotonic movements, making them insufficient to simulate real animations.

Optical Flow Estimation

Paper
Code

When StyleGAN Meets Stable Diffusion: a $\mathscr{W}_+$ Adapter for Personalized Image Generation

1 code implementation • 29 Nov 2023 • Xiaoming Li, Xinyu Hou, Chen Change Loy

Text-to-image diffusion models have remarkably excelled in producing diverse, high-quality, and photo-realistic images.

Attribute Disentanglement +1

Paper
Code

Everything's Talkin': Pareidolia Face Reenactment

1 code implementation • 7 Apr 2021 • Linsen Song, Wayne Wu, Chaoyou Fu, Chen Qian, Chen Change Loy, Ran He

We present a new application direction named Pareidolia Face Reenactment, which is defined as animating a static illusory face to move in tandem with a human face in the video.

Face Reenactment Texture Synthesis

Paper
Code

Removing Diffraction Image Artifacts in Under-Display Camera via Dynamic Skip Connection Network

1 code implementation • CVPR 2021 • Ruicheng Feng, Chongyi Li, Huaijin Chen, Shuai Li, Chen Change Loy, Jinwei Gu

Recent development of Under-Display Camera (UDC) systems provides a true bezel-less and notch-free viewing experience on smartphones (and TV, laptops, tablets), while allowing images to be captured from the selfie camera embedded underneath.

Image Restoration

Paper
Code

Tube-Link: A Flexible Cross Tube Framework for Universal Video Segmentation

2 code implementations • ICCV 2023 • Xiangtai Li, Haobo Yuan, Wenwei Zhang, Guangliang Cheng, Jiangmiao Pang, Chen Change Loy

Our framework is a near-online approach that takes a short subclip as input and outputs the corresponding spatial-temporal tube masks.

Ranked #3 on Video Semantic Segmentation on VSPW

Contrastive Learning Segmentation +4

105

Paper
Code

MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation

1 code implementation • 22 Sep 2023 • Jiahao Xie, Wei Li, Xiangtai Li, Ziwei Liu, Yew Soon Ong, Chen Change Loy

We present MosaicFusion, a simple yet effective diffusion-based data augmentation approach for large vocabulary instance segmentation.

Data Augmentation Instance Segmentation +1

105

Paper
Code

Not All Pixels Are Equal: Difficulty-aware Semantic Segmentation via Deep Layer Cascade

1 code implementation • CVPR 2017 • Xiaoxiao Li, Ziwei Liu, Ping Luo, Chen Change Loy, Xiaoou Tang

Third, in comparison to MC, LC is an end-to-end trainable framework, allowing joint learning of all sub-models.

Ranked #22 on Semantic Segmentation on PASCAL VOC 2012 test

Semantic Segmentation

108

Paper
Code

Flare7K: A Phenomenological Nighttime Flare Removal Dataset

1 code implementation • 12 Oct 2022 • Yuekun Dai, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Chen Change Loy

In this paper, we introduce, Flare7K, the first nighttime flare removal dataset, which is generated based on the observation and statistics of real-world nighttime lens flares.

Ranked #2 on Flare Removal on Flare7K

Flare Removal

110

Paper
Code

Flare7K++: Mixing Synthetic and Real Datasets for Nighttime Flare Removal and Beyond

1 code implementation • 7 Jun 2023 • Yuekun Dai, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Yihang Luo, Chen Change Loy

To address this issue, we additionally provide the annotations of light sources in Flare7K++ and propose a new end-to-end pipeline to preserve the light source while removing lens flares.

Flare Removal

110

Paper
Code

Inter-Region Affinity Distillation for Road Marking Segmentation

1 code implementation • CVPR 2020 • Yuenan Hou, Zheng Ma, Chunxiao Liu, Tak-Wai Hui, Chen Change Loy

We study the problem of distilling knowledge from a large deep teacher network to a much smaller student network for the task of road marking segmentation.

Ranked #1 on Semantic Segmentation on ApolloScape

Knowledge Distillation Lane Detection +1

112

Paper
Code

StyleLight: HDR Panorama Generation for Lighting Estimation and Editing

1 code implementation • 29 Jul 2022 • Guangcong Wang, Yinuo Yang, Chen Change Loy, Ziwei Liu

To tackle this problem, we propose a coupled dual-StyleGAN panorama synthesis network (StyleLight) that integrates LDR and HDR panorama synthesis into a unified framework.

Lighting Estimation

113

Paper
Code

PGDiff: Guiding Diffusion Models for Versatile Face Restoration via Partial Guidance

1 code implementation • NeurIPS 2023 • Peiqing Yang, Shangchen Zhou, Qingyi Tao, Chen Change Loy

When combined with a diffusion prior, this partial guidance can deliver appealing results across a range of restoration tasks.

118

Paper
Code

Delving into High-Quality Synthetic Face Occlusion Segmentation Datasets

3 code implementations • 12 May 2022 • Kenny T. R. Voo, Liming Jiang, Chen Change Loy

This paper performs comprehensive analysis on datasets for occlusion-aware face segmentation, a task that is crucial for many downstream applications.

Segmentation Synthetic Data Generation +1

122

Paper
Code

CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction

1 code implementation • 2 Oct 2023 • Size Wu, Wenwei Zhang, Lumin Xu, Sheng Jin, Xiangtai Li, Wentao Liu, Chen Change Loy

However, when transferring the vision-language alignment of CLIP from global image representation to local region representation for the open-vocabulary dense prediction tasks, CLIP ViTs suffer from the domain shift from full images to local image regions.

Ranked #3 on Open Vocabulary Semantic Segmentation on PASCAL Context-59

Image Classification Image Segmentation +7

135

Paper
Code

Self-Supervised Learning via Conditional Motion Propagation

1 code implementation • CVPR 2019 • Xiaohang Zhan, Xingang Pan, Ziwei Liu, Dahua Lin, Chen Change Loy

Instead of explicitly modeling the motion probabilities, we design the pretext task as a conditional motion propagation problem.

Human Parsing Instance Segmentation +2

137

Paper
Code

A Shading-Guided Generative Implicit Model for Shape-Accurate 3D-Aware Image Synthesis

1 code implementation • NeurIPS 2021 • Xingang Pan, Xudong Xu, Chen Change Loy, Christian Theobalt, Bo Dai

Motivated by the observation that a 3D object should look realistic from multiple viewpoints, these methods introduce a multi-view constraint as regularization to learn valid 3D radiance fields from 2D images.

3D-Aware Image Synthesis 3D Shape Reconstruction +2

146

Paper
Code

Dense Intrinsic Appearance Flow for Human Pose Transfer

1 code implementation • CVPR 2019 • Yining Li, Chen Huang, Chen Change Loy

Unlike existing methods, we propose to estimate dense and intrinsic 3D appearance flow to better guide the transfer of pixels between poses.

Pose Transfer

147

Paper
Code

Video K-Net: A Simple, Strong, and Unified Baseline for Video Segmentation

1 code implementation • CVPR 2022 • Xiangtai Li, Wenwei Zhang, Jiangmiao Pang, Kai Chen, Guangliang Cheng, Yunhai Tong, Chen Change Loy

We hope this simple, yet effective method can serve as a new, flexible baseline in unified video segmentation design.

Ranked #1 on Video Panoptic Segmentation on KITTI-STEP (using extra training data)

Image Segmentation Instance Segmentation +5

150

Paper
Code

Image Aesthetic Assessment: An Experimental Survey

1 code implementation • 4 Oct 2016 • Yubin Deng, Chen Change Loy, Xiaoou Tang

This survey aims at reviewing recent computer vision techniques used in the assessment of image aesthetic quality.

Binary Classification

154

Paper
Code

Face alignment by coarse-to-fine shape searching

1 code implementation • 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2015 • Shizhan Zhu, Cheng Li, Chen Change Loy, Xiaoou Tang

We present a novel face alignment framework based on coarse-to-fine shape searching.

Ranked #20 on Face Alignment on AFLW-19

Face Alignment regression

156

Paper
Code

Contextual Object Detection with Multimodal Large Language Models

1 code implementation • 29 May 2023 • Yuhang Zang, Wei Li, Jun Han, Kaiyang Zhou, Chen Change Loy

Moreover, we present ContextDET, a unified multimodal model that is capable of end-to-end differentiable modeling of visual-language contexts, so as to locate, identify, and associate visual objects with language inputs for human-AI interaction.

Cloze Test Decoder +7

159

Paper
Code

Aligning Bag of Regions for Open-Vocabulary Object Detection

1 code implementation • CVPR 2023 • Size Wu, Wenwei Zhang, Sheng Jin, Wentao Liu, Chen Change Loy

The embeddings of regions in a bag are treated as embeddings of words in a sentence, and they are sent to the text encoder of a VLM to obtain the bag-of-regions embedding, which is learned to be aligned to the corresponding features extracted by a frozen VLM.

Ranked #7 on Open Vocabulary Object Detection on MSCOCO (using extra training data)

Object object-detection +2

161

Paper
Code

PERF: Panoramic Neural Radiance Field from a Single Panorama

1 code implementation • 25 Oct 2023 • Guangcong Wang, Peng Wang, Zhaoxi Chen, Wenping Wang, Chen Change Loy, Ziwei Liu

In this paper, we present PERF, a 360-degree novel view synthesis framework that trains a panoramic neural radiance field from a single panorama.

Novel View Synthesis Text to 3D

161

Paper
Code

Non-Local Recurrent Network for Image Restoration

1 code implementation • NeurIPS 2018 • Ding Liu, Bihan Wen, Yuchen Fan, Chen Change Loy, Thomas S. Huang

The main contributions of this work are: (1) Unlike existing methods that measure self-similarity in an isolated manner, the proposed non-local module can be flexibly integrated into existing deep networks for end-to-end training to capture deep feature correlation between each location and its neighborhood.

Ranked #1 on Grayscale Image Denoising on Set12 sigma30

Feature Correlation Image Denoising +2

167

Paper
Code

Learning Generative Structure Prior for Blind Text Image Super-resolution

1 code implementation • CVPR 2023 • Xiaoming Li, WangMeng Zuo, Chen Change Loy

To restrict the generative space of StyleGAN so that it obeys the structure of characters yet remains flexible in handling different font styles, we store the discrete features for each character in a codebook.

Image Super-Resolution

168

Paper
Code

TransEditor: Transformer-Based Dual-Space GAN for Highly Controllable Facial Editing

1 code implementation • CVPR 2022 • Yanbo Xu, Yueqin Yin, Liming Jiang, Qianyi Wu, Chengyao Zheng, Chen Change Loy, Bo Dai, Wayne Wu

In this study, we highlight the importance of interaction in a dual-space GAN for more controllable editing.

Attribute Disentanglement +1

173

Paper
Code

Learning Inclusion Matching for Animation Paint Bucket Colorization

1 code implementation • 27 Mar 2024 • Yuekun Dai, Shangchen Zhou, Qinyue Li, Chongyi Li, Chen Change Loy

In this work, we introduce a new learning-based inclusion matching pipeline, which directs the network to comprehend the inclusion relationships between segments rather than relying solely on direct visual correspondences.

Colorization

181

Paper
Code

Unsupervised Image-to-Image Translation with Generative Prior

1 code implementation • CVPR 2022 • Shuai Yang, Liming Jiang, Ziwei Liu, Chen Change Loy

In this work, we present a novel framework, Generative Prior-guided UNsupervised Image-to-image Translation (GP-UNIT), to improve the overall quality and applicability of the translation algorithm.

Translation Unsupervised Image-To-Image Translation

183

Paper
Code

GP-UNIT: Generative Prior for Versatile Unsupervised Image-to-Image Translation

1 code implementation • 7 Jun 2023 • Shuai Yang, Liming Jiang, Ziwei Liu, Chen Change Loy

In this paper, we introduce a novel versatile framework, Generative Prior-guided UNsupervised Image-to-image Translation (GP-UNIT), that improves the quality, applicability and controllability of the existing translation models.

Translation Unsupervised Image-To-Image Translation +1

183

Paper
Code

One-shot Face Reenactment

2 code implementations • 5 Aug 2019 • Yunxuan Zhang, Siwei Zhang, Yue He, Cheng Li, Chen Change Loy, Ziwei Liu

However, in real-world scenario end-users often only have one target face at hand, rendering existing methods inapplicable.

Decoder Face Reconstruction +1

190

Paper
Code

LEDNet: Joint Low-light Enhancement and Deblurring in the Dark

1 code implementation • 7 Feb 2022 • Shangchen Zhou, Chongyi Li, Chen Change Loy

With the pipeline, we present the first large-scale dataset for joint low-light enhancement and deblurring.

Ranked #2 on Low-Light Image Enhancement on Sony-Total-Dark

Deblurring Low-light Image Deblurring and Enhancement +1

191

Paper
Code

Robust Reference-based Super-Resolution via C2-Matching

1 code implementation • CVPR 2021 • Yuming Jiang, Kelvin C. K. Chan, Xintao Wang, Chen Change Loy, Ziwei Liu

However, performing local transfer is difficult because of two gaps between input and reference images: the transformation gap (e. g. scale and rotation) and the resolution gap (e. g. HR and LR).

Reference-based Super-Resolution

193

Paper
Code

Reference-based Image and Video Super-Resolution via C2-Matching

1 code implementation • 19 Dec 2022 • Yuming Jiang, Kelvin C. K. Chan, Xintao Wang, Chen Change Loy, Ziwei Liu

To tackle these challenges, we propose C2-Matching in this work, which performs explicit robust matching crossing transformation and resolution.

Image Super-Resolution Reference-based Super-Resolution +2

193

Paper
Code

Open-Vocabulary DETR with Conditional Matching

1 code implementation • 22 Mar 2022 • Yuhang Zang, Wei Li, Kaiyang Zhou, Chen Huang, Chen Change Loy

To this end, we propose a novel open-vocabulary detector based on DETR -- hence the name OV-DETR -- which, once trained, can detect any object given its class name or an exemplar image.

Ranked #21 on Open Vocabulary Object Detection on MSCOCO

Language Modelling object-detection +1

194

Paper
Code

ReenactGAN: Learning to Reenact Faces via Boundary Transfer

1 code implementation • ECCV 2018 • Wayne Wu, Yunxuan Zhang, Cheng Li, Chen Qian, Chen Change Loy

A transformer is subsequently used to adapt the boundary of source face to the boundary of target face.

Decoder Face Reenactment +2

195

Paper
Code

DNA-Rendering: A Diverse Neural Actor Repository for High-Fidelity Human-centric Rendering

1 code implementation • ICCV 2023 • Wei Cheng, Ruixiang Chen, Wanqi Yin, Siming Fan, Keyu Chen, Honglin He, Huiwen Luo, Zhongang Cai, Jingbo Wang, Yang Gao, Zhengming Yu, Zhengyu Lin, Daxuan Ren, Lei Yang, Ziwei Liu, Chen Change Loy, Chen Qian, Wayne Wu, Dahua Lin, Bo Dai, Kwan-Yee Lin

Realistic human-centric rendering plays a key role in both computer vision and computer graphics.

Camera Calibration Novel View Synthesis

201

Paper
Code

Crafting a Toolchain for Image Restoration by Deep Reinforcement Learning

2 code implementations • CVPR 2018 • Ke Yu, Chao Dong, Liang Lin, Chen Change Loy

We investigate a novel approach for image restoration by reinforcement learning.

Image Restoration reinforcement-learning +1

208

Paper
Code

Accelerating the Super-Resolution Convolutional Neural Network

14 code implementations • 1 Aug 2016 • Chao Dong, Chen Change Loy, Xiaoou Tang

As a successful deep model applied in image super-resolution (SR), the Super-Resolution Convolutional Neural Network (SRCNN) has demonstrated superior performance to the previous hand-crafted models either in speed and restoration quality.

Ranked #6 on Image Super-Resolution on FFHQ 256 x 256 - 4x upscaling

Image Super-Resolution

213

Paper
Code

3D Human Texture Estimation from a Single Image with Transformers

1 code implementation • ICCV 2021 • Xiangyu Xu, Chen Change Loy

We propose a Transformer-based framework for 3D human texture estimation from a single image.

Garment Reconstruction

216

Paper
Code

RenderMe-360: A Large Digital Asset Library and Benchmarks Towards High-fidelity Head Avatars

1 code implementation • NeurIPS 2023 • Dongwei Pan, Long Zhuo, Jingtan Piao, Huiwen Luo, Wei Cheng, Yuxin Wang, Siming Fan, Shengqi Liu, Lei Yang, Bo Dai, Ziwei Liu, Chen Change Loy, Chen Qian, Wayne Wu, Dahua Lin, Kwan-Yee Lin

It is a large-scale digital library for head avatars with three key attributes: 1) High Fidelity: all subjects are captured by 60 synchronized, high-resolution 2K cameras in 360 degrees.

2k Image Matting +2

219

Paper
Code

LiteFlowNet3: Resolving Correspondence Ambiguity for More Accurate Optical Flow Estimation

1 code implementation • ECCV 2020 • Tak-Wai Hui, Chen Change Loy

The keys to success lie in the use of cost volume and coarse-to-fine flow inference.

Ranked #4 on Optical Flow Estimation on KITTI 2012

Optical Flow Estimation Scene Flow Estimation

239

Paper
Code

Deceive D: Adaptive Pseudo Augmentation for GAN Training with Limited Data

2 code implementations • NeurIPS 2021 • Liming Jiang, Bo Dai, Wayne Wu, Chen Change Loy

Generative adversarial networks (GANs) typically require ample data for training in order to synthesize high-fidelity images.

251

Paper
Code

Robust Multi-Modality Multi-Object Tracking

1 code implementation • ICCV 2019 • Wenwei Zhang, Hui Zhou, Shuyang Sun, Zhe Wang, Jianping Shi, Chen Change Loy

Multi-sensor perception is crucial to ensure the reliability and accuracy in autonomous driving system, while multi-object tracking (MOT) improves that by tracing sequential movement of dynamic objects.

Ranked #10 on Multiple Object Tracking on KITTI Tracking test

Autonomous Driving Multi-Object Tracking +2

252

Paper
Code

Scenimefy: Learning to Craft Anime Scene via Semi-Supervised Image-to-Image Translation

1 code implementation • ICCV 2023 • Yuxin Jiang, Liming Jiang, Shuai Yang, Chen Change Loy

The challenges of this task lie in the complexity of the scenes, the unique features of anime style, and the lack of high-quality datasets to bridge the domain gap.

Image-to-Image Translation

252

Paper
Code

On-Device Domain Generalization

2 code implementations • 15 Sep 2022 • Kaiyang Zhou, Yuanhan Zhang, Yuhang Zang, Jingkang Yang, Chen Change Loy, Ziwei Liu

Another interesting observation is that the teacher-student gap on out-of-distribution data is bigger than that on in-distribution data, which highlights the capacity mismatch issue as well as the shortcoming of KD.

Data Augmentation Domain Generalization +2

255

Paper
Code

Exploring CLIP for Assessing the Look and Feel of Images

1 code implementation • 25 Jul 2022 • Jianyi Wang, Kelvin C. K. Chan, Chen Change Loy

Measuring the perception of visual content is a long-standing problem in computer vision.

Ranked #9 on Video Quality Assessment on MSU SR-QA Dataset

Image Quality Assessment Video Quality Assessment

263

Paper
Code

TSIT: A Simple and Versatile Framework for Image-to-Image Translation

1 code implementation • ECCV 2020 • Liming Jiang, Changxu Zhang, Mingyang Huang, Chunxiao Liu, Jianping Shi, Chen Change Loy

We introduce a simple and versatile framework for image-to-image translation.

Image-to-Image Translation Translation

272

Paper
Code

Audio-Driven Emotional Video Portraits

1 code implementation • CVPR 2021 • Xinya Ji, Hang Zhou, Kaisiyuan Wang, Wayne Wu, Chen Change Loy, Xun Cao, Feng Xu

In this work, we present Emotional Video Portraits (EVP), a system for synthesizing high-quality video portraits with vivid emotional dynamics driven by audios.

Disentanglement Face Generation

284

Paper
Code

Real or Not Real, that is the Question

2 code implementations • ICLR 2020 • Yuanbo Xiangli, Yubin Deng, Bo Dai, Chen Change Loy, Dahua Lin

While generative adversarial networks (GAN) have been widely adopted in various topics, in this paper we generalize the standard GAN to a new perspective by treating realness as a random variable that can be estimated from multiple angles.

286

Paper
Code

Video Object Segmentation with Re-identification

3 code implementations • 1 Aug 2017 • Xiaoxiao Li, Yuankai Qi, Zhe Wang, Kai Chen, Ziwei Liu, Jianping Shi, Ping Luo, Xiaoou Tang, Chen Change Loy

Specifically, our Video Object Segmentation with Re-identification (VS-ReID) model includes a mask propagation module and a ReID module.

Object Segmentation +4

289

Paper
Code

Deep Geometrized Cartoon Line Inbetweening

1 code implementation • ICCV 2023 • Li SiYao, Tianpei Gu, Weiye Xiao, Henghui Ding, Ziwei Liu, Chen Change Loy

To preserve the precision and detail of the line drawings, we propose a new approach, AnimeInbet, which geometrizes raster line drawings into graphs of endpoints and reframes the inbetweening task as a graph fusion problem with vertex repositioning.

289

Paper
Code

Cross-Scale Internal Graph Neural Network for Image Super-Resolution

1 code implementation • NeurIPS 2020 • Shangchen Zhou, Jiawei Zhang, WangMeng Zuo, Chen Change Loy

Specifically, we dynamically construct a cross-scale graph by searching k-nearest neighboring patches in the downsampled LR image for each query patch in the LR image.

Image Restoration Image Super-Resolution

304

Paper
Code

Talk-to-Edit: Fine-Grained Facial Editing via Dialog

1 code implementation • ICCV 2021 • Yuming Jiang, Ziqi Huang, Xingang Pan, Chen Change Loy, Ziwei Liu

In this work, we propose Talk-to-Edit, an interactive facial editing framework that performs fine-grained attribute manipulation through dialog between the user and the system.

Ranked #1 on Fine-Grained Facial Editing on CelebA-Dialog

Attribute Facial Editing +1

306

Paper
Code

Text2Performer: Text-Driven Human Video Generation

1 code implementation • ICCV 2023 • Yuming Jiang, Shuai Yang, Tong Liang Koh, Wayne Wu, Chen Change Loy, Ziwei Liu

In this work, we present Text2Performer to generate vivid human videos with articulated motions from texts.

Video Generation

308

Paper
Code

Learning to Enhance Low-Light Image via Zero-Reference Deep Curve Estimation

4 code implementations • 1 Mar 2021 • Chongyi Li, Chunle Guo, Chen Change Loy

This paper presents a novel method, Zero-Reference Deep Curve Estimation (Zero-DCE), which formulates light enhancement as a task of image-specific curve estimation with a deep network.

Face Detection Image Enhancement

311

Paper
Code

CelebV-HQ: A Large-Scale Video Facial Attributes Dataset

1 code implementation • 25 Jul 2022 • Hao Zhu, Wayne Wu, Wentao Zhu, Liming Jiang, Siwei Tang, Li Zhang, Ziwei Liu, Chen Change Loy

Large-scale datasets have played indispensable roles in the recent success of face generation/editing and significantly facilitated the advances of emerging research fields.

Ranked #1 on Unconditional Video Generation on CelebV-HQ

Attribute Face Generation +1

355

Paper
Code

Extract Free Dense Labels from CLIP

1 code implementation • 2 Dec 2021 • Chong Zhou, Chen Change Loy, Bo Dai

Contrastive Language-Image Pre-training (CLIP) has made a remarkable breakthrough in open-vocabulary zero-shot image recognition.

Ranked #3 on Unsupervised Semantic Segmentation with Language-image Pre-training on KITTI-STEP

Novel Concepts Open Vocabulary Panoptic Segmentation +5

361

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.