Search Results for author: Kelvin C. K. Chan

Found 32 papers, 19 papers with code

Consistent Subject Generation via Contrastive Instantiated Concepts

no code implementations31 Mar 2025 Lee Hsin-Ying, Kelvin C. K. Chan, Ming-Hsuan Yang

We introduce Contrastive Concept Instantiation (CoCoIns) to effectively synthesize consistent subjects across multiple independent creations.

Contrastive Learning

KITTEN: A Knowledge-Intensive Evaluation of Image Generation on Visual Entities

no code implementations15 Oct 2024 Hsin-Ping Huang, Xinyi Wang, Yonatan Bitton, Hagai Taitelbaum, Gaurav Singh Tomar, Ming-Wei Chang, Xuhui Jia, Kelvin C. K. Chan, Hexiang Hu, Yu-Chuan Su, Ming-Hsuan Yang

Using KITTEN, we conduct a systematic study on the fidelity of entities in text-to-image generation models, focusing on their ability to generate a wide range of real-world visual entities, such as landmark buildings, aircraft, plants, and animals.

Retrieval Text-to-Image Generation +1

A Simple Approach to Unifying Diffusion-based Conditional Generation

no code implementations15 Oct 2024 Xirui Li, Charles Herrmann, Kelvin C. K. Chan, Yinxiao Li, Deqing Sun, Chao Ma, Ming-Hsuan Yang

Recent progress in image generation has sparked research into controlling these models through condition signals, with various methods addressing specific challenges in conditional generation.

Image Generation

Re-boosting Self-Collaboration Parallel Prompt GAN for Unsupervised Image Restoration

2 code implementations17 Aug 2024 Xin Lin, Yuyan Zhou, Jingtong Yue, Chao Ren, Kelvin C. K. Chan, Lu Qi, Ming-Hsuan Yang

As SE increases computational complexity during inference, we propose a re-boosting module to the SC (Reb-SC) to improve the SC strategy further by incorporating SE into SC without increasing inference time.

Image Restoration Prompt Learning

Improving Subject-Driven Image Synthesis with Subject-Agnostic Guidance

no code implementations CVPR 2024 Kelvin C. K. Chan, Yang Zhao, Xuhui Jia, Ming-Hsuan Yang, Huisheng Wang

In subject-driven text-to-image synthesis, the synthesis process tends to be heavily influenced by the reference images provided by users, often overlooking crucial attributes detailed in the text prompt.

Image Generation

AdaIR: Exploiting Underlying Similarities of Image Restoration Tasks with Adapters

no code implementations17 Apr 2024 Hao-Wei Chen, Yu-Syuan Xu, Kelvin C. K. Chan, Hsien-Kai Kuo, Chun-Yi Lee, Ming-Hsuan Yang

Towards this goal, we propose AdaIR, a novel framework that enables low storage cost and efficient training without sacrificing performance.

Image Restoration

DreamInpainter: Text-Guided Subject-Driven Image Inpainting with Diffusion Models

no code implementations5 Dec 2023 Shaoan Xie, Yang Zhao, Zhisheng Xiao, Kelvin C. K. Chan, Yandong Li, Yanwu Xu, Kun Zhang, Tingbo Hou

Our extensive experiments demonstrate the superior performance of our method in terms of visual quality, identity preservation, and text control, showcasing its effectiveness in the context of text-guided subject-driven image inpainting.

Image Inpainting

Multi-task Image Restoration Guided By Robust DINO Features

no code implementations4 Dec 2023 Xin Lin, Jingtong Yue, Kelvin C. K. Chan, Lu Qi, Chao Ren, Jinshan Pan, Ming-Hsuan Yang

To guide the restoration model with the features of DINOv2, we develop a DINO-Restore adaption and fusion module to adjust the channel of fused features from PSF and then integrate them with the features from the restoration model.

Image Restoration

Effective Adapter for Face Recognition in the Wild

no code implementations4 Dec 2023 Yunhao Liu, Yu-Ju Tsai, Kelvin C. K. Chan, Xiangtai Li, Lu Qi, Ming-Hsuan Yang

Traditional heuristic approaches-either training models directly on these degraded images or their enhanced counterparts using face restoration techniques-have proven ineffective, primarily due to the degradation of facial features and the discrepancy in image domains.

Face Recognition

Dual Associated Encoder for Face Restoration

1 code implementation14 Aug 2023 Yu-Ju Tsai, Yu-Lun Liu, Lu Qi, Kelvin C. K. Chan, Ming-Hsuan Yang

Restoring facial details from low-quality (LQ) images has remained a challenging problem due to its ill-posedness induced by various degradations in the wild.

Blind Face Restoration

Exploiting Diffusion Prior for Real-World Image Super-Resolution

3 code implementations11 May 2023 Jianyi Wang, Zongsheng Yue, Shangchen Zhou, Kelvin C. K. Chan, Chen Change Loy

We present a novel approach to leverage prior knowledge encapsulated in pre-trained text-to-image diffusion models for blind super-resolution (SR).

Blind Super-Resolution Image Super-Resolution

Collaborative Diffusion for Multi-Modal Face Generation and Editing

1 code implementation CVPR 2023 Ziqi Huang, Kelvin C. K. Chan, Yuming Jiang, Ziwei Liu

In this work, we present Collaborative Diffusion, where pre-trained uni-modal diffusion models collaborate to achieve multi-modal face generation and editing without re-training.

Denoising Face Generation

Identity Encoder for Personalized Diffusion

no code implementations14 Apr 2023 Yu-Chuan Su, Kelvin C. K. Chan, Yandong Li, Yang Zhao, Han Zhang, Boqing Gong, Huisheng Wang, Xuhui Jia

Our approach greatly reduces the overhead for personalized image generation and is more applicable in many potential applications.

Image Enhancement Image Generation +1

ReVersion: Diffusion-Based Relation Inversion from Images

2 code implementations23 Mar 2023 Ziqi Huang, Tianxing Wu, Yuming Jiang, Kelvin C. K. Chan, Ziwei Liu

In this work, we propose the Relation Inversion task, which aims to learn a specific relation (represented as "relation prompt") from exemplar images.

Contrastive Learning Few-Shot Learning +1

Reference-based Image and Video Super-Resolution via C2-Matching

1 code implementation19 Dec 2022 Yuming Jiang, Kelvin C. K. Chan, Xintao Wang, Chen Change Loy, Ziwei Liu

To tackle these challenges, we propose C2-Matching in this work, which performs explicit robust matching crossing transformation and resolution.

Image Super-Resolution Reference-based Super-Resolution +2

GLEAN: Generative Latent Bank for Image Super-Resolution and Beyond

1 code implementation29 Jul 2022 Kelvin C. K. Chan, Xiangyu Xu, Xintao Wang, Jinwei Gu, Chen Change Loy

While most existing perceptual-oriented approaches attempt to generate realistic outputs through learning with adversarial loss, our method, Generative LatEnt bANk (GLEAN), goes beyond existing practices by directly leveraging rich and diverse priors encapsulated in a pre-trained GAN.

Colorization Decoder +3

Towards Robust Blind Face Restoration with Codebook Lookup Transformer

1 code implementation22 Jun 2022 Shangchen Zhou, Kelvin C. K. Chan, Chongyi Li, Chen Change Loy

In this paper, we demonstrate that a learned discrete codebook prior in a small proxy space largely reduces the uncertainty and ambiguity of restoration mapping by casting blind face restoration as a code prediction task, while providing rich visual atoms for generating high-quality faces.

Blind Face Restoration Prediction

On the Generalization of BasicVSR++ to Video Deblurring and Denoising

1 code implementation11 Apr 2022 Kelvin C. K. Chan, Shangchen Zhou, Xiangyu Xu, Chen Change Loy

The exploitation of long-term information has been a long-standing problem in video restoration.

Deblurring Denoising +3

Robust Reference-based Super-Resolution via C2-Matching

1 code implementation CVPR 2021 Yuming Jiang, Kelvin C. K. Chan, Xintao Wang, Chen Change Loy, Ziwei Liu

However, performing local transfer is difficult because of two gaps between input and reference images: the transformation gap (e. g. scale and rotation) and the resolution gap (e. g. HR and LR).

Reference-based Super-Resolution

BasicVSR++: Improving Video Super-Resolution with Enhanced Propagation and Alignment

3 code implementations CVPR 2022 Kelvin C. K. Chan, Shangchen Zhou, Xiangyu Xu, Chen Change Loy

We show that by empowering the recurrent framework with the enhanced propagation and alignment, one can exploit spatiotemporal information across misaligned video frames more effectively.

Analog Video Restoration Snow Removal +3

GLEAN: Generative Latent Bank for Large-Factor Image Super-Resolution

no code implementations CVPR 2021 Kelvin C. K. Chan, Xintao Wang, Xiangyu Xu, Jinwei Gu, Chen Change Loy

We show that pre-trained Generative Adversarial Networks (GANs), e. g., StyleGAN, can be used as a latent bank to improve the restoration quality of large-factor image super-resolution (SR).

Decoder Image Super-Resolution

Understanding Deformable Alignment in Video Super-Resolution

no code implementations15 Sep 2020 Kelvin C. K. Chan, Xintao Wang, Ke Yu, Chao Dong, Chen Change Loy

Aside from the contributions to deformable alignment, our formulation inspires a more flexible approach to introduce offset diversity to flow-based alignment, improving its performance.

Diversity Optical Flow Estimation +1

EDVR: Video Restoration with Enhanced Deformable Convolutional Networks

11 code implementations7 May 2019 Xintao Wang, Kelvin C. K. Chan, Ke Yu, Chao Dong, Chen Change Loy

In this work, we propose a novel Video Restoration framework with Enhanced Deformable networks, termed EDVR, to address these challenges.

Deblurring Video Enhancement +2

Cannot find the paper you are looking for? You can Submit a new open access paper.