Search Results for author: Mingrui Zhu

Found 12 papers, 2 papers with code

InstructBrush: Learning Attention-based Instruction Optimization for Image Editing

no code implementations27 Mar 2024 Ruoyu Zhao, Qingnan Fan, Fei Kou, Shuai Qin, Hong Gu, Wei Wu, Pengcheng Xu, Mingrui Zhu, Nannan Wang, Xinbo Gao

Two key techniques are introduced into InstructBrush, Attention-based Instruction Optimization and Transformation-oriented Instruction Initialization, to address the limitations of the previous method in terms of inversion effects and instruction generalization.

Bridging Generative and Discriminative Models for Unified Visual Perception with Diffusion Priors

no code implementations29 Jan 2024 Shiyin Dong, Mingrui Zhu, Kun Cheng, Nannan Wang, Xinbo Gao

Our purpose is to establish a unified visual perception framework, capitalizing on the potential synergies between generative and discriminative models.

Image Generation Open Vocabulary Semantic Segmentation +2

CatVersion: Concatenating Embeddings for Diffusion-Based Text-to-Image Personalization

no code implementations24 Nov 2023 Ruoyu Zhao, Mingrui Zhu, Shiyin Dong, Nannan Wang, Xinbo Gao

We propose CatVersion, an inversion-based method that learns the personalized concept through a handful of examples.

Image Generation

HFORD: High-Fidelity and Occlusion-Robust De-identification for Face Privacy Protection

no code implementations15 Nov 2023 Dongxin Chen, Mingrui Zhu, Nannan Wang, Xinbo Gao

To disentangle the latent codes in the GAN inversion space, we introduce an Identity Disentanglement Module (IDM).

Attribute De-identification +1

Diff-Privacy: Diffusion-based Face Privacy Protection

no code implementations11 Sep 2023 Xiao He, Mingrui Zhu, Dongxin Chen, Nannan Wang, Xinbo Gao

In this paper, we unify the task of anonymization and visual identity information hiding and propose a novel face privacy protection method based on diffusion models, dubbed Diff-Privacy.

Denoising Scheduling

Adapt and Align to Improve Zero-Shot Sketch-Based Image Retrieval

no code implementations9 May 2023 Shiyin Dong, Mingrui Zhu, Nannan Wang, Xinbo Gao

Zero-shot sketch-based image retrieval (ZS-SBIR) is challenging due to the cross-domain nature of sketches and photos, as well as the semantic gap between seen and unseen image distributions.

Retrieval Sketch-Based Image Retrieval +1

Few-shot Face Image Translation via GAN Prior Distillation

no code implementations28 Jan 2023 Ruoyu Zhao, Mingrui Zhu, Xiaoyu Wang, Nannan Wang

GPD contains two models: a teacher network with GAN Prior and a student network that fulfills end-to-end translation.

Knowledge Distillation Translation

Few-shot Font Generation by Learning Style Difference and Similarity

no code implementations24 Jan 2023 Xiao He, Mingrui Zhu, Nannan Wang, Xinbo Gao, Heng Yang

To address this issue, we propose a novel font generation approach by learning the Difference between different styles and the Similarity of the same style (DS-Font).

Contrastive Learning Font Generation

All-to-key Attention for Arbitrary Style Transfer

no code implementations ICCV 2023 Mingrui Zhu, Xiao He, Nannan Wang, Xiaoyu Wang, Xinbo Gao

In this paper, we propose a novel all-to-key attention mechanism -- each position of content features is matched to stable key positions of style features -- that is more in line with the characteristics of style transfer.

Position Style Transfer

VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

1 code implementation27 Nov 2022 Kun Cheng, Xiaodong Cun, Yong Zhang, Menghan Xia, Fei Yin, Mingrui Zhu, Xuan Wang, Jue Wang, Nannan Wang

Our system disentangles this objective into three sequential tasks: (1) face video generation with a canonical expression; (2) audio-driven lip-sync; and (3) face enhancement for improving photo-realism.

Video Editing Video Generation

Semi-parametric Makeup Transfer via Semantic-aware Correspondence

1 code implementation4 Mar 2022 Mingrui Zhu, Yun Yi, Nannan Wang, Xiaoyu Wang, Xinbo Gao

The large discrepancy between the source non-makeup image and the reference makeup image is one of the key challenges in makeup transfer.

Dual Based DSP Bidding Strategy and its Application

no code implementations26 May 2017 Huahui Liu, Mingrui Zhu, Xiaonan Meng, Yi Hu, Hao Wang

In recent years, RTB(Real Time Bidding) becomes a popular online advertisement trading method.

Cannot find the paper you are looking for? You can Submit a new open access paper.