Search Results for author: Zhen Zhu

Found 22 papers, 11 papers with code

Survey on Memory-Augmented Neural Networks: Cognitive Insights to AI Applications

no code implementations • 11 Dec 2023 • Savya Khosla, Zhen Zhu, Yifei He

This paper explores Memory-Augmented Neural Networks (MANNs), delving into how they blend human-like memory processes into AI.

Retrieval

Paper
Add Code

SingleInsert: Inserting New Concepts from a Single Image into Text-to-Image Models for Flexible Editing

no code implementations • 12 Oct 2023 • Zijie Wu, Chaohui Yu, Zhen Zhu, Fan Wang, Xiang Bai

To utilize the abundant visual priors in the off-the-shelf T2I models, a series of methods try to invert an image to proper embedding that aligns with the semantic space of the T2I model.

Image Generation Novel View Synthesis

Paper
Add Code

Continual Learning in Open-vocabulary Classification with Complementary Memory Systems

no code implementations • 4 Jul 2023 • Zhen Zhu, Weijie Lyu, Yao Xiao, Derek Hoiem

We introduce a method for flexible and efficient continual learning in open-vocabulary image classification, drawing inspiration from the complementary learning systems observed in human cognition.

Continual Learning Image Classification

Paper
Add Code

Consistent Multimodal Generation via A Unified GAN Framework

no code implementations • 4 Jul 2023 • Zhen Zhu, Yijun Li, Weijie Lyu, Krishna Kumar Singh, Zhixin Shu, Soeren Pirk, Derek Hoiem

We investigate how to generate multimodal image outputs, such as RGB, depth, and surface normals, with a single generative model.

multimodal generation

Paper
Add Code

Learning in a Single Domain for Non-Stationary Multi-Texture Synthesis

no code implementations • 10 May 2023 • Xudong Xie, Zhen Zhu, Zijie Wu, Zhiliang Xu, Yingying Zhu

To our knowledge, ours is the first scheme for this challenging task, including model, training, and evaluation.

Texture Synthesis

Paper
Add Code

CCPL: Contrastive Coherence Preserving Loss for Versatile Style Transfer

1 code implementation • 11 Jul 2022 • Zijie Wu, Zhen Zhu, Junping Du, Xiang Bai

CCPL can preserve the coherence of the content source during style transfer without degrading stylization.

Image-to-Image Translation Style Transfer +1

188

Paper
Code

MobileFaceSwap: A Lightweight Framework for Video Face Swapping

1 code implementation • 11 Jan 2022 • Zhiliang Xu, Zhibin Hong, Changxing Ding, Zhen Zhu, Junyu Han, Jingtuo Liu, Errui Ding

In this work, we propose a lightweight Identity-aware Dynamic Network (IDN) for subject-agnostic face swapping by dynamically adjusting the model parameters according to the identity information.

Face Swapping Knowledge Distillation

280

Paper
Code

Progressive and Aligned Pose Attention Transfer for Person Image Generation

1 code implementation • 22 Mar 2021 • Zhen Zhu, Tengteng Huang, Mengde Xu, Baoguang Shi, Wenqing Cheng, Xiang Bai

This paper proposes a new generative adversarial network for pose transfer, i. e., transferring the pose of a given person to a target pose.

Data Augmentation Generative Adversarial Network +2

731

Paper
Code

FaceController: Controllable Attribute Editing for Face in the Wild

no code implementations • 23 Feb 2021 • Zhiliang Xu, Xiyu Yu, Zhibin Hong, Zhen Zhu, Junyu Han, Jingtuo Liu, Errui Ding, Xiang Bai

By simply employing some existing and easy-obtainable prior information, our method can control, transfer, and edit diverse attributes of faces in the wild.

Ranked #1 on Face Swapping on FaceForensics++ (FID metric)

Attribute Disentanglement +1

Paper
Add Code

WDNet: Watermark-Decomposition Network for Visible Watermark Removal

1 code implementation • 14 Dec 2020 • Yang Liu, Zhen Zhu, Xiang Bai

Visible watermarks are widely-used in images to protect copyright ownership.

Image-to-Image Translation

Paper
Code

Sustainable Border Control Policy in the COVID-19 Pandemic: A Math Modeling Study

no code implementations • 28 Aug 2020 • Zhen Zhu, Enzo Weber, Till Strohsal, Duaa Serhan

We aim to find out what sustainable border control options for different entities (e. g., countries, states) exist during the reopening phases, given their own choice of domestic control measures and new technologies such as contact tracing.

Math

Paper
Add Code

Counterfactual Learning to Rank using Heterogeneous Treatment Effect Estimation

1 code implementation • 19 Jul 2020 • Mucun Tian, Chun Guo, Vito Ostuni, Zhen Zhu

To unbiasedly learn to rank, existing counterfactual frameworks first estimate the propensity (probability) of missing clicks with intervention data from a small portion of search traffic, and then use inverse propensity score (IPS) to debias LTR algorithms on the whole data set.

counterfactual Learning-To-Rank +1

Paper
Code

Semantically Multi-modal Image Synthesis

1 code implementation • CVPR 2020 • Zhen Zhu, Zhiliang Xu, Ansheng You, Xiang Bai

Experiments on several challenging datasets demonstrate the superiority of GroupDNet on performing the SMIS task.

Image Generation

320

Paper
Code

Semantic Flow for Fast and Accurate Scene Parsing

6 code implementations • ECCV 2020 • Xiangtai Li, Ansheng You, Zhen Zhu, Houlong Zhao, Maoke Yang, Kuiyuan Yang, Yunhai Tong

A common practice to improve the performance is to attain high resolution feature maps with strong semantic representation.

Ranked #2 on Real-Time Semantic Segmentation on Cityscapes test

Optical Flow Estimation Real-Time Semantic Segmentation +1

8,248

Paper
Code

Asymmetric Non-local Neural Networks for Semantic Segmentation

5 code implementations • ICCV 2019 • Zhen Zhu, Mengde Xu, Song Bai, Tengteng Huang, Xiang Bai

The non-local module works as a particularly useful technique for semantic segmentation while criticized for its prohibitive computation and GPU memory occupation.

Ranked #15 on Semantic Segmentation on COCO-Stuff test

Segmentation Semantic Segmentation

8,248

Paper
Code

Progressive Pose Attention Transfer for Person Image Generation

2 code implementations • CVPR 2019 • Zhen Zhu, Tengteng Huang, Baoguang Shi, Miao Yu, Bofei Wang, Xiang Bai

This paper proposes a new generative adversarial network for pose transfer, i. e., transferring the pose of a given person to a target pose.

Ranked #1 on Pose Transfer on Market-1501

Generative Adversarial Network Person Re-Identification +1

731

Paper
Code

Non-Stationary Texture Synthesis by Adversarial Expansion

1 code implementation • 11 May 2018 • Yang Zhou, Zhen Zhu, Xiang Bai, Dani Lischinski, Daniel Cohen-Or, Hui Huang

We demonstrate that this conceptually simple approach is highly effective for capturing large-scale structures, as well as other non-stationary attributes of the input exemplar.

Generative Adversarial Network Texture Synthesis

369

Paper
Code

Rotation-Sensitive Regression for Oriented Scene Text Detection

no code implementations • CVPR 2018 • Minghui Liao, Zhen Zhu, Baoguang Shi, Gui-Song Xia, Xiang Bai

Previous methods rely on shared features for both tasks, resulting in degraded performance due to the incompatibility of the two tasks.

Ranked #14 on Scene Text Detection on MSRA-TD500

Classification General Classification +6

Paper
Add Code

Hierarchical Modeling and Shrinkage for User Session Length Prediction in Media Streaming

no code implementations • 4 Mar 2018 • Antoine Dedieu, Rahul Mazumder, Zhen Zhu, Hossein Vahabi

In this work we present a novel framework inspired by hierarchical Bayesian modeling to predict, at the moment of login, the amount of time a user will spend in the streaming service.

Paper
Add Code

DOTA: A Large-scale Dataset for Object Detection in Aerial Images

6 code implementations • CVPR 2018 • Gui-Song Xia, Xiang Bai, Jian Ding, Zhen Zhu, Serge Belongie, Jiebo Luo, Mihai Datcu, Marcello Pelillo, Liangpei Zhang

The fully annotated DOTA images contains $188, 282$ instances, each of which is labeled by an arbitrary (8 d. o. f.)

Ranked #52 on Object Detection In Aerial Images on DOTA (using extra training data)

Earth Observation Object +2

12,048

Paper
Code

Auto-Encoder Guided GAN for Chinese Calligraphy Synthesis

no code implementations • 27 Jun 2017 • Pengyuan Lyu, Xiang Bai, Cong Yao, Zhen Zhu, Tengteng Huang, Wenyu Liu

In this paper, we investigate the Chinese calligraphy synthesis problem: synthesizing Chinese calligraphy images with specified style from standard font(eg.

Image-to-Image Translation Translation

Paper
Add Code

Method of Tibetan Person Knowledge Extraction

no code implementations • 11 Apr 2016 • Yuan Sun, Zhen Zhu

Person knowledge extraction is the foundation of the Tibetan knowledge graph construction, which provides support for Tibetan question answering system, information retrieval, information extraction and other researches, and promotes national unity and social stability.

graph construction Information Retrieval +3

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.