no code implementations • 11 Dec 2023 • Savya Khosla, Zhen Zhu, Yifei He
This paper explores Memory-Augmented Neural Networks (MANNs), delving into how they blend human-like memory processes into AI.
no code implementations • 12 Oct 2023 • Zijie Wu, Chaohui Yu, Zhen Zhu, Fan Wang, Xiang Bai
To utilize the abundant visual priors in the off-the-shelf T2I models, a series of methods try to invert an image to proper embedding that aligns with the semantic space of the T2I model.
no code implementations • 4 Jul 2023 • Zhen Zhu, Weijie Lyu, Yao Xiao, Derek Hoiem
We introduce a method for flexible and efficient continual learning in open-vocabulary image classification, drawing inspiration from the complementary learning systems observed in human cognition.
no code implementations • 4 Jul 2023 • Zhen Zhu, Yijun Li, Weijie Lyu, Krishna Kumar Singh, Zhixin Shu, Soeren Pirk, Derek Hoiem
We investigate how to generate multimodal image outputs, such as RGB, depth, and surface normals, with a single generative model.
no code implementations • 10 May 2023 • Xudong Xie, Zhen Zhu, Zijie Wu, Zhiliang Xu, Yingying Zhu
To our knowledge, ours is the first scheme for this challenging task, including model, training, and evaluation.
1 code implementation • 11 Jul 2022 • Zijie Wu, Zhen Zhu, Junping Du, Xiang Bai
CCPL can preserve the coherence of the content source during style transfer without degrading stylization.
1 code implementation • 11 Jan 2022 • Zhiliang Xu, Zhibin Hong, Changxing Ding, Zhen Zhu, Junyu Han, Jingtuo Liu, Errui Ding
In this work, we propose a lightweight Identity-aware Dynamic Network (IDN) for subject-agnostic face swapping by dynamically adjusting the model parameters according to the identity information.
1 code implementation • 22 Mar 2021 • Zhen Zhu, Tengteng Huang, Mengde Xu, Baoguang Shi, Wenqing Cheng, Xiang Bai
This paper proposes a new generative adversarial network for pose transfer, i. e., transferring the pose of a given person to a target pose.
no code implementations • 23 Feb 2021 • Zhiliang Xu, Xiyu Yu, Zhibin Hong, Zhen Zhu, Junyu Han, Jingtuo Liu, Errui Ding, Xiang Bai
By simply employing some existing and easy-obtainable prior information, our method can control, transfer, and edit diverse attributes of faces in the wild.
Ranked #1 on Face Swapping on FaceForensics++ (FID metric)
1 code implementation • 14 Dec 2020 • Yang Liu, Zhen Zhu, Xiang Bai
Visible watermarks are widely-used in images to protect copyright ownership.
no code implementations • 28 Aug 2020 • Zhen Zhu, Enzo Weber, Till Strohsal, Duaa Serhan
We aim to find out what sustainable border control options for different entities (e. g., countries, states) exist during the reopening phases, given their own choice of domestic control measures and new technologies such as contact tracing.
1 code implementation • 19 Jul 2020 • Mucun Tian, Chun Guo, Vito Ostuni, Zhen Zhu
To unbiasedly learn to rank, existing counterfactual frameworks first estimate the propensity (probability) of missing clicks with intervention data from a small portion of search traffic, and then use inverse propensity score (IPS) to debias LTR algorithms on the whole data set.
1 code implementation • CVPR 2020 • Zhen Zhu, Zhiliang Xu, Ansheng You, Xiang Bai
Experiments on several challenging datasets demonstrate the superiority of GroupDNet on performing the SMIS task.
6 code implementations • ECCV 2020 • Xiangtai Li, Ansheng You, Zhen Zhu, Houlong Zhao, Maoke Yang, Kuiyuan Yang, Yunhai Tong
A common practice to improve the performance is to attain high resolution feature maps with strong semantic representation.
Ranked #2 on Real-Time Semantic Segmentation on Cityscapes test
5 code implementations • ICCV 2019 • Zhen Zhu, Mengde Xu, Song Bai, Tengteng Huang, Xiang Bai
The non-local module works as a particularly useful technique for semantic segmentation while criticized for its prohibitive computation and GPU memory occupation.
Ranked #15 on Semantic Segmentation on COCO-Stuff test
2 code implementations • CVPR 2019 • Zhen Zhu, Tengteng Huang, Baoguang Shi, Miao Yu, Bofei Wang, Xiang Bai
This paper proposes a new generative adversarial network for pose transfer, i. e., transferring the pose of a given person to a target pose.
Ranked #1 on Pose Transfer on Market-1501
1 code implementation • 11 May 2018 • Yang Zhou, Zhen Zhu, Xiang Bai, Dani Lischinski, Daniel Cohen-Or, Hui Huang
We demonstrate that this conceptually simple approach is highly effective for capturing large-scale structures, as well as other non-stationary attributes of the input exemplar.
no code implementations • CVPR 2018 • Minghui Liao, Zhen Zhu, Baoguang Shi, Gui-Song Xia, Xiang Bai
Previous methods rely on shared features for both tasks, resulting in degraded performance due to the incompatibility of the two tasks.
Ranked #14 on Scene Text Detection on MSRA-TD500
no code implementations • 4 Mar 2018 • Antoine Dedieu, Rahul Mazumder, Zhen Zhu, Hossein Vahabi
In this work we present a novel framework inspired by hierarchical Bayesian modeling to predict, at the moment of login, the amount of time a user will spend in the streaming service.
6 code implementations • CVPR 2018 • Gui-Song Xia, Xiang Bai, Jian Ding, Zhen Zhu, Serge Belongie, Jiebo Luo, Mihai Datcu, Marcello Pelillo, Liangpei Zhang
The fully annotated DOTA images contains $188, 282$ instances, each of which is labeled by an arbitrary (8 d. o. f.)
Ranked #52 on Object Detection In Aerial Images on DOTA (using extra training data)
no code implementations • 27 Jun 2017 • Pengyuan Lyu, Xiang Bai, Cong Yao, Zhen Zhu, Tengteng Huang, Wenyu Liu
In this paper, we investigate the Chinese calligraphy synthesis problem: synthesizing Chinese calligraphy images with specified style from standard font(eg.
no code implementations • 11 Apr 2016 • Yuan Sun, Zhen Zhu
Person knowledge extraction is the foundation of the Tibetan knowledge graph construction, which provides support for Tibetan question answering system, information retrieval, information extraction and other researches, and promotes national unity and social stability.