1 code implementation • ECCV 2020 • Xiangyu Zhu, Fan Yang, Di Huang, Chang Yu, Hao Wang, Jianzhu Guo, Zhen Lei, Stan Z. Li
However, most of their training data is constructed by 3D Morphable Model, whose space spanned is only a small part of the shape space.
no code implementations • 10 Apr 2023 • Weisong Zhao, Xiangyu Zhu, Kaiwen Guo, Xiao-Yu Zhang, Zhen Lei
Therefore, we seek to probe the target logits to extract the primary knowledge related to face identity, and discard the others, to make the distillation more achievable for the student network.
1 code implementation • CVPR 2023 • Tingting Liao, Xiaomei Zhang, Yuliang Xiu, Hongwei Yi, Xudong Liu, Guo-Jun Qi, Yong Zhang, Xuan Wang, Xiangyu Zhu, Zhen Lei
This paper presents a framework for efficient 3D clothed avatar reconstruction.
no code implementations • CVPR 2023 • Xiangyu Zhu, Dong Du, Weikai Chen, Zhiyou Zhao, Yinyu Nie, Xiaoguang Han
We show that a simple network based on NerVE can already outperform the previous state-of-the-art methods by a great margin.
1 code implementation • CVPR 2023 • Zhiyuan Ma, Xiangyu Zhu, GuoJun Qi, Zhen Lei, Lei Zhang
In this paper, we propose One-shot Talking face Avatar (OTAvatar), which constructs face avatars by a generalized controllable tri-plane rendering solution so that each personalized avatar can be constructed from only one portrait as the reference.
1 code implementation • 20 Mar 2023 • Ziqiao Peng, HaoYu Wu, Zhenbo Song, Hao Xu, Xiangyu Zhu, Hongyan Liu, Jun He, Zhaoxin Fan
Specifically, we introduce the emotion disentangling encoder (EDE) to disentangle the emotion and content in the speech by cross-reconstructed speech signals with different emotion labels.
no code implementations • CVPR 2023 • Chang Yu, Xiangyu Zhu, Xiaomei Zhang, Zhaoxiang Zhang, Zhen Lei
The function of constructing the hierarchy of objects is important to the visual process of the human brain.
no code implementations • CVPR 2023 • Qu Tang, Xiangyu Zhu, Zhen Lei, Zhaoxiang Zhang
The ability to discover abstract physical concepts and understand how they work in the world through observing lies at the core of human intelligence.
no code implementations • 29 Jan 2023 • Xiaomei Zhang, Xiangyu Zhu, Ming Tang, Zhen Lei
Human parsing is a key topic in image processing with many applications, such as surveillance analysis, human-robot interaction, person search, and clothing category classification, among many others.
1 code implementation • 23 May 2022 • Jianxiong Li, Xianyuan Zhan, Haoran Xu, Xiangyu Zhu, Jingjing Liu, Ya-Qin Zhang
In offline reinforcement learning (RL), one detrimental issue to policy learning is the error accumulation of deep Q function in out-of-distribution (OOD) areas.
no code implementations • 9 May 2022 • Yueying Kao, Bowen Pan, Miao Xu, Jiangjing Lyu, Xiangyu Zhu, Yuanzhang Chang, Xiaobo Li, Zhen Lei
In 3D face reconstruction, orthogonal projection has been widely employed to substitute perspective projection to simplify the fitting process.
1 code implementation • 27 Apr 2022 • Zeyuan Chen, He Wang, Xiangyu Zhu, Haiyan Wu, Congcong Gu, Shumeng Liu, Jinchao Huang, Wei zhang
The proposed solution of our team WSDM_Coggle_ is selected as the second place submission.
no code implementations • 24 Apr 2022 • Xiangyu Zhu, Tingting Liao, Jiangjing Lyu, Xiang Yan, Yunfeng Wang, Kan Guo, Qiong Cao, Stan Z. Li, Zhen Lei
In this paper, we consider a novel problem of reconstructing a 3D human avatar from multiple unconstrained frames, independent of assumptions on camera calibration, capture space, and constrained actions.
no code implementations • 21 Apr 2022 • Lu Zhang, Zhiyong Liu, Xiangyu Zhu, Zhan Song, Xu Yang, Zhen Lei, Hong Qiao
In this article, we propose a general multimodal detector named aligned region CNN (AR-CNN) to tackle the position shift problem.
no code implementations • 9 Apr 2022 • Xiangyu Zhu, Chang Yu, Di Huang, Zhen Lei, Hao Wang, Stan Z. Li
3D Morphable Model (3DMM) fitting has widely benefited face analysis due to its strong 3D priori.
no code implementations • CVPR 2022 • Chang Yu, Xiangyu Zhu, Xiaomei Zhang, Zidu Wang, Zhaoxiang Zhang, Zhen Lei
Capsule networks are designed to present the objects by a set of parts and their relationships, which provide an insight into the procedure of visual perception.
no code implementations • 24 Dec 2021 • Zhiwei Liu, Xiangyu Zhu, Lu Yang, Xiang Yan, Ming Tang, Zhen Lei, Guibo Zhu, Xuetao Feng, Yan Wang, Jinqiao Wang
In the second stage, we design a mesh refinement transformer (MRT) to respectively refine each coarse reconstruction result via a self-attention mechanism.
Ranked #29 on
3D Human Pose Estimation
on 3DPW
(MPJPE metric)
no code implementations • 13 Dec 2021 • Junjun Hu, Yanhao Zhu, Bo Zhao, Jiexin Zheng, Chenxu Zhao, Xiangyu Zhu, Kangle Wu, Darun Tang
One of the challenges of logo recognition lies in the diversity of forms, such as symbols, texts or a combination of both; further, logos tend to be extremely concise in design while similar in appearance, suggesting the difficulty of learning discriminative representations.
no code implementations • 28 Oct 2021 • Congqing He, Jie Zhang, Xiangyu Zhu, Huan Liu, Yukun Huang
To this end, we introduce a fresh perspective to revisit the relational event-cause extraction task and propose a novel sequence tagging framework, instead of extracting event types and events-causes separately.
no code implementations • ICLR 2022 • Qu Tang, Xiangyu Zhu, Zhen Lei, Zhaoxiang Zhang
In this paper, we work on object dynamics and propose Object Dynamics Distillation Network (ODDN), a framework that distillates explicit object dynamics (e. g., velocity) from sequential static representations.
no code implementations • 19 Jul 2021 • Haoran Xu, Xianyuan Zhan, Xiangyu Zhu
We study the problem of safe offline reinforcement learning (RL), the goal is to learn a policy that maximizes long-term reward while satisfying safety constraints given only offline data, without further interaction with the environment.
1 code implementation • 16 May 2021 • Xianyuan Zhan, Xiangyu Zhu, Haoran Xu
The recent offline reinforcement learning (RL) studies have achieved much progress to make RL usable in real-world systems by learning policies from pre-collected datasets without environment interaction.
no code implementations • 23 Feb 2021 • Xianyuan Zhan, Haoran Xu, Yue Zhang, Xiangyu Zhu, Honglei Yin, Yu Zheng
Optimizing the combustion efficiency of a thermal power generating unit (TPGU) is a highly challenging and critical task in the energy industry.
no code implementations • CVPR 2021 • Xiangyu Zhu, Hao Wang, Hongyan Fei, Zhen Lei, Stan Z. Li
Detecting digital face manipulation has attracted extensive attention due to fake media's potential harms to the public.
3 code implementations • ECCV 2020 • Jianzhu Guo, Xiangyu Zhu, Yang Yang, Fan Yang, Zhen Lei, Stan Z. Li
Firstly, on the basis of a lightweight backbone, we propose a meta-joint optimization strategy to dynamically regress a small set of 3DMM parameters, which greatly enhances speed and accuracy simultaneously.
Ranked #1 on
3D Face Reconstruction
on Florence
(Mean NME metric)
no code implementations • 28 Aug 2020 • Jiangjing Lyu, Xiaobo Li, Xiangyu Zhu, Cheng Cheng
It is also a challenging task due to the lack of high-quality datasets that can fuel current deep learning-based methods.
1 code implementation • 20 Apr 2020 • Xiangyu Zhu, Zhenbo Luo, Pei Fu, Xiang Ji
Then we use orientation and camera similarity as penalty to get final similarity.
no code implementations • CVPR 2020 • Dong Cao, Xiangyu Zhu, Xingyu Huang, Jianzhu Guo, Zhen Lei
Finally, we propose a Domain Balancing Margin (DBM) in the loss function to further optimize the feature space of the tail domains to improve generalization.
1 code implementation • CVPR 2020 • Zezheng Wang, Zitong Yu, Chenxu Zhao, Xiangyu Zhu, Yunxiao Qin, Qiusheng Zhou, Feng Zhou, Zhen Lei
Depth supervised learning has been proven as one of the most effective methods for face anti-spoofing.
1 code implementation • CVPR 2020 • Jianzhu Guo, Xiangyu Zhu, Chenxu Zhao, Dong Cao, Zhen Lei, Stan Z. Li
Face recognition systems are usually faced with unseen domains in real-world applications and show unsatisfactory performance due to their poor generalization.
no code implementations • 9 May 2019 • Yinglu Liu, Hao Shen, Yue Si, Xiaobo Wang, Xiangyu Zhu, Hailin Shi, Zhibin Hong, Hanqi Guo, Ziyuan Guo, Yanqin Chen, Bi Li, Teng Xi, Jun Yu, Haonian Xie, Guochen Xie, Mengyan Li, Qing Lu, Zengfu Wang, Shenqi Lai, Zhenhua Chai, Xiaoming Wei
However, previous competitions on facial landmark localization (i. e., the 300-W, 300-VW and Menpo challenges) aim to predict 68-point landmarks, which are incompetent to depict the structure of facial components.
no code implementations • 29 Apr 2019 • Yunxiao Qin, Chenxu Zhao, Xiangyu Zhu, Zezheng Wang, Zitong Yu, Tianyu Fu, Feng Zhou, Jingping Shi, Zhen Lei
Therefore, we define face anti-spoofing as a zero- and few-shot learning problem.
no code implementations • CVPR 2019 • Zhiwei Liu, Xiangyu Zhu, Guosheng Hu, Haiyun Guo, Ming Tang, Zhen Lei, Neil M. Robertson, Jinqiao Wang
Despite this, we notice that the semantic ambiguity greatly degrades the detection performance.
Ranked #1 on
Face Alignment
on 300W
(NME_inter-pupil (%, Full) metric)
no code implementations • 28 Feb 2019 • Haonan Qiu, Chuan Wang, Hang Zhu, Xiangyu Zhu, Jinjin Gu, Xiaoguang Han
Generating plausible hair image given limited guidance, such as sparse sketches or low-resolution image, has been made possible with the rise of Generative Adversarial Networks (GANs).
no code implementations • ICCV 2019 • Lu Zhang, Xiangyu Zhu, Xiangyu Chen, Xu Yang, Zhen Lei, Zhi-Yong Liu
In this paper, we propose a novel Aligned Region CNN (AR-CNN) to handle the weakly aligned multispectral data in an end-to-end way.
no code implementations • 2 Jan 2019 • Jianzhu Guo, Xiangyu Zhu, Jinchuan Xiao, Zhen Lei, Genxun Wan, Stan Z. Li
Specifically, we consider a printed photo as a flat surface and mesh it into a 3D object, which is then randomly bent and rotated in 3D space.
Ranked #1 on
Face Anti-Spoofing
on CASIA-MFSD
no code implementations • 11 Dec 2018 • Yunxiao Qin, WeiGuo Zhang, Chenxu Zhao, Zezheng Wang, Xiangyu Zhu, Guo-Jun Qi, Jingping Shi, Zhen Lei
In this paper, inspired by the human cognition process which utilizes both prior-knowledge and vision attention in learning new knowledge, we present a novel paradigm of meta-learning approach with three developments to introduce attention mechanism and prior-knowledge for meta-learning.
no code implementations • 10 Oct 2018 • Congqing He, Li Peng, Yuquan Le, JiaWei He, Xiangyu Zhu
In this paper, we propose a Sequence Enhanced Capsule model, dubbed as SECaps model, to relieve this problem.
no code implementations • 8 Jun 2018 • Xiangyu Zhu, Hao liu, Zhen Lei, Hailin Shi, Fan Yang, Dong Yi, Guo-Jun Qi, Stan Z. Li
In this paper, we propose a deep learning based large-scale bisample learning (LBL) method for IvS face recognition.
1 code implementation • 4 Jun 2018 • Jianzhu Guo, Xiangyu Zhu, Zhen Lei, Stan Z. Li
A feasible method is to collect large-scale face images with eyeglasses for training deep learning methods.
2 code implementations • 2 Apr 2018 • Xiangyu Zhu, Xiaoming Liu, Zhen Lei, Stan Z. Li
In this paper, we propose to tackle these three challenges in an new alignment framework termed 3D Dense Face Alignment (3DDFA), in which a dense 3D Morphable Model (3DMM) is fitted to the image via Cascaded Convolutional Neural Networks.
Ranked #3 on
Face Alignment
on AFLW
no code implementations • 11 Nov 2017 • Xiangyu Zhu, Yingying Jiang, Shuli Yang, Xiaobing Wang, Wei Li, Pei Fu, Hua Wang, Zhenbo Luo
Scene text detection is a challenging problem in computer vision.
no code implementations • ICCV 2017 • Shifeng Zhang, Xiangyu Zhu, Zhen Lei, Hailin Shi, Xiaobo Wang, Stan Z. Li
This paper presents a real-time face detector, named Single Shot Scale-invariant Face Detector (S3FD), which performs superiorly on various scales of faces with a single deep neural network, especially for small faces.
7 code implementations • 17 Aug 2017 • Shifeng Zhang, Xiangyu Zhu, Zhen Lei, Hailin Shi, Xiaobo Wang, Stan Z. Li
The MSCL aims at enriching the receptive fields and discretizing anchors over different layers to handle faces of various scales.
Ranked #3 on
Face Detection
on PASCAL Face
3 code implementations • 17 Aug 2017 • Shifeng Zhang, Xiangyu Zhu, Zhen Lei, Hailin Shi, Xiaobo Wang, Stan Z. Li
This paper presents a real-time face detector, named Single Shot Scale-invariant Face Detector (S$^3$FD), which performs superiorly on various scales of faces with a single deep neural network, especially for small faces.
Ranked #2 on
Face Detection
on PASCAL Face
1 code implementation • 29 Jun 2017 • Yingying Jiang, Xiangyu Zhu, Xiaobing Wang, Shuli Yang, Wei Li, Hua Wang, Pei Fu, Zhenbo Luo
In this paper, we propose a novel method called Rotational Region CNN (R2CNN) for detecting arbitrary-oriented texts in natural scene images.
no code implementations • 1 Nov 2016 • Hailin Shi, Yang Yang, Xiangyu Zhu, Shengcai Liao, Zhen Lei, Wei-Shi Zheng, Stan Z. Li
From this point of view, selecting suitable positive i. e. intra-class) training samples within a local range is critical for training the CNN embedding, especially when the data has large intra-class variations.
no code implementations • 9 May 2016 • Hailin Shi, Xiangyu Zhu, Zhen Lei, Shengcai Liao, Stan Z. Li
Deep neural networks usually benefit from unsupervised pre-training, e. g. auto-encoders.
no code implementations • 24 Nov 2015 • Hailin Shi, Xiangyu Zhu, Shengcai Liao, Zhen Lei, Yang Yang, Stan Z. Li
In this paper, we propose a novel CNN-based method to learn a discriminative metric with good robustness to the over-fitting problem in person re-identification.
no code implementations • CVPR 2016 • Xiangyu Zhu, Zhen Lei, Xiaoming Liu, Hailin Shi, Stan Z. Li
Face alignment, which fits a face model to an image and extracts the semantic meanings of facial pixels, has been an important topic in CV community.
Ranked #3 on
3D Face Reconstruction
on Florence
no code implementations • CVPR 2015 • Junjie Yan, Yinan Yu, Xiangyu Zhu, Zhen Lei, Stan Z. Li
Object detection is always conducted by object proposal generation and classification sequentially.
no code implementations • CVPR 2015 • Xiangyu Zhu, Zhen Lei, Junjie Yan, Dong Yi, Stan Z. Li
Pose and expression normalization is a crucial step to recover the canonical view of faces under arbitrary conditions, so as to improve the face recognition performance.
1 code implementation • CVPR 2015 • Shengcai Liao, Yang Hu, Xiangyu Zhu, Stan Z. Li
In this paper, we propose an effective feature representation called Local Maximal Occurrence (LOMO), and a subspace and metric learning method called Cross-view Quadratic Discriminant Analysis (XQDA).
Ranked #86 on
Person Re-Identification
on DukeMTMC-reID