Search Results for author: Zhen Lei

Found 124 papers, 41 papers with code

Beyond 3DMM Space: Towards Fine-grained 3D Face Reconstruction

1 code implementation ECCV 2020 Xiangyu Zhu, Fan Yang, Di Huang, Chang Yu, Hao Wang, Jianzhu Guo, Zhen Lei, Stan Z. Li

However, most of their training data is constructed by 3D Morphable Model, whose spanned space covers only a small part of the shape space.

3D Face Reconstruction

Exclusivity-Consistency Regularized Knowledge Distillation for Face Recognition

no code implementations ECCV 2020 Xiaobo Wang, Tianyu Fu, Shengcai Liao, Shuo Wang, Zhen Lei, Tao Mei

Knowledge distillation is an effective tool to compress large pre-trained Convolutional Neural Networks (CNNs) or their ensembles into models applicable to mobile and embedded devices.

Face Recognition Knowledge Distillation +1

AccidentBlip2: Accident Detection With Multi-View MotionBlip2

1 code implementation 18 Apr 2024 Yihua Shao, Hongyi Cai, Wenxin Long, Weiyi Lang, Zhe Wang, Haoran Wu, Yan Wang, Yang Yang, Zhen Lei

Multimodal Large Language Models (MLLMs) have shown outstanding capabilities in many areas of multimodal reasoning.

Towards Multi-agent Reinforcement Learning based Traffic Signal Control through Spatio-temporal Hypergraphs

no code implementations 17 Apr 2024 Kang Wang, Zhishu Shen, Zhen Lei, Tiehua Zhang

Traffic signal control systems (TSCSs) are integral to intelligent traffic management, fostering efficient vehicle flow.

Edge-computing Management +1

FusionMamba: Efficient Image Fusion with State Space Model

no code implementations 11 Apr 2024 Siran Peng, Xiangyu Zhu, Haoyu Deng, Zhen Lei, Liang-Jian Deng

Image fusion aims to generate a high-resolution multi/hyper-spectral image by combining a high-resolution image with limited spectral information and a low-resolution image with abundant spectral data.

Solving Parametric PDEs with Radial Basis Functions and Deep Neural Networks

no code implementations 10 Apr 2024 Guanhang Lei, Zhen Lei, Lei Shi, Chenyu Zeng

We propose the POD-DNN, a novel algorithm leveraging deep neural networks (DNNs) along with radial basis functions (RBFs) in the context of the proper orthogonal decomposition (POD) reduced basis method (RBM), aimed at approximating the parametric mapping of parametric partial differential equations on irregular domains.

Unified Physical-Digital Attack Detection Challenge

no code implementations 9 Apr 2024 Haocheng Yuan, Ajian Liu, Junze Zheng, Jun Wan, Jiankang Deng, Sergio Escalera, Hugo Jair Escalante, Isabelle Guyon, Zhen Lei

Based on this dataset, we organized a Unified Physical-Digital Face Attack Detection Challenge to boost the research in Unified Attack Detections.

Face Anti-Spoofing Face Recognition

Generative Active Learning for Image Synthesis Personalization

1 code implementation 22 Mar 2024 Xulu Zhang, WengYu Zhang, Xiao-Yong Wei, Jinlin Wu, Zhaoxiang Zhang, Zhen Lei, Qing Li

The primary challenge in conducting active learning on generative models lies in the open-ended nature of querying, which differs from the closed form of querying in discriminative models that typically target a single concept.

Active Learning Image Generation

CFPL-FAS: Class Free Prompt Learning for Generalizable Face Anti-spoofing

no code implementations 21 Mar 2024 Ajian Liu, Shuai Xue, Jianwen Gan, Jun Wan, Yanyan Liang, Jiankang Deng, Sergio Escalera, Zhen Lei

Specifically, we propose a novel Class Free Prompt Learning (CFPL) paradigm for DG FAS, which utilizes two lightweight transformers, namely Content Q-Former (CQF) and Style Q-Former (SQF), to learn the different semantic prompts conditioned on content and style features by using a set of learnable query vectors, respectively.

Domain Generalization Face Anti-Spoofing

Factorized Learning Assisted with Large Language Model for Gloss-free Sign Language Translation

no code implementations 19 Mar 2024 Zhigang Chen, Benjia Zhou, Jun Li, Jun Wan, Zhen Lei, Ning Jiang, Quan Lu, Guoqing Zhao

Although some approaches work towards gloss-free SLT through jointly training the visual encoder and translation network, these efforts still suffer from poor performance and inefficient use of the powerful Large Language Model (LLM).

Gloss-free Sign Language Translation Language Modelling +3

DiffSpeaker: Speech-Driven 3D Facial Animation with Diffusion Transformer

1 code implementation 8 Feb 2024 Zhiyuan Ma, Xiangyu Zhu, GuoJun Qi, Chen Qian, Zhaoxiang Zhang, Zhen Lei

We suspect this is due to a shortage of paired audio-4D data, which is crucial for the Transformer to effectively perform as a denoiser within the Diffusion framework.

Segment Anything in 3D Gaussians

no code implementations 31 Jan 2024 Xu Hu, Yuxi Wang, Lue Fan, Junsong Fan, Junran Peng, Zhen Lei, Qing Li, Zhaoxiang Zhang

In this paper, we propose a novel approach to achieve object segmentation in 3D Gaussian via an interactive procedure without any training process and learned parameters.

Segmentation Semantic Segmentation

PVLR: Prompt-driven Visual-Linguistic Representation Learning for Multi-Label Image Recognition

no code implementations 31 Jan 2024 Hao Tan, Zichang Tan, Jun Li, Jun Wan, Zhen Lei

In contrast to the unidirectional fusion in previous works, we introduce a Dual-Modal Attention (DMA) that enables bidirectional interaction between textual and visual features, yielding context-aware label representations and semantic-related visual representations, which are subsequently used to calculate similarities and generate final predictions for all labels.

Representation Learning

Compositional Inversion for Stable Diffusion Models

1 code implementation 13 Dec 2023 Xulu Zhang, Xiao-Yong Wei, Jinlin Wu, Tianyi Zhang, Zhaoxiang Zhang, Zhen Lei, Qing Li

It stems from the fact that during inversion, the irrelevant semantics in the user images are also encoded, forcing the inverted concepts to occupy locations far from the core distribution in the embedding space.

Compound Text-Guided Prompt Tuning via Image-Adaptive Cues

1 code implementation 11 Dec 2023 Hao Tan, Jun Li, Yizhuang Zhou, Jun Wan, Zhen Lei, Xiangyu Zhang

We introduce text supervision to the optimization of prompts, which enables two benefits: 1) releasing the model's reliance on the pre-defined category names during inference, thereby enabling more flexible prompt generation; 2) reducing the number of inputs to the text encoder, which decreases GPU memory consumption significantly.

Domain Generalization

GPT4SGG: Synthesizing Scene Graphs from Holistic and Region-specific Narratives

no code implementations 7 Dec 2023 Zuyao Chen, Jinlin Wu, Zhen Lei, Zhaoxiang Zhang, Changwen Chen

Learning scene graphs from natural language descriptions has proven to be a cheap and promising scheme for Scene Graph Generation (SGG).

Graph Generation Scene Graph Generation +1

3D Face Reconstruction with the Geometric Guidance of Facial Part Segmentation

1 code implementation 1 Dec 2023 Zidu Wang, Xiangyu Zhu, Tianshuo Zhang, Baiqin Wang, Zhen Lei

In this paper, we fully utilize the facial part segmentation geometry by introducing Part Re-projection Distance Loss (PRDL).

3D Face Reconstruction Segmentation

Expanding Scene Graph Boundaries: Fully Open-vocabulary Scene Graph Generation via Visual-Concept Alignment and Retention

no code implementations 18 Nov 2023 Zuyao Chen, Jinlin Wu, Zhen Lei, Zhaoxiang Zhang, Changwen Chen

For the more challenging settings of relation-involved open vocabulary SGG, the proposed approach integrates relation-aware pre-training utilizing image-caption data and retains visual-concept alignment through knowledge distillation.

Concept Alignment Graph Generation +6

PWISeg: Point-based Weakly-supervised Instance Segmentation for Surgical Instruments

1 code implementation 16 Nov 2023 Zhen Sun, Huan Xu, Jinlin Wu, Zhen Chen, Zhen Lei, Hongbin Liu

To address this issue, we propose a novel yet effective weakly-supervised surgical instrument instance segmentation approach, named Point-based Weakly-supervised Instance Segmentation (PWISeg).

Instance Segmentation Segmentation +4

SurgPLAN: Surgical Phase Localization Network for Phase Recognition

no code implementations 16 Nov 2023 Xingjian Luo, You Pang, Zhen Chen, Jinlin Wu, Zongmin Zhang, Zhen Lei, Hongbin Liu

To address these two challenges, we propose a Surgical Phase LocAlization Network, named SurgPLAN, to facilitate a more accurate and stable surgical phase recognition with the principle of temporal detection.

Surgical phase recognition

Visual Commonsense based Heterogeneous Graph Contrastive Learning

no code implementations 11 Nov 2023 Zongzhao Li, Xiangyu Zhu, Xi Zhang, Zhaoxiang Zhang, Zhen Lei

Specifically, our model contains two key components: the Commonsense-based Contrastive Learning and the Graph Relation Network.

Contrastive Learning Question Answering +4

Solving PDEs on Spheres with Physics-Informed Convolutional Neural Networks

no code implementations 18 Aug 2023 Guanhang Lei, Zhen Lei, Lei Shi, Chenyu Zeng, Ding-Xuan Zhou

In this paper, we establish rigorous analysis of the physics-informed convolutional neural network (PICNN) for solving PDEs on the sphere.

DiffusePast: Diffusion-based Generative Replay for Class Incremental Semantic Segmentation

no code implementations 2 Aug 2023 Jingfan Chen, Yuxi Wang, Pengfei Wang, Xiao Chen, Zhaoxiang Zhang, Zhen Lei, Qing Li

The Class Incremental Semantic Segmentation (CISS) extends the traditional segmentation task by incrementally learning newly added classes.

Class-Incremental Semantic Segmentation Segmentation

Gloss-free Sign Language Translation: Improving from Visual-Language Pretraining

1 code implementation ICCV 2023 Benjia Zhou, Zhigang Chen, Albert Clapés, Jun Wan, Yanyan Liang, Sergio Escalera, Zhen Lei, Du Zhang

Many previous methods employ an intermediate representation, i.e., gloss sequences, to facilitate SLT, thus transforming it into a two-stage task of sign language recognition (SLR) followed by sign language translation (SLT).

Gloss-free Sign Language Translation Self-Supervised Learning +3

General vs. Long-Tailed Age Estimation: An Approach to Kill Two Birds with One Stone

no code implementations 19 Jul 2023 Zenghao Bao, Zichang Tan, Jun Li, Jun Wan, Xibo Ma, Zhen Lei

Driven by this, some works suggest that each class should be treated equally to improve performance in tail classes (with a minority of samples), which can be summarized as Long-tailed Age Estimation.

Age Estimation MORPH

NCL++: Nested Collaborative Learning for Long-Tailed Visual Recognition

no code implementations 29 Jun 2023 Zichang Tan, Jun Li, Jinhao Du, Jun Wan, Zhen Lei, Guodong Guo

To achieve collaborative learning in the long-tailed setting, balanced online distillation is proposed to enforce consistent predictions among different experts and augmented copies, which reduces learning uncertainties.

Cross Architecture Distillation for Face Recognition

no code implementations 26 Jun 2023 Weisong Zhao, Xiangyu Zhu, Zhixiang He, Xiao-Yu Zhang, Zhen Lei

Transformers have emerged as the superior choice for face recognition tasks, but their insufficient platform acceleration hinders their application on mobile devices.

Face Recognition Knowledge Distillation

Surveillance Face Presentation Attack Detection Challenge

no code implementations 15 Apr 2023 Hao Fang, Ajian Liu, Jun Wan, Sergio Escalera, Hugo Jair Escalante, Zhen Lei

Based on this dataset and protocol-3 for evaluating the robustness of the algorithm under quality changes, we organized a face presentation attack detection challenge in surveillance scenarios.

Face Anti-Spoofing Face Presentation Attack Detection +1

Wild Face Anti-Spoofing Challenge 2023: Benchmark and Results

1 code implementation 12 Apr 2023 Dong Wang, Jia Guo, Qiqi Shao, Haochi He, Zhian Chen, Chuanbao Xiao, Ajian Liu, Sergio Escalera, Hugo Jair Escalante, Zhen Lei, Jun Wan, Jiankang Deng

Leveraging the WFAS dataset and Protocol 1 (Known-Type), we host the Wild Face Anti-Spoofing Challenge at the CVPR2023 workshop.

Face Anti-Spoofing Face Recognition

Grouped Knowledge Distillation for Deep Face Recognition

no code implementations 10 Apr 2023 Weisong Zhao, Xiangyu Zhu, Kaiwen Guo, Xiao-Yu Zhang, Zhen Lei

Therefore, we seek to probe the target logits to extract the primary knowledge related to face identity, and discard the others, to make the distillation more achievable for the student network.

Face Recognition Knowledge Distillation

OTAvatar: One-shot Talking Face Avatar with Controllable Tri-plane Rendering

1 code implementation CVPR 2023 Zhiyuan Ma, Xiangyu Zhu, GuoJun Qi, Zhen Lei, Lei Zhang

In this paper, we propose One-shot Talking face Avatar (OTAvatar), which constructs face avatars by a generalized controllable tri-plane rendering solution so that each personalized avatar can be constructed from only one portrait as the reference.

Sharpness-Aware Gradient Matching for Domain Generalization

1 code implementation CVPR 2023 Pengfei Wang, Zhaoxiang Zhang, Zhen Lei, Lei Zhang

In this paper, we present two conditions to ensure that the model could converge to a flat minimum with a small loss, and present an algorithm, named Sharpness-Aware Gradient Matching (SAGM), to meet the two conditions for improving model generalization capability.

Domain Generalization

Intrinsic Physical Concepts Discovery with Object-Centric Predictive Models

no code implementations CVPR 2023 Qu Tang, Xiangyu Zhu, Zhen Lei, Zhaoxiang Zhang

The ability to discover abstract physical concepts and understand how they work in the world through observing lies at the core of human intelligence.

Self-similarity Driven Scale-invariant Learning for Weakly Supervised Person Search

no code implementations ICCV 2023 Benzhi Wang, Yang Yang, Jinlin Wu, Guo-Jun Qi, Zhen Lei

On the other hand, the similarity of cross-scale images is often smaller than that of images with the same scale for a person, which will increase the difficulty of matching.

Person Search

Deep Learning for Human Parsing: A Survey

no code implementations 29 Jan 2023 Xiaomei Zhang, Xiangyu Zhu, Ming Tang, Zhen Lei

Human parsing is a key topic in image processing with many applications, such as surveillance analysis, human-robot interaction, person search, and clothing category classification, among many others.

Human Parsing Person Search

Surveillance Face Anti-spoofing

no code implementations 3 Jan 2023 Hao Fang, Ajian Liu, Jun Wan, Sergio Escalera, Chenxu Zhao, Xu Zhang, Stan Z. Li, Zhen Lei

In order to promote relevant research and fill this gap in the community, we collect a large-scale Surveillance High-Fidelity Mask (SuHiFiMask) dataset captured under 40 surveillance scenes, which has 101 subjects from different age groups with 232 3D attacks (high-fidelity masks), 200 2D attacks (posters, portraits, and screens), and 2 adversarial attacks.

Contrastive Learning Face Anti-Spoofing +2

Face Presentation Attack Detection

no code implementations 7 Dec 2022 Zitong Yu, Chenxu Zhao, Zhen Lei

Face recognition technology has been widely used in daily interactive applications such as checking-in and mobile payment due to its convenience and high accuracy.

Face Anti-Spoofing Face Presentation Attack Detection +1

Towards 3D Face Reconstruction in Perspective Projection: Estimating 6DoF Face Pose from Monocular Image

1 code implementation 9 May 2022 Yueying Kao, Bowen Pan, Miao Xu, Jiangjing Lyu, Xiangyu Zhu, Yuanzhang Chang, Xiaobo Li, Zhen Lei

In 3D face reconstruction, orthogonal projection has been widely employed to substitute perspective projection to simplify the fitting process.

3D Face Reconstruction

MVP-Human Dataset for 3D Human Avatar Reconstruction from Unconstrained Frames

1 code implementation 24 Apr 2022 Xiangyu Zhu, Tingting Liao, Jiangjing Lyu, Xiang Yan, Yunfeng Wang, Kan Guo, Qiong Cao, Stan Z. Li, Zhen Lei

In this paper, we consider a novel problem of reconstructing a 3D human avatar from multiple unconstrained frames, independent of assumptions on camera calibration, capture space, and constrained actions.

Camera Calibration

Weakly Aligned Feature Fusion for Multimodal Object Detection

no code implementations 21 Apr 2022 Lu Zhang, Zhiyong Liu, Xiangyu Zhu, Zhan Song, Xu Yang, Zhen Lei, Hong Qiao

In this article, we propose a general multimodal detector named aligned region CNN (AR-CNN) to tackle the position shift problem.

Object object-detection +2

Beyond 3DMM: Learning to Capture High-fidelity 3D Face Shape

no code implementations 9 Apr 2022 Xiangyu Zhu, Chang Yu, Di Huang, Zhen Lei, Hao Wang, Stan Z. Li

3D Morphable Model (3DMM) fitting has widely benefited face analysis due to its strong 3D prior.

Vocal Bursts Intensity Prediction

Nested Collaborative Learning for Long-Tailed Visual Recognition

1 code implementation CVPR 2022 Jun Li, Zichang Tan, Jun Wan, Zhen Lei, Guodong Guo

NCL consists of two core components, namely Nested Individual Learning (NIL) and Nested Balanced Online Distillation (NBOD), which focus on the individual supervised learning for each single expert and the knowledge transferring among multiple experts, respectively.

Image Classification Long-tail Learning

HP-Capsule: Unsupervised Face Part Discovery by Hierarchical Parsing Capsule Network

no code implementations CVPR 2022 Chang Yu, Xiangyu Zhu, Xiaomei Zhang, Zidu Wang, Zhaoxiang Zhang, Zhen Lei

Capsule networks are designed to present the objects by a set of parts and their relationships, which provide an insight into the procedure of visual perception.

Solving parametric partial differential equations with deep rectified quadratic unit neural networks

no code implementations 14 Mar 2022 Zhen Lei, Lei Shi, Chenyu Zeng

In this study, we investigate the expressive power of deep rectified quadratic unit (ReQU) neural networks for approximating the solution maps of parametric PDEs.

VLAD-VSA: Cross-Domain Face Presentation Attack Detection with Vocabulary Separation and Adaptation

1 code implementation 21 Feb 2022 Jiong Wang, Zhou Zhao, Weike Jin, Xinyu Duan, Zhen Lei, Baoxing Huai, Yiling Wu, Xiaofei He

In this paper, the VLAD aggregation method is adopted to quantize local features with visual vocabulary locally partitioning the feature space, and hence preserve the local discriminability.

Face Presentation Attack Detection

Multi-initialization Optimization Network for Accurate 3D Human Pose and Shape Estimation

no code implementations 24 Dec 2021 Zhiwei Liu, Xiangyu Zhu, Lu Yang, Xiang Yan, Ming Tang, Zhen Lei, Guibo Zhu, Xuetao Feng, Yan Wang, Jinqiao Wang

In the second stage, we design a mesh refinement transformer (MRT) to respectively refine each coarse reconstruction result via a self-attention mechanism.

Ranked #65 on 3D Human Pose Estimation on 3DPW (MPJPE metric)

3D human pose and shape estimation 3D Reconstruction

Meta-Teacher For Face Anti-Spoofing

no code implementations 12 Nov 2021 Yunxiao Qin, Zitong Yu, Longbin Yan, Zezheng Wang, Chenxu Zhao, Zhen Lei

The meta-teacher is trained in a bi-level optimization manner to learn the ability to supervise the PA detectors learning rich spoofing cues.

Face Anti-Spoofing Face Recognition

LAE: Long-tailed Age Estimation

no code implementations 25 Oct 2021 Zenghao Bao, Zichang Tan, Yu Zhu, Jun Wan, Xibo Ma, Zhen Lei, Guodong Guo

To improve the performance of facial age estimation, we first formulate a simple standard baseline and build a much stronger one by collecting the tricks in pre-training, data augmentation, model architecture, and so on.

Age Estimation Data Augmentation +1

Object Dynamics Distillation for Scene Decomposition and Representation

no code implementations ICLR 2022 Qu Tang, Xiangyu Zhu, Zhen Lei, Zhaoxiang Zhang

In this paper, we work on object dynamics and propose Object Dynamics Distillation Network (ODDN), a framework that distills explicit object dynamics (e.g., velocity) from sequential static representations.

Object Predict Future Video Frames +1

PoseFace: Pose-Invariant Features and Pose-Adaptive Loss for Face Recognition

no code implementations 25 Jul 2021 Qiang Meng, Xiaqing Xu, Xiaobo Wang, Yang Qian, Yunxiao Qin, Zezheng Wang, Chenxu Zhao, Feng Zhou, Zhen Lei

Despite the great success achieved by deep learning methods in face recognition, severe performance drops are observed for large pose variations in unconstrained environments (e.g., in cases of surveillance and photo-tagging).

Face Recognition

Deep Learning for Face Anti-Spoofing: A Survey

2 code implementations 28 Jun 2021 Zitong Yu, Yunxiao Qin, Xiaobai Li, Chenxu Zhao, Zhen Lei, Guoying Zhao

Face anti-spoofing (FAS) has lately attracted increasing attention due to its vital role in securing face recognition systems from presentation attacks (PAs).

Domain Generalization Face Anti-Spoofing +1

Represent Items by Items: An Enhanced Representation of the Target Item for Recommendation

no code implementations 26 Apr 2021 Yinjiang Cai, Zeyu Cui, Shu Wu, Zhen Lei, Xibo Ma

Our proposed Co-occurrence based Enhanced Representation model (CER) learns the scoring function by a deep neural network with the attentive user representation and fusion of raw representation and enhanced representation of target item as input.

Collaborative Filtering Recommendation Systems

Searching for Alignment in Face Recognition

no code implementations 10 Feb 2021 Xiaqing Xu, Qiang Meng, Yunxiao Qin, Jianzhu Guo, Chenxu Zhao, Feng Zhou, Zhen Lei

A standard pipeline of current face recognition frameworks consists of four individual steps: locating a face with a rough bounding box and several fiducial landmarks, aligning the face image using a pre-defined template, extracting representations and comparing.

Face Alignment Face Detection +2

Face Forgery Detection by 3D Decomposition

no code implementations CVPR 2021 Xiangyu Zhu, Hao Wang, Hongyan Fei, Zhen Lei, Stan Z. Li

Detecting digital face manipulation has attracted extensive attention due to fake media's potential harms to the public.

Towards Fast, Accurate and Stable 3D Dense Face Alignment

3 code implementations ECCV 2020 Jianzhu Guo, Xiangyu Zhu, Yang Yang, Fan Yang, Zhen Lei, Stan Z. Li

Firstly, on the basis of a lightweight backbone, we propose a meta-joint optimization strategy to dynamically regress a small set of 3DMM parameters, which greatly enhances speed and accuracy simultaneously.

Ranked #1 on 3D Face Reconstruction on Florence (Mean NME metric)

3D Face Modelling 3D Face Reconstruction +2

SADet: Learning An Efficient and Accurate Pedestrian Detector

no code implementations 26 Jul 2020 Chubin Zhuang, Zhen Lei, Stan Z. Li

Although the anchor-based detectors have taken a big step forward in pedestrian detection, the overall performance of the algorithm still needs further improvement for practical applications, e.g., a good trade-off between the accuracy and efficiency.

Human Detection Pedestrian Detection +2

NPCFace: Negative-Positive Collaborative Training for Large-scale Face Recognition

no code implementations 20 Jul 2020 Dan Zeng, Hailin Shi, Hang Du, Jun Wang, Zhen Lei, Tao Mei

However, the correlation between hard positive and hard negative is overlooked, and so is the relation between the margins in positive and negative logits.

Face Recognition

Semi-Siamese Training for Shallow Face Learning

3 code implementations ECCV 2020 Hang Du, Hailin Shi, Yuchi Liu, Jun Wang, Zhen Lei, Dan Zeng, Tao Mei

Extensive experiments on various benchmarks of face recognition show the proposed method significantly improves the training, not only in shallow face learning, but also for conventional deep face data.

Face Recognition

Multi-Modal Face Anti-Spoofing Based on Central Difference Networks

1 code implementation 17 Apr 2020 Zitong Yu, Yunxiao Qin, Xiaobai Li, Zezheng Wang, Chenxu Zhao, Zhen Lei, Guoying Zhao

Face anti-spoofing (FAS) plays a vital role in securing face recognition systems from presentation attacks.

Face Anti-Spoofing Face Recognition

Domain Balancing: Face Recognition on Long-Tailed Domains

no code implementations CVPR 2020 Dong Cao, Xiangyu Zhu, Xingyu Huang, Jianzhu Guo, Zhen Lei

Finally, we propose a Domain Balancing Margin (DBM) in the loss function to further optimize the feature space of the tail domains to improve generalization.

Face Recognition

Learning Meta Face Recognition in Unseen Domains

5 code implementations CVPR 2020 Jianzhu Guo, Xiangyu Zhu, Chenxu Zhao, Dong Cao, Zhen Lei, Stan Z. Li

Face recognition systems are usually faced with unseen domains in real-world applications and show unsatisfactory performance due to their poor generalization.

Face Recognition Meta-Learning

LAMP-HQ: A Large-Scale Multi-Pose High-Quality Database and Benchmark for NIR-VIS Face Recognition

no code implementations 17 Dec 2019 Aijing Yu, Haoxue Wu, Huaibo Huang, Zhen Lei, Ran He

A spectral conditional attention module is introduced to reduce the domain gap between NIR and VIS data and then improve the performance of NIR-VIS heterogeneous face recognition on various databases including the LAMP-HQ.

Attribute Face Recognition +1

Bridging the Gap Between Anchor-based and Anchor-free Detection via Adaptive Training Sample Selection

11 code implementations CVPR 2020 Shifeng Zhang, Cheng Chi, Yongqiang Yao, Zhen Lei, Stan Z. Li

In this paper, we first point out that the essential difference between anchor-based and anchor-free detection is actually how to define positive and negative training samples, which leads to the performance gap between them.

Object object-detection +1

Relational Learning for Joint Head and Human Detection

1 code implementation 24 Sep 2019 Cheng Chi, Shifeng Zhang, Junliang Xing, Zhen Lei, Stan Z. Li, Xudong Zou

Head and human detection have been rapidly improved with the development of deep convolutional neural networks.

Head Detection Human Detection +1

PedHunter: Occlusion Robust Pedestrian Detector in Crowded Scenes

no code implementations 15 Sep 2019 Cheng Chi, Shifeng Zhang, Junliang Xing, Zhen Lei, Stan Z. Li, Xudong Zou

Pedestrian detection in crowded scenes is a challenging problem, because occlusion happens frequently among different pedestrians.

Data Augmentation Occlusion Handling +2

RefineFace: Refinement Neural Network for High Performance Face Detection

no code implementations 10 Sep 2019 Shifeng Zhang, Cheng Chi, Zhen Lei, Stan Z. Li

To improve the classification ability for high recall efficiency, STC first filters out most simple negatives from low level detection layers to reduce search space for subsequent classifier, then SML is applied to better distinguish faces from background at various scales and FSM is introduced to let the backbone learn more discriminative features for classification.

Classification Face Detection +3

Domain Adaptive Person Re-Identification via Camera Style Generation and Label Propagation

no code implementations 14 May 2019 Chuan-Xian Ren, Bo-Hua Liang, Zhen Lei

We derive a camera style adaptation framework to learn the style-based mappings between different camera views, from the target domain to the source domain, and then we can transfer the identity-based distribution from the source domain to the target domain on the camera level.

Domain Adaptive Person Re-Identification Person Re-Identification +1

Weakly Aligned Cross-Modal Learning for Multispectral Pedestrian Detection

no code implementations ICCV 2019 Lu Zhang, Xiangyu Zhu, Xiangyu Chen, Xu Yang, Zhen Lei, Zhi-Yong Liu

In this paper, we propose a novel Aligned Region CNN (AR-CNN) to handle the weakly aligned multispectral data in an end-to-end way.

Position

Improving Face Anti-Spoofing by 3D Virtual Synthesis

2 code implementations 2 Jan 2019 Jianzhu Guo, Xiangyu Zhu, Jinchuan Xiao, Zhen Lei, Genxun Wan, Stan Z. Li

Specifically, we consider a printed photo as a flat surface and mesh it into a 3D object, which is then randomly bent and rotated in 3D space.

Face Anti-Spoofing Face Recognition

Prior-Knowledge and Attention-based Meta-Learning for Few-Shot Learning

no code implementations 11 Dec 2018 Yunxiao Qin, WeiGuo Zhang, Chenxu Zhao, Zezheng Wang, Xiangyu Zhu, Guo-Jun Qi, Jingping Shi, Zhen Lei

In this paper, inspired by the human cognition process which utilizes both prior-knowledge and vision attention in learning new knowledge, we present a novel paradigm of meta-learning approach with three developments to introduce attention mechanism and prior-knowledge for meta-learning.

Few-Shot Learning

Representation based and Attention augmented Meta learning

no code implementations 19 Nov 2018 Yunxiao Qin, Chenxu Zhao, Zezheng Wang, Junliang Xing, Jun Wan, Zhen Lei

The method RAML aims to give the Meta learner the ability to leverage previously learned knowledge to reduce the dimension of the original input data by expressing it as high-level representations, and to help the Meta learner perform well.

Few-Shot Learning

Vehicle Re-identification Using Quadruple Directional Deep Learning Features

no code implementations 13 Nov 2018 Jianqing Zhu, Huanqiang Zeng, Jingchang Huang, Shengcai Liao, Zhen Lei, Canhui Cai, Lixin Zheng

Specifically, in the first stage, the same basic deep learning architecture, a shortly and densely connected convolutional neural network, extracts basic feature maps of an input square vehicle image.

Vehicle Re-Identification

Selective Refinement Network for High Performance Face Detection

3 code implementations 7 Sep 2018 Cheng Chi, Shifeng Zhang, Junliang Xing, Zhen Lei, Stan Z. Li, Xudong Zou

In particular, the SRN consists of two modules: the Selective Two-step Classification (STC) module and the Selective Two-step Regression (STR) module.

Face Detection General Classification +2

Occlusion-aware R-CNN: Detecting Pedestrians in a Crowd

no code implementations ECCV 2018 Shifeng Zhang, Longyin Wen, Xiao Bian, Zhen Lei, Stan Z. Li

Pedestrian detection in crowded scenes is a challenging problem since the pedestrians often gather together and occlude each other.

Ranked #10 on Pedestrian Detection on Caltech (using extra training data)

Pedestrian Detection

Large-scale Bisample Learning on ID Versus Spot Face Recognition

no code implementations 8 Jun 2018 Xiangyu Zhu, Hao Liu, Zhen Lei, Hailin Shi, Fan Yang, Dong Yi, Guo-Jun Qi, Stan Z. Li

In this paper, we propose a deep learning based large-scale bisample learning (LBL) method for IvS face recognition.

Face Recognition General Classification

Face Synthesis for Eyeglass-Robust Face Recognition

1 code implementation 4 Jun 2018 Jianzhu Guo, Xiangyu Zhu, Zhen Lei, Stan Z. Li

A feasible method is to collect large-scale face images with eyeglasses for training deep learning methods.

Face Generation Face Model +2

Face Alignment in Full Pose Range: A 3D Total Solution

2 code implementations 2 Apr 2018 Xiangyu Zhu, Xiaoming Liu, Zhen Lei, Stan Z. Li

In this paper, we propose to tackle these three challenges in a new alignment framework termed 3D Dense Face Alignment (3DDFA), in which a dense 3D Morphable Model (3DMM) is fitted to the image via Cascaded Convolutional Neural Networks.

3D Pose Estimation Depth Image Estimation +3

Single-Shot Refinement Neural Network for Object Detection

12 code implementations CVPR 2018 Shifeng Zhang, Longyin Wen, Xiao Bian, Zhen Lei, Stan Z. Li

For object detection, the two-stage approach (e.g., Faster R-CNN) has been achieving the highest accuracy, whereas the one-stage approach (e.g., SSD) has the advantage of high efficiency.

Object object-detection +1

S3FD: Single Shot Scale-Invariant Face Detector

no code implementations ICCV 2017 Shifeng Zhang, Xiangyu Zhu, Zhen Lei, Hailin Shi, Xiaobo Wang, Stan Z. Li

This paper presents a real-time face detector, named Single Shot Scale-invariant Face Detector (S3FD), which performs superiorly on various scales of faces with a single deep neural network, especially for small faces.

Face Detection

FaceBoxes: A CPU Real-time Face Detector with High Accuracy

10 code implementations 17 Aug 2017 Shifeng Zhang, Xiangyu Zhu, Zhen Lei, Hailin Shi, Xiaobo Wang, Stan Z. Li

The MSCL aims at enriching the receptive fields and discretizing anchors over different layers to handle faces of various scales.

Face Detection Vocal Bursts Intensity Prediction

S$^3$FD: Single Shot Scale-invariant Face Detector

3 code implementations 17 Aug 2017 Shifeng Zhang, Xiangyu Zhu, Zhen Lei, Hailin Shi, Xiaobo Wang, Stan Z. Li

This paper presents a real-time face detector, named Single Shot Scale-invariant Face Detector (S$^3$FD), which performs superiorly on various scales of faces with a single deep neural network, especially for small faces.

Face Detection

Learning Efficient Image Representation for Person Re-Identification

no code implementations 7 Jul 2017 Yang Yang, Shengcai Liao, Zhen Lei, Stan Z. Li

Then, a robust image representation based on color names is obtained by concatenating the statistical descriptors in each stripe.

Person Re-Identification

Deep Hybrid Similarity Learning for Person Re-identification

no code implementations 16 Feb 2017 Jianqing Zhu, Huanqiang Zeng, Shengcai Liao, Zhen Lei, Canhui Cai, Lixin Zheng

In this paper, a deep hybrid similarity learning (DHSL) method for person Re-ID based on a convolution neural network (CNN) is proposed.

Metric Learning Person Re-Identification

Embedding Deep Metric for Person Re-identification: A Study Against Large Variations

no code implementations 1 Nov 2016 Hailin Shi, Yang Yang, Xiangyu Zhu, Shengcai Liao, Zhen Lei, Wei-Shi Zheng, Stan Z. Li

From this point of view, selecting suitable positive (i.e., intra-class) training samples within a local range is critical for training the CNN embedding, especially when the data has large intra-class variations.

Person Re-Identification

CRAFT Objects from Images

1 code implementation CVPR 2016 Bin Yang, Junjie Yan, Zhen Lei, Stan Z. Li

They decompose the object detection problem into two cascaded easier tasks: 1) generating object proposals from images, 2) classifying proposals into various object categories.

Object object-detection +2

Constrained Deep Metric Learning for Person Re-identification

no code implementations 24 Nov 2015 Hailin Shi, Xiangyu Zhu, Shengcai Liao, Zhen Lei, Yang Yang, Stan Z. Li

In this paper, we propose a novel CNN-based method to learn a discriminative metric with good robustness to the over-fitting problem in person re-identification.

Metric Learning Person Re-Identification

Face Alignment Across Large Poses: A 3D Solution

no code implementations CVPR 2016 Xiangyu Zhu, Zhen Lei, Xiaoming Liu, Hailin Shi, Stan Z. Li

Face alignment, which fits a face model to an image and extracts the semantic meanings of facial pixels, has been an important topic in the CV community.

3D Face Reconstruction Face Alignment +2

UA-DETRAC: A New Benchmark and Protocol for Multi-Object Detection and Tracking

no code implementations 13 Nov 2015 Longyin Wen, Dawei Du, Zhaowei Cai, Zhen Lei, Ming-Ching Chang, Honggang Qi, Jongwoo Lim, Ming-Hsuan Yang, Siwei Lyu

In this work, we perform a comprehensive quantitative study of the effects of object detection accuracy on overall MOT performance, using the new large-scale University at Albany DETection and tRACking (UA-DETRAC) benchmark dataset.

Multi-Object Tracking Object +2

JOTS: Joint Online Tracking and Segmentation

no code implementations CVPR 2015 Longyin Wen, Dawei Du, Zhen Lei, Stan Z. Li, Ming-Hsuan Yang

We present a novel Joint Online Tracking and Segmentation (JOTS) algorithm which integrates the multi-part tracking and segmentation into a unified energy optimization framework to handle the video segmentation task.

Segmentation Video Segmentation +1

High-Fidelity Pose and Expression Normalization for Face Recognition in the Wild

no code implementations CVPR 2015 Xiangyu Zhu, Zhen Lei, Junjie Yan, Dong Yi, Stan Z. Li

Pose and expression normalization is a crucial step to recover the canonical view of faces under arbitrary conditions, so as to improve the face recognition performance.

Face Recognition Vocal Bursts Intensity Prediction

Convolutional Channel Features

1 code implementation ICCV 2015 Bin Yang, Junjie Yan, Zhen Lei, Stan Z. Li

With the combination of CNN features and boosting forest, CCF benefits from the richer capacity in feature representation compared with channel features, as well as lower cost in computation and storage compared with end-to-end CNN methods.

Edge Detection Face Detection +2

Learning Face Representation from Scratch

14 code implementations 28 Nov 2014 Dong Yi, Zhen Lei, Shengcai Liao, Stan Z. Li

The current situation in the field of face recognition is that data is more important than algorithm.

Face Recognition

Learn Convolutional Neural Network for Face Anti-Spoofing

3 code implementations 24 Aug 2014 Jianwei Yang, Zhen Lei, Stan Z. Li

Moreover, the nets trained on combined data from two datasets show less bias between the two datasets.

Face Anti-Spoofing

Deep Metric Learning for Practical Person Re-Identification

no code implementations 18 Jul 2014 Dong Yi, Zhen Lei, Stan Z. Li

Compared with existing research, the experiments study a more practical setting: training and testing on different datasets (cross-dataset person re-identification).

Metric Learning Person Re-Identification

Aggregate channel features for multi-view face detection

no code implementations 15 Jul 2014 Bin Yang, Junjie Yan, Zhen Lei, Stan Z. Li

Face detection has drawn much attention in recent decades since the seminal work by Viola and Jones.

Face Detection Re-Ranking

Shared Representation Learning for Heterogeneous Face Recognition

no code implementations 5 Jun 2014 Dong Yi, Zhen Lei, Shengcai Liao, Stan Z. Li

For the NIR-VIS problem, we achieve new state-of-the-art performance on the CASIA HFB and NIR-VIS 2.0 databases.

Face Recognition Heterogeneous Face Recognition +1

The Fastest Deformable Part Model for Object Detection

no code implementations CVPR 2014 Junjie Yan, Zhen Lei, Longyin Wen, Stan Z. Li

Three prohibitive steps in the cascade version of DPM are accelerated: 2D correlation between the root filter and the feature map, cascade part pruning, and HOG feature extraction.

Face Detection Object +2

Robust Multi-resolution Pedestrian Detection in Traffic Scenes

no code implementations CVPR 2013 Junjie Yan, Xucong Zhang, Zhen Lei, Shengcai Liao, Stan Z. Li

The model contains resolution aware transformations to map pedestrians in different resolutions to a common space, where a shared detector is constructed to distinguish pedestrians from background.

Pedestrian Detection

Towards Pose Robust Face Recognition

no code implementations CVPR 2013 Dong Yi, Zhen Lei, Stan Z. Li

In this paper, we propose a novel method for pose robust face recognition towards practical applications, which is fast, pose robust and can work well under unconstrained environments.

Face Recognition Robust Face Recognition

Fast Matching by 2 Lines of Code for Large Scale Face Recognition Systems

no code implementations 28 Feb 2013 Dong Yi, Zhen Lei, Yang Hu, Stan Z. Li

However, this method is generic and not limited to face recognition; it can easily be generalized to other biometrics as a post-processing module.

Computational Efficiency Face Recognition
