Search Results for author: Zhen Lei

Found 125 papers, 42 papers with code

Beyond 3DMM Space: Towards Fine-grained 3D Face Reconstruction

1 code implementation • ECCV 2020 • Xiangyu Zhu, Fan Yang, Di Huang, Chang Yu, Hao Wang, Jianzhu Guo, Zhen Lei, Stan Z. Li

However, most of their training data is constructed with a 3D Morphable Model, whose spanned space covers only a small part of the shape space.

3D Face Reconstruction

104

Exclusivity-Consistency Regularized Knowledge Distillation for Face Recognition

no code implementations • ECCV 2020 • Xiaobo Wang, Tianyu Fu, Shengcai Liao, Shuo Wang, Zhen Lei, Tao Mei

Knowledge distillation is an effective tool to compress large pre-trained Convolutional Neural Networks (CNNs) or their ensembles into models applicable to mobile and embedded devices.

Face Recognition Knowledge Distillation +1

Enhancing Surgical Robots with Embodied Intelligence for Autonomous Ultrasound Scanning

no code implementations • 1 May 2024 • Huan Xu, Jinlin Wu, Guanglin Cao, Zhen Lei, Zhen Chen, Hongbin Liu

Ultrasound robots are increasingly used in medical diagnostics and early disease screening.

AccidentBlip2: Accident Detection With Multi-View MotionBlip2

1 code implementation • 18 Apr 2024 • Yihua Shao, Hongyi Cai, Xinwei Long, Weiyi Lang, Zhe Wang, Haoran Wu, Yan Wang, Jiayi Yin, Yang Yang, Zhen Lei

We also extend our approach to a multi-vehicle cooperative system by deploying Motion Qformer on each vehicle and simultaneously inputting the inference-generated query into the MLP for autoregressive inference.

Language Modelling Large Language Model +2

Towards Multi-agent Reinforcement Learning based Traffic Signal Control through Spatio-temporal Hypergraphs

no code implementations • 17 Apr 2024 • Kang Wang, Zhishu Shen, Zhen Lei, Tiehua Zhang

Traffic signal control systems (TSCSs) are integral to intelligent traffic management, fostering efficient vehicle flow.

Edge-computing Management +1

Second Edition FRCSyn Challenge at CVPR 2024: Face Recognition Challenge in the Era of Synthetic Data

2 code implementations • 16 Apr 2024 • Ivan DeAndres-Tame, Ruben Tolosana, Pietro Melzi, Ruben Vera-Rodriguez, Minchul Kim, Christian Rathgeb, Xiaoming Liu, Aythami Morales, Julian Fierrez, Javier Ortega-Garcia, Zhizhou Zhong, Yuge Huang, Yuxi Mi, Shouhong Ding, Shuigeng Zhou, Shuai He, Lingzhi Fu, Heng Cong, Rongyu Zhang, Zhihong Xiao, Evgeny Smirnov, Anton Pimenov, Aleksei Grigorev, Denis Timoshenko, Kaleb Mesfin Asfaw, Cheng Yaw Low, Hao liu, Chuyi Wang, Qing Zuo, Zhixiang He, Hatef Otroshi Shahreza, Anjith George, Alexander Unnervik, Parsa Rahimi, Sébastien Marcel, Pedro C. Neto, Marco Huber, Jan Niklas Kolf, Naser Damer, Fadi Boutros, Jaime S. Cardoso, Ana F. Sequeira, Andrea Atzori, Gianni Fenu, Mirko Marras, Vitomir Štruc, Jiang Yu, Zhangjie Li, Jichun Li, Weisong Zhao, Zhen Lei, Xiangyu Zhu, Xiao-Yu Zhang, Bernardo Biesseck, Pedro Vidal, Luiz Coelho, Roger Granada, David Menotti

Synthetic data is gaining increasing relevance for training machine learning models.

Benchmarking Face Recognition

218

FusionMamba: Efficient Image Fusion with State Space Model

no code implementations • 11 Apr 2024 • Siran Peng, Xiangyu Zhu, Haoyu Deng, Zhen Lei, Liang-Jian Deng

Image fusion aims to generate a high-resolution multi/hyper-spectral image by combining a high-resolution image with limited spectral information and a low-resolution image with abundant spectral data.

Solving Parametric PDEs with Radial Basis Functions and Deep Neural Networks

no code implementations • 10 Apr 2024 • Guanhang Lei, Zhen Lei, Lei Shi, Chenyu Zeng

We propose the POD-DNN, a novel algorithm leveraging deep neural networks (DNNs) along with radial basis functions (RBFs) in the context of the proper orthogonal decomposition (POD) reduced basis method (RBM), aimed at approximating the parametric mapping of parametric partial differential equations on irregular domains.

Unified Physical-Digital Attack Detection Challenge

no code implementations • 9 Apr 2024 • Haocheng Yuan, Ajian Liu, Junze Zheng, Jun Wan, Jiankang Deng, Sergio Escalera, Hugo Jair Escalante, Isabelle Guyon, Zhen Lei

Based on this dataset, we organized a Unified Physical-Digital Face Attack Detection Challenge to boost the research in Unified Attack Detections.

Face Anti-Spoofing Face Recognition

Generative Active Learning for Image Synthesis Personalization

1 code implementation • 22 Mar 2024 • Xulu Zhang, WengYu Zhang, Xiao-Yong Wei, Jinlin Wu, Zhaoxiang Zhang, Zhen Lei, Qing Li

The primary challenge in conducting active learning on generative models lies in the open-ended nature of querying, which differs from the closed form of querying in discriminative models that typically target a single concept.

Active Learning Image Generation

CFPL-FAS: Class Free Prompt Learning for Generalizable Face Anti-spoofing

no code implementations • 21 Mar 2024 • Ajian Liu, Shuai Xue, Jianwen Gan, Jun Wan, Yanyan Liang, Jiankang Deng, Sergio Escalera, Zhen Lei

Specifically, we propose a novel Class Free Prompt Learning (CFPL) paradigm for DG FAS, which utilizes two lightweight transformers, namely Content Q-Former (CQF) and Style Q-Former (SQF), to learn the different semantic prompts conditioned on content and style features by using a set of learnable query vectors, respectively.

Domain Generalization Face Anti-Spoofing

Factorized Learning Assisted with Large Language Model for Gloss-free Sign Language Translation

no code implementations • 19 Mar 2024 • Zhigang Chen, Benjia Zhou, Jun Li, Jun Wan, Zhen Lei, Ning Jiang, Quan Lu, Guoqing Zhao

Although some approaches work towards gloss-free SLT through jointly training the visual encoder and translation network, these efforts still suffer from poor performance and inefficient use of the powerful Large Language Model (LLM).

Ranked #1 on Gloss-free Sign Language Translation on CSL-Daily

Gloss-free Sign Language Translation Language Modelling +3

DiffSpeaker: Speech-Driven 3D Facial Animation with Diffusion Transformer

1 code implementation • 8 Feb 2024 • Zhiyuan Ma, Xiangyu Zhu, GuoJun Qi, Chen Qian, Zhaoxiang Zhang, Zhen Lei

We suspect this is due to a shortage of paired audio-4D data, which is crucial for the Transformer to effectively perform as a denoiser within the Diffusion framework.

Unified Physical-Digital Face Attack Detection

no code implementations • 31 Jan 2024 • Hao Fang, Ajian Liu, Haocheng Yuan, Junze Zheng, Dingheng Zeng, Yanhong Liu, Jiankang Deng, Sergio Escalera, Xiaoming Liu, Jun Wan, Zhen Lei

These three modules seamlessly form a robust unified attack detection framework.

Face Recognition Face Swapping

Segment Anything in 3D Gaussians

no code implementations • 31 Jan 2024 • Xu Hu, Yuxi Wang, Lue Fan, Junsong Fan, Junran Peng, Zhen Lei, Qing Li, Zhaoxiang Zhang

In this paper, we propose a novel approach to achieve object segmentation in 3D Gaussians via an interactive procedure without any training process or learned parameters.

Segmentation Semantic Segmentation

PVLR: Prompt-driven Visual-Linguistic Representation Learning for Multi-Label Image Recognition

no code implementations • 31 Jan 2024 • Hao Tan, Zichang Tan, Jun Li, Jun Wan, Zhen Lei

In contrast to the unidirectional fusion in previous works, we introduce a Dual-Modal Attention (DMA) that enables bidirectional interaction between textual and visual features, yielding context-aware label representations and semantic-related visual representations, which are subsequently used to calculate similarities and generate final predictions for all labels.

Representation Learning

Modeling Spoof Noise by De-spoofing Diffusion and its Application in Face Anti-spoofing

no code implementations • 16 Jan 2024 • Bin Zhang, Xiangyu Zhu, XiaoYu Zhang, Zhen Lei

Face anti-spoofing is crucial for ensuring the security and reliability of face recognition systems.

Face Anti-Spoofing Face Recognition

Seek for Incantations: Towards Accurate Text-to-Image Diffusion Synthesis through Prompt Engineering

no code implementations • 12 Jan 2024 • Chang Yu, Junran Peng, Xiangyu Zhu, Zhaoxiang Zhang, Qi Tian, Zhen Lei

The text-to-image synthesis by diffusion models has recently shown remarkable performance in generating high-quality images.

Image Generation Prompt Engineering

Compositional Inversion for Stable Diffusion Models

1 code implementation • 13 Dec 2023 • Xulu Zhang, Xiao-Yong Wei, Jinlin Wu, Tianyi Zhang, Zhaoxiang Zhang, Zhen Lei, Qing Li

It stems from the fact that during inversion, the irrelevant semantics in the user images are also encoded, forcing the inverted concepts to occupy locations far from the core distribution in the embedding space.

Compound Text-Guided Prompt Tuning via Image-Adaptive Cues

1 code implementation • 11 Dec 2023 • Hao Tan, Jun Li, Yizhuang Zhou, Jun Wan, Zhen Lei, Xiangyu Zhang

We introduce text supervision to the optimization of prompts, which enables two benefits: 1) releasing the model's reliance on the pre-defined category names during inference, thereby enabling more flexible prompt generation; 2) reducing the number of inputs to the text encoder, which decreases GPU memory consumption significantly.

Domain Generalization

GPT4SGG: Synthesizing Scene Graphs from Holistic and Region-specific Narratives

no code implementations • 7 Dec 2023 • Zuyao Chen, Jinlin Wu, Zhen Lei, Zhaoxiang Zhang, Changwen Chen

Learning scene graphs from natural language descriptions has proven to be a cheap and promising scheme for Scene Graph Generation (SGG).

Graph Generation Scene Graph Generation +1

3D Face Reconstruction with the Geometric Guidance of Facial Part Segmentation

1 code implementation • 1 Dec 2023 • Zidu Wang, Xiangyu Zhu, Tianshuo Zhang, Baiqin Wang, Zhen Lei

In this paper, we fully utilize the facial part segmentation geometry by introducing Part Re-projection Distance Loss (PRDL).

3D Face Reconstruction Segmentation

Expanding Scene Graph Boundaries: Fully Open-vocabulary Scene Graph Generation via Visual-Concept Alignment and Retention

no code implementations • 18 Nov 2023 • Zuyao Chen, Jinlin Wu, Zhen Lei, Zhaoxiang Zhang, Changwen Chen

For the more challenging settings of relation-involved open vocabulary SGG, the proposed approach integrates relation-aware pre-training utilizing image-caption data and retains visual-concept alignment through knowledge distillation.

Concept Alignment Graph Generation +6

FRCSyn Challenge at WACV 2024: Face Recognition Challenge in the Era of Synthetic Data

1 code implementation • 17 Nov 2023 • Pietro Melzi, Ruben Tolosana, Ruben Vera-Rodriguez, Minchul Kim, Christian Rathgeb, Xiaoming Liu, Ivan DeAndres-Tame, Aythami Morales, Julian Fierrez, Javier Ortega-Garcia, Weisong Zhao, Xiangyu Zhu, Zheyu Yan, Xiao-Yu Zhang, Jinlin Wu, Zhen Lei, Suvidha Tripathi, Mahak Kothari, Md Haider Zama, Debayan Deb, Bernardo Biesseck, Pedro Vidal, Roger Granada, Guilherme Fickel, Gustavo Führ, David Menotti, Alexander Unnervik, Anjith George, Christophe Ecabert, Hatef Otroshi Shahreza, Parsa Rahimi, Sébastien Marcel, Ioannis Sarridis, Christos Koutlis, Georgia Baltsou, Symeon Papadopoulos, Christos Diou, Nicolò Di Domenico, Guido Borghi, Lorenzo Pellegrini, Enrique Mas-Candela, Ángela Sánchez-Pérez, Andrea Atzori, Fadi Boutros, Naser Damer, Gianni Fenu, Mirko Marras

Despite the widespread adoption of face recognition technology around the world, and its remarkable performance on current benchmarks, there are still several challenges that must be covered in more detail.

Face Recognition

PWISeg: Point-based Weakly-supervised Instance Segmentation for Surgical Instruments

1 code implementation • 16 Nov 2023 • Zhen Sun, Huan Xu, Jinlin Wu, Zhen Chen, Zhen Lei, Hongbin Liu

To address this issue, we propose a novel yet effective weakly-supervised surgical instrument instance segmentation approach, named Point-based Weakly-supervised Instance Segmentation (PWISeg).

Instance Segmentation Segmentation +4

SurgPLAN: Surgical Phase Localization Network for Phase Recognition

no code implementations • 16 Nov 2023 • Xingjian Luo, You Pang, Zhen Chen, Jinlin Wu, Zongmin Zhang, Zhen Lei, Hongbin Liu

To address these two challenges, we propose a Surgical Phase LocAlization Network, named SurgPLAN, to facilitate a more accurate and stable surgical phase recognition with the principle of temporal detection.

Surgical phase recognition

Visual Commonsense based Heterogeneous Graph Contrastive Learning

no code implementations • 11 Nov 2023 • Zongzhao Li, Xiangyu Zhu, Xi Zhang, Zhaoxiang Zhang, Zhen Lei

Specifically, our model contains two key components: the Commonsense-based Contrastive Learning and the Graph Relation Network.

Contrastive Learning Question Answering +4

Solving PDEs on Spheres with Physics-Informed Convolutional Neural Networks

no code implementations • 18 Aug 2023 • Guanhang Lei, Zhen Lei, Lei Shi, Chenyu Zeng, Ding-Xuan Zhou

In this paper, we establish a rigorous analysis of the physics-informed convolutional neural network (PICNN) for solving PDEs on the sphere.

DiffusePast: Diffusion-based Generative Replay for Class Incremental Semantic Segmentation

no code implementations • 2 Aug 2023 • Jingfan Chen, Yuxi Wang, Pengfei Wang, Xiao Chen, Zhaoxiang Zhang, Zhen Lei, Qing Li

The Class Incremental Semantic Segmentation (CISS) extends the traditional segmentation task by incrementally learning newly added classes.

Class-Incremental Semantic Segmentation Segmentation

Gloss-free Sign Language Translation: Improving from Visual-Language Pretraining

1 code implementation • ICCV 2023 • Benjia Zhou, Zhigang Chen, Albert Clapés, Jun Wan, Yanyan Liang, Sergio Escalera, Zhen Lei, Du Zhang

Many previous methods employ an intermediate representation, i.e., gloss sequences, to facilitate SLT, thus transforming it into a two-stage task of sign language recognition (SLR) followed by sign language translation (SLT).

Ranked #2 on Gloss-free Sign Language Translation on PHOENIX14T

Gloss-free Sign Language Translation Self-Supervised Learning +3

General vs. Long-Tailed Age Estimation: An Approach to Kill Two Birds with One Stone

no code implementations • 19 Jul 2023 • Zenghao Bao, Zichang Tan, Jun Li, Jun Wan, Xibo Ma, Zhen Lei

Driven by this, some works suggest that each class should be treated equally to improve performance in tail classes (with a minority of samples), which can be summarized as Long-tailed Age Estimation.

Age Estimation MORPH

NCL++: Nested Collaborative Learning for Long-Tailed Visual Recognition

no code implementations • 29 Jun 2023 • Zichang Tan, Jun Li, Jinhao Du, Jun Wan, Zhen Lei, Guodong Guo

To achieve collaborative learning in long-tailed learning, balanced online distillation is proposed to enforce consistent predictions among different experts and augmented copies, which reduces learning uncertainties.

Cross Architecture Distillation for Face Recognition

no code implementations • 26 Jun 2023 • Weisong Zhao, Xiangyu Zhu, Zhixiang He, Xiao-Yu Zhang, Zhen Lei

Transformers have emerged as the superior choice for face recognition tasks, but their insufficient platform acceleration hinders their application on mobile devices.

Face Recognition Knowledge Distillation

FM-ViT: Flexible Modal Vision Transformers for Face Anti-Spoofing

no code implementations • 5 May 2023 • Ajian Liu, Zichang Tan, Zitong Yu, Chenxu Zhao, Jun Wan, Yanyan Liang, Zhen Lei, Du Zhang, Stan Z. Li, Guodong Guo

The availability of handy multi-modal (i.e., RGB-D) sensors has brought about a surge of face anti-spoofing research.

Face Anti-Spoofing Face Presentation Attack Detection

Surveillance Face Presentation Attack Detection Challenge

no code implementations • 15 Apr 2023 • Hao Fang, Ajian Liu, Jun Wan, Sergio Escalera, Hugo Jair Escalante, Zhen Lei

Based on this dataset and Protocol 3 for evaluating the robustness of the algorithm under quality changes, we organized a face presentation attack detection challenge in surveillance scenarios.

Face Anti-Spoofing Face Presentation Attack Detection +1

Wild Face Anti-Spoofing Challenge 2023: Benchmark and Results

1 code implementation • 12 Apr 2023 • Dong Wang, Jia Guo, Qiqi Shao, Haochi He, Zhian Chen, Chuanbao Xiao, Ajian Liu, Sergio Escalera, Hugo Jair Escalante, Zhen Lei, Jun Wan, Jiankang Deng

Leveraging the WFAS dataset and Protocol 1 (Known-Type), we host the Wild Face Anti-Spoofing Challenge at the CVPR 2023 workshop.

Face Anti-Spoofing Face Recognition

21,318

Grouped Knowledge Distillation for Deep Face Recognition

no code implementations • 10 Apr 2023 • Weisong Zhao, Xiangyu Zhu, Kaiwen Guo, Xiao-Yu Zhang, Zhen Lei

Therefore, we seek to probe the target logits to extract the primary knowledge related to face identity, and discard the others, to make the distillation more achievable for the student network.

Face Recognition Knowledge Distillation

High-Fidelity Clothed Avatar Reconstruction from a Single Image

1 code implementation • CVPR 2023 • Tingting Liao, Xiaomei Zhang, Yuliang Xiu, Hongwei Yi, Xudong Liu, Guo-Jun Qi, Yong Zhang, Xuan Wang, Xiangyu Zhu, Zhen Lei

This paper presents a framework for efficient 3D clothed avatar reconstruction.

Vocal Bursts Intensity Prediction

101

OTAvatar: One-shot Talking Face Avatar with Controllable Tri-plane Rendering

1 code implementation • CVPR 2023 • Zhiyuan Ma, Xiangyu Zhu, GuoJun Qi, Zhen Lei, Lei Zhang

In this paper, we propose One-shot Talking face Avatar (OTAvatar), which constructs face avatars by a generalized controllable tri-plane rendering solution so that each personalized avatar can be constructed from only one portrait as the reference.

285

Graphics Capsule: Learning Hierarchical 3D Face Representations from 2D Images

no code implementations • CVPR 2023 • Chang Yu, Xiangyu Zhu, Xiaomei Zhang, Zhaoxiang Zhang, Zhen Lei

The function of constructing the hierarchy of objects is important to the visual process of the human brain.

Face Recognition

Sharpness-Aware Gradient Matching for Domain Generalization

1 code implementation • CVPR 2023 • Pengfei Wang, Zhaoxiang Zhang, Zhen Lei, Lei Zhang

In this paper, we present two conditions to ensure that the model could converge to a flat minimum with a small loss, and present an algorithm, named Sharpness-Aware Gradient Matching (SAGM), to meet the two conditions for improving model generalization capability.

Domain Generalization

Intrinsic Physical Concepts Discovery with Object-Centric Predictive Models

no code implementations • CVPR 2023 • Qu Tang, Xiangyu Zhu, Zhen Lei, Zhaoxiang Zhang

The ability to discover abstract physical concepts and understand how they work in the world through observing lies at the core of human intelligence.

Self-similarity Driven Scale-invariant Learning for Weakly Supervised Person Search

no code implementations • ICCV 2023 • Benzhi Wang, Yang Yang, Jinlin Wu, Guo-Jun Qi, Zhen Lei

On the other hand, the similarity of cross-scale images is often smaller than that of images with the same scale for a person, which will increase the difficulty of matching.

Person Search

Deep Learning for Human Parsing: A Survey

no code implementations • 29 Jan 2023 • Xiaomei Zhang, Xiangyu Zhu, Ming Tang, Zhen Lei

Human parsing is a key topic in image processing with many applications, such as surveillance analysis, human-robot interaction, person search, and clothing category classification, among many others.

Human Parsing Person Search

Surveillance Face Anti-spoofing

no code implementations • 3 Jan 2023 • Hao Fang, Ajian Liu, Jun Wan, Sergio Escalera, Chenxu Zhao, Xu Zhang, Stan Z. Li, Zhen Lei

In order to promote relevant research and fill this gap in the community, we collect a large-scale Surveillance High-Fidelity Mask (SuHiFiMask) dataset captured under 40 surveillance scenes, which has 101 subjects from different age groups with 232 3D attacks (high-fidelity masks), 200 2D attacks (posters, portraits, and screens), and 2 adversarial attacks.

Contrastive Learning Face Anti-Spoofing +2

Face Presentation Attack Detection

no code implementations • 7 Dec 2022 • Zitong Yu, Chenxu Zhao, Zhen Lei

Face recognition technology has been widely used in daily interactive applications such as checking-in and mobile payment due to its convenience and high accuracy.

Face Anti-Spoofing Face Presentation Attack Detection +1

DFGC 2022: The Second DeepFake Game Competition

1 code implementation • 30 Jun 2022 • Bo Peng, Wei Xiang, Yue Jiang, Wei Wang, Jing Dong, Zhenan Sun, Zhen Lei, Siwei Lyu

There is a two-party game between DeepFake creators and defenders.

Benchmarking Face Swapping

Towards 3D Face Reconstruction in Perspective Projection: Estimating 6DoF Face Pose from Monocular Image

1 code implementation • 9 May 2022 • Yueying Kao, Bowen Pan, Miao Xu, Jiangjing Lyu, Xiangyu Zhu, Yuanzhang Chang, Xiaobo Li, Zhen Lei

In 3D face reconstruction, orthogonal projection has been widely employed to substitute perspective projection to simplify the fitting process.

3D Face Reconstruction

MVP-Human Dataset for 3D Human Avatar Reconstruction from Unconstrained Frames

1 code implementation • 24 Apr 2022 • Xiangyu Zhu, Tingting Liao, Jiangjing Lyu, Xiang Yan, Yunfeng Wang, Kan Guo, Qiong Cao, Stan Z. Li, Zhen Lei

In this paper, we consider a novel problem of reconstructing a 3D human avatar from multiple unconstrained frames, independent of assumptions on camera calibration, capture space, and constrained actions.

Camera Calibration

Weakly Aligned Feature Fusion for Multimodal Object Detection

no code implementations • 21 Apr 2022 • Lu Zhang, Zhiyong Liu, Xiangyu Zhu, Zhan Song, Xu Yang, Zhen Lei, Hong Qiao

In this article, we propose a general multimodal detector named aligned region CNN (AR-CNN) to tackle the position shift problem.

Object object-detection +2

Beyond 3DMM: Learning to Capture High-fidelity 3D Face Shape

no code implementations • 9 Apr 2022 • Xiangyu Zhu, Chang Yu, Di Huang, Zhen Lei, Hao Wang, Stan Z. Li

3D Morphable Model (3DMM) fitting has widely benefited face analysis due to its strong 3D prior.

Vocal Bursts Intensity Prediction

Nested Collaborative Learning for Long-Tailed Visual Recognition

1 code implementation • CVPR 2022 • Jun Li, Zichang Tan, Jun Wan, Zhen Lei, Guodong Guo

NCL consists of two core components, namely Nested Individual Learning (NIL) and Nested Balanced Online Distillation (NBOD), which focus on the individual supervised learning for each single expert and the knowledge transferring among multiple experts, respectively.

Ranked #6 on Long-tail Learning on CIFAR-10-LT (ρ=50)

Image Classification Long-tail Learning

HP-Capsule: Unsupervised Face Part Discovery by Hierarchical Parsing Capsule Network

no code implementations • CVPR 2022 • Chang Yu, Xiangyu Zhu, Xiaomei Zhang, Zidu Wang, Zhaoxiang Zhang, Zhen Lei

Capsule networks are designed to present the objects by a set of parts and their relationships, which provide an insight into the procedure of visual perception.

Solving parametric partial differential equations with deep rectified quadratic unit neural networks

no code implementations • 14 Mar 2022 • Zhen Lei, Lei Shi, Chenyu Zeng

In this study, we investigate the expressive power of deep rectified quadratic unit (ReQU) neural networks for approximating the solution maps of parametric PDEs.

VLAD-VSA: Cross-Domain Face Presentation Attack Detection with Vocabulary Separation and Adaptation

1 code implementation • 21 Feb 2022 • Jiong Wang, Zhou Zhao, Weike Jin, Xinyu Duan, Zhen Lei, Baoxing Huai, Yiling Wu, Xiaofei He

In this paper, the VLAD aggregation method is adopted to quantize local features with a visual vocabulary that locally partitions the feature space, and hence preserve local discriminability.

Face Presentation Attack Detection

Multi-initialization Optimization Network for Accurate 3D Human Pose and Shape Estimation

no code implementations • 24 Dec 2021 • Zhiwei Liu, Xiangyu Zhu, Lu Yang, Xiang Yan, Ming Tang, Zhen Lei, Guibo Zhu, Xuetao Feng, Yan Wang, Jinqiao Wang

In the second stage, we design a mesh refinement transformer (MRT) to respectively refine each coarse reconstruction result via a self-attention mechanism.

Ranked #64 on 3D Human Pose Estimation on 3DPW (MPJPE metric)

3D human pose and shape estimation 3D Reconstruction

Decoupling and Recoupling Spatiotemporal Representation for RGB-D-based Motion Recognition

1 code implementation • CVPR 2022 • Benjia Zhou, Pichao Wang, Jun Wan, Yanyan Liang, Fan Wang, Du Zhang, Zhen Lei, Hao Li, Rong Jin

Decoupling spatiotemporal representation refers to decomposing the spatial and temporal features into dimension-independent factors.

Ranked #1 on Hand Gesture Recognition on NVGesture

Hand Gesture Recognition

Consistency Regularization for Deep Face Anti-Spoofing

1 code implementation • 24 Nov 2021 • Zezheng Wang, Zitong Yu, Xun Wang, Yunxiao Qin, Jiahong Li, Chenxu Zhao, Zhen Lei, Xin Liu, Size Li, Zhongyuan Wang

Face anti-spoofing (FAS) plays a crucial role in securing face recognition systems.

Face Anti-Spoofing Face Recognition

Meta-Teacher For Face Anti-Spoofing

no code implementations • 12 Nov 2021 • Yunxiao Qin, Zitong Yu, Longbin Yan, Zezheng Wang, Chenxu Zhao, Zhen Lei

The meta-teacher is trained in a bi-level optimization manner to learn the ability to supervise PA detectors in learning rich spoofing cues.

Face Anti-Spoofing Face Recognition

LAE: Long-tailed Age Estimation

no code implementations • 25 Oct 2021 • Zenghao Bao, Zichang Tan, Yu Zhu, Jun Wan, Xibo Ma, Zhen Lei, Guodong Guo

To improve the performance of facial age estimation, we first formulate a simple standard baseline and build a much stronger one by collecting the tricks in pre-training, data augmentation, model architecture, and so on.

Age Estimation Data Augmentation +1

Object Dynamics Distillation for Scene Decomposition and Representation

no code implementations • ICLR 2022 • Qu Tang, Xiangyu Zhu, Zhen Lei, Zhaoxiang Zhang

In this paper, we work on object dynamics and propose Object Dynamics Distillation Network (ODDN), a framework that distills explicit object dynamics (e.g., velocity) from sequential static representations.

Object Predict Future Video Frames +1

3D High-Fidelity Mask Face Presentation Attack Detection Challenge

no code implementations • 16 Aug 2021 • Ajian Liu, Chenxu Zhao, Zitong Yu, Anyang Su, Xing Liu, Zijian Kong, Jun Wan, Sergio Escalera, Hugo Jair Escalante, Zhen Lei, Guodong Guo

The threat of 3D masks to face recognition systems is increasingly serious and has drawn wide concern from researchers.

Face Presentation Attack Detection Face Recognition +1

PoseFace: Pose-Invariant Features and Pose-Adaptive Loss for Face Recognition

no code implementations • 25 Jul 2021 • Qiang Meng, Xiaqing Xu, Xiaobo Wang, Yang Qian, Yunxiao Qin, Zezheng Wang, Chenxu Zhao, Feng Zhou, Zhen Lei

Despite the great success achieved by deep learning methods in face recognition, severe performance drops are observed for large pose variations in unconstrained environments (e.g., in cases of surveillance and photo-tagging).

Face Recognition

Deep Learning for Face Anti-Spoofing: A Survey

3 code implementations • 28 Jun 2021 • Zitong Yu, Yunxiao Qin, Xiaobai Li, Chenxu Zhao, Zhen Lei, Guoying Zhao

Face anti-spoofing (FAS) has lately attracted increasing attention due to its vital role in securing face recognition systems from presentation attacks (PAs).

Domain Generalization Face Anti-Spoofing +1

483

Represent Items by Items: An Enhanced Representation of the Target Item for Recommendation

no code implementations • 26 Apr 2021 • Yinjiang Cai, Zeyu Cui, Shu Wu, Zhen Lei, Xibo Ma

Our proposed Co-occurrence based Enhanced Representation model (CER) learns the scoring function by a deep neural network with the attentive user representation and fusion of raw representation and enhanced representation of target item as input.

Collaborative Filtering Recommendation Systems

Contrastive Context-Aware Learning for 3D High-Fidelity Mask Face Presentation Attack Detection

no code implementations • 13 Apr 2021 • Ajian Liu, Chenxu Zhao, Zitong Yu, Jun Wan, Anyang Su, Xing Liu, Zichang Tan, Sergio Escalera, Junliang Xing, Yanyan Liang, Guodong Guo, Zhen Lei, Stan Z. Li, Du Zhang

To bridge the gap to real-world applications, we introduce a large-scale High-Fidelity Mask dataset, namely CASIA-SURF HiFiMask (briefly HiFiMask).

Face Presentation Attack Detection Face Recognition

Searching for Alignment in Face Recognition

no code implementations • 10 Feb 2021 • Xiaqing Xu, Qiang Meng, Yunxiao Qin, Jianzhu Guo, Chenxu Zhao, Feng Zhou, Zhen Lei

A standard pipeline of current face recognition frameworks consists of four individual steps: locating a face with a rough bounding box and several fiducial landmarks, aligning the face image using a pre-defined template, extracting representations, and comparing them.

Face Alignment Face Detection +2

Face Forgery Detection by 3D Decomposition

no code implementations • CVPR 2021 • Xiangyu Zhu, Hao Wang, Hongyan Fei, Zhen Lei, Stan Z. Li

Detecting digital face manipulation has attracted extensive attention due to fake media's potential harms to the public.

Towards Fast, Accurate and Stable 3D Dense Face Alignment

3 code implementations • ECCV 2020 • Jianzhu Guo, Xiangyu Zhu, Yang Yang, Fan Yang, Zhen Lei, Stan Z. Li

Firstly, on the basis of a lightweight backbone, we propose a meta-joint optimization strategy to dynamically regress a small set of 3DMM parameters, which greatly enhances speed and accuracy simultaneously.

Ranked #1 on 3D Face Reconstruction on Florence (Mean NME metric)

3D Face Modelling 3D Face Reconstruction +2

3,558

SADet: Learning An Efficient and Accurate Pedestrian Detector

no code implementations • 26 Jul 2020 • Chubin Zhuang, Zhen Lei, Stan Z. Li

Although anchor-based detectors have taken a big step forward in pedestrian detection, the overall performance of the algorithms still needs further improvement for practical applications, e.g., a good trade-off between accuracy and efficiency.

Human Detection Pedestrian Detection +2

NPCFace: Negative-Positive Collaborative Training for Large-scale Face Recognition

no code implementations • 20 Jul 2020 • Dan Zeng, Hailin Shi, Hang Du, Jun Wang, Zhen Lei, Tao Mei

However, the correlation between hard positive and hard negative is overlooked, and so is the relation between the margins in positive and negative logits.

Face Recognition

Semi-Siamese Training for Shallow Face Learning

3 code implementations • ECCV 2020 • Hang Du, Hailin Shi, Yuchi Liu, Jun Wang, Zhen Lei, Dan Zeng, Tao Mei

Extensive experiments on various benchmarks of face recognition show the proposed method significantly improves the training, not only in shallow face learning, but also for conventional deep face data.

Face Recognition

Multi-Modal Face Anti-Spoofing Based on Central Difference Networks

1 code implementation • 17 Apr 2020 • Zitong Yu, Yunxiao Qin, Xiaobai Li, Zezheng Wang, Chenxu Zhao, Zhen Lei, Guoying Zhao

Face anti-spoofing (FAS) plays a vital role in securing face recognition systems from presentation attacks.

Face Anti-Spoofing Face Recognition

544

Paper
Code

Domain Balancing: Face Recognition on Long-Tailed Domains

no code implementations • CVPR 2020 • Dong Cao, Xiangyu Zhu, Xingyu Huang, Jianzhu Guo, Zhen Lei

Finally, we propose a Domain Balancing Margin (DBM) in the loss function to further optimize the feature space of the tail domains to improve generalization.

Face Recognition

Paper
Add Code

Deep Spatial Gradient and Temporal Depth Learning for Face Anti-spoofing

6 code implementations • CVPR 2020 • Zezheng Wang, Zitong Yu, Chenxu Zhao, Xiangyu Zhu, Yunxiao Qin, Qiusheng Zhou, Feng Zhou, Zhen Lei

Depth supervised learning has been proven as one of the most effective methods for face anti-spoofing.

Face Anti-Spoofing Face Recognition

224

Paper
Code

Learning Meta Face Recognition in Unseen Domains

7 code implementations • CVPR 2020 • Jianzhu Guo, Xiangyu Zhu, Chenxu Zhao, Dong Cao, Zhen Lei, Stan Z. Li

Face recognition systems are usually faced with unseen domains in real-world applications and show unsatisfactory performance due to their poor generalization.

Face Recognition Meta-Learning

218

Paper
Code

LAMP-HQ: A Large-Scale Multi-Pose High-Quality Database and Benchmark for NIR-VIS Face Recognition

no code implementations • 17 Dec 2019 • Aijing Yu, Haoxue Wu, Huaibo Huang, Zhen Lei, Ran He

A spectral conditional attention module is introduced to reduce the domain gap between NIR and VIS data and then improve the performance of NIR-VIS heterogeneous face recognition on various databases including the LAMP-HQ.

Attribute Face Recognition +1

Paper
Add Code

Bridging the Gap Between Anchor-based and Anchor-free Detection via Adaptive Training Sample Selection

11 code implementations • CVPR 2020 • Shifeng Zhang, Cheng Chi, Yongqiang Yao, Zhen Lei, Stan Z. Li

In this paper, we first point out that the essential difference between anchor-based and anchor-free detection is actually how to define positive and negative training samples, which leads to the performance gap between them.

Ranked #37 on Object Detection on COCO-O

Object object-detection +1

27,852

Paper
Code

Relational Learning for Joint Head and Human Detection

1 code implementation • 24 Sep 2019 • Cheng Chi, Shifeng Zhang, Junliang Xing, Zhen Lei, Stan Z. Li, Xudong Zou

Head and human detection have been rapidly improved with the development of deep convolutional neural networks.

Head Detection Human Detection +1

Paper
Code

PedHunter: Occlusion Robust Pedestrian Detector in Crowded Scenes

no code implementations • 15 Sep 2019 • Cheng Chi, Shifeng Zhang, Junliang Xing, Zhen Lei, Stan Z. Li, Xudong Zou

Pedestrian detection in crowded scenes is a challenging problem, because occlusion happens frequently among different pedestrians.

Data Augmentation Occlusion Handling +2

Paper
Add Code

RefineFace: Refinement Neural Network for High Performance Face Detection

no code implementations • 10 Sep 2019 • Shifeng Zhang, Cheng Chi, Zhen Lei, Stan Z. Li

To improve the classification ability for high recall efficiency, STC first filters out most simple negatives from low level detection layers to reduce search space for subsequent classifier, then SML is applied to better distinguish faces from background at various scales and FSM is introduced to let the backbone learn more discriminative features for classification.

Classification Face Detection +3

Paper
Add Code

Domain Adaptive Person Re-Identification via Camera Style Generation and Label Propagation

no code implementations • 14 May 2019 • Chuan-Xian Ren, Bo-Hua Liang, Zhen Lei

We derive a camera style adaptation framework to learn the style-based mappings between different camera views, from the target domain to the source domain, and then we can transfer the identity-based distribution from the source domain to the target domain on the camera level.

Domain Adaptive Person Re-Identification Person Re-Identification +1

Paper
Add Code

Learning Meta Model for Zero- and Few-shot Face Anti-spoofing

no code implementations • 29 Apr 2019 • Yunxiao Qin, Chenxu Zhao, Xiangyu Zhu, Zezheng Wang, Zitong Yu, Tianyu Fu, Feng Zhou, Jingping Shi, Zhen Lei

Therefore, we define face anti-spoofing as a zero- and few-shot learning problem.

Face Anti-Spoofing Face Recognition +1

Paper
Add Code

Semantic Alignment: Finding Semantically Consistent Ground-truth for Facial Landmark Detection

no code implementations • CVPR 2019 • Zhiwei Liu, Xiangyu Zhu, Guosheng Hu, Haiyun Guo, Ming Tang, Zhen Lei, Neil M. Robertson, Jinqiao Wang

Despite this, we notice that the semantic ambiguity greatly degrades the detection performance.

Ranked #1 on Face Alignment on 300W (NME_inter-pupil (%, Full) metric)

Face Alignment Facial Landmark Detection

Paper
Add Code

WIDER Face and Pedestrian Challenge 2018: Methods and Results

no code implementations • 19 Feb 2019 • Chen Change Loy, Dahua Lin, Wanli Ouyang, Yuanjun Xiong, Shuo Yang, Qingqiu Huang, Dongzhan Zhou, Wei Xia, Quanquan Li, Ping Luo, Junjie Yan, Jian-Feng Wang, Zuoxin Li, Ye Yuan, Boxun Li, Shuai Shao, Gang Yu, Fangyun Wei, Xiang Ming, Dong Chen, Shifeng Zhang, Cheng Chi, Zhen Lei, Stan Z. Li, Hongkai Zhang, Bingpeng Ma, Hong Chang, Shiguang Shan, Xilin Chen, Wu Liu, Boyan Zhou, Huaxiong Li, Peng Cheng, Tao Mei, Artem Kukharenko, Artem Vasenin, Nikolay Sergievskiy, Hua Yang, Liangqi Li, Qiling Xu, Yuan Hong, Lin Chen, Mingjun Sun, Yirong Mao, Shiying Luo, Yongjun Li, Ruiping Wang, Qiaokang Xie, Ziyang Wu, Lei Lu, Yiheng Liu, Wengang Zhou

This paper presents a review of the 2018 WIDER Challenge on Face and Pedestrian.

Face Detection Pedestrian Detection +2

Paper
Add Code

Weakly Aligned Cross-Modal Learning for Multispectral Pedestrian Detection

no code implementations • ICCV 2019 • Lu Zhang, Xiangyu Zhu, Xiangyu Chen, Xu Yang, Zhen Lei, Zhi-Yong Liu

In this paper, we propose a novel Aligned Region CNN (AR-CNN) to handle the weakly aligned multispectral data in an end-to-end way.

Position

Paper
Add Code

Improving Face Anti-Spoofing by 3D Virtual Synthesis

3 code implementations • 2 Jan 2019 • Jianzhu Guo, Xiangyu Zhu, Jinchuan Xiao, Zhen Lei, Genxun Wan, Stan Z. Li

Specifically, we consider a printed photo as a flat surface and mesh it into a 3D object, which is then randomly bent and rotated in 3D space.

Ranked #1 on Face Anti-Spoofing on CASIA-MFSD

Face Anti-Spoofing Face Recognition

1,539

Paper
Code

Recurrent Calibration Network for Irregular Text Recognition

no code implementations • 18 Dec 2018 • Yunze Gao, Yingying Chen, Jinqiao Wang, Zhen Lei, Xiao-Yu Zhang, Hanqing Lu

In this paper, we propose a novel Recurrent Calibration Network (RCN) for irregular scene text recognition.

Irregular Text Recognition Scene Text Recognition

Paper
Add Code

Prior-Knowledge and Attention-based Meta-Learning for Few-Shot Learning

no code implementations • 11 Dec 2018 • Yunxiao Qin, WeiGuo Zhang, Chenxu Zhao, Zezheng Wang, Xiangyu Zhu, Guo-Jun Qi, Jingping Shi, Zhen Lei

In this paper, inspired by the human cognition process which utilizes both prior-knowledge and vision attention in learning new knowledge, we present a novel paradigm of meta-learning approach with three developments to introduce attention mechanism and prior-knowledge for meta-learning.

Few-Shot Learning

Paper
Add Code

Representation based and Attention augmented Meta learning

no code implementations • 19 Nov 2018 • Yunxiao Qin, Chenxu Zhao, Zezheng Wang, Junliang Xing, Jun Wan, Zhen Lei

The method RAML aims to give the Meta learner the ability of leveraging the past learned knowledge to reduce the dimension of the original input data by expressing it into high representations, and help the Meta learner to perform well.

Few-Shot Learning

Paper
Add Code

Vehicle Re-identification Using Quadruple Directional Deep Learning Features

no code implementations • 13 Nov 2018 • Jianqing Zhu, Huanqiang Zeng, Jingchang Huang, Shengcai Liao, Zhen Lei, Canhui Cai, Lixin Zheng

Specifically, the same basic deep learning architecture is a shortly and densely connected convolutional neural network to extract basic feature maps of an input square vehicle image in the first stage.

Ranked #3 on Vehicle Re-Identification on VehicleID Large (mAP metric)

Vehicle Re-Identification

Paper
Add Code

Exploiting temporal and depth information for multi-frame face anti-spoofing

1 code implementation • 13 Nov 2018 • Zezheng Wang, Chenxu Zhao, Yunxiao Qin, Qiusheng Zhou, Guo-Jun Qi, Jun Wan, Zhen Lei

Face anti-spoofing is significant to the security of face recognition systems.

Face Anti-Spoofing Face Recognition +1

Paper
Code

Selective Refinement Network for High Performance Face Detection

3 code implementations • 7 Sep 2018 • Cheng Chi, Shifeng Zhang, Junliang Xing, Zhen Lei, Stan Z. Li, Xudong Zou

In particular, the SRN consists of two modules: the Selective Two-step Classification (STC) module and the Selective Two-step Regression (STR) module.

Ranked #1 on Face Detection on PASCAL Face

Face Detection General Classification +2

273

Paper
Code

Occlusion-aware R-CNN: Detecting Pedestrians in a Crowd

no code implementations • ECCV 2018 • Shifeng Zhang, Longyin Wen, Xiao Bian, Zhen Lei, Stan Z. Li

Pedestrian detection in crowded scenes is a challenging problem since the pedestrians often gather together and occlude each other.

Ranked #10 on Pedestrian Detection on Caltech (using extra training data)

Pedestrian Detection

Paper
Add Code

Large-scale Bisample Learning on ID Versus Spot Face Recognition

no code implementations • 8 Jun 2018 • Xiangyu Zhu, Hao Liu, Zhen Lei, Hailin Shi, Fan Yang, Dong Yi, Guo-Jun Qi, Stan Z. Li

In this paper, we propose a deep learning based large-scale bisample learning (LBL) method for IvS face recognition.

Face Recognition General Classification

Paper
Add Code

Face Synthesis for Eyeglass-Robust Face Recognition

1 code implementation • 4 Jun 2018 • Jianzhu Guo, Xiangyu Zhu, Zhen Lei, Stan Z. Li

A feasible method is to collect large-scale face images with eyeglasses for training deep learning methods.

Face Generation Face Model +2

340

Paper
Code

Ensemble Soft-Margin Softmax Loss for Image Classification

no code implementations • 10 May 2018 • Xiaobo Wang, Shifeng Zhang, Zhen Lei, Si Liu, Xiaojie Guo, Stan Z. Li

On the other hand, the learned classifier of softmax loss is weak.

Classification General Classification +1

Paper
Add Code

Face Alignment in Full Pose Range: A 3D Total Solution

2 code implementations • 2 Apr 2018 • Xiangyu Zhu, Xiaoming Liu, Zhen Lei, Stan Z. Li

In this paper, we propose to tackle these three challenges in a new alignment framework termed 3D Dense Face Alignment (3DDFA), in which a dense 3D Morphable Model (3DMM) is fitted to the image via Cascaded Convolutional Neural Networks.

Ranked #3 on Face Alignment on AFLW

3D Pose Estimation Depth Image Estimation +3

3,558

Paper
Code

Single-Shot Refinement Neural Network for Object Detection

12 code implementations • CVPR 2018 • Shifeng Zhang, Longyin Wen, Xiao Bian, Zhen Lei, Stan Z. Li

For object detection, the two-stage approach (e.g., Faster R-CNN) has been achieving the highest accuracy, whereas the one-stage approach (e.g., SSD) has the advantage of high efficiency.

Ranked #164 on Object Detection on COCO test-dev

Object object-detection +1

1,436

Paper
Code

S3FD: Single Shot Scale-Invariant Face Detector

no code implementations • ICCV 2017 • Shifeng Zhang, Xiangyu Zhu, Zhen Lei, Hailin Shi, Xiaobo Wang, Stan Z. Li

This paper presents a real-time face detector, named Single Shot Scale-invariant Face Detector (S3FD), which performs superiorly on various scales of faces with a single deep neural network, especially for small faces.

Face Detection

Paper
Add Code

S$^3$FD: Single Shot Scale-invariant Face Detector

3 code implementations • 17 Aug 2017 • Shifeng Zhang, Xiangyu Zhu, Zhen Lei, Hailin Shi, Xiaobo Wang, Stan Z. Li

This paper presents a real-time face detector, named Single Shot Scale-invariant Face Detector (S$^3$FD), which performs superiorly on various scales of faces with a single deep neural network, especially for small faces.

Ranked #2 on Face Detection on PASCAL Face

Face Detection

521

Paper
Code

FaceBoxes: A CPU Real-time Face Detector with High Accuracy

10 code implementations • 17 Aug 2017 • Shifeng Zhang, Xiangyu Zhu, Zhen Lei, Hailin Shi, Xiaobo Wang, Stan Z. Li

The MSCL aims at enriching the receptive fields and discretizing anchors over different layers to handle faces of various scales.

Ranked #3 on Face Detection on PASCAL Face

Face Detection Vocal Bursts Intensity Prediction

840

Paper
Code

Learning Efficient Image Representation for Person Re-Identification

no code implementations • 7 Jul 2017 • Yang Yang, Shengcai Liao, Zhen Lei, Stan Z. Li

Then, a robust image representation based on color names is obtained by concatenating the statistical descriptors in each stripe.

Person Re-Identification

Paper
Add Code

Exclusivity-Consistency Regularized Multi-View Subspace Clustering

no code implementations • CVPR 2017 • Xiaobo Wang, Xiaojie Guo, Zhen Lei, Changqing Zhang, Stan Z. Li

Multi-view subspace clustering aims to partition a set of multi-source data into their underlying groups.

Clustering Multi-view Subspace Clustering +1

Paper
Add Code

Deep Hybrid Similarity Learning for Person Re-identification

no code implementations • 16 Feb 2017 • Jianqing Zhu, Huanqiang Zeng, Shengcai Liao, Zhen Lei, Canhui Cai, Lixin Zheng

In this paper, a deep hybrid similarity learning (DHSL) method for person Re-ID based on a convolution neural network (CNN) is proposed.

Metric Learning Person Re-Identification

Paper
Add Code

Embedding Deep Metric for Person Re-identification: A Study Against Large Variations

no code implementations • 1 Nov 2016 • Hailin Shi, Yang Yang, Xiangyu Zhu, Shengcai Liao, Zhen Lei, Wei-Shi Zheng, Stan Z. Li

From this point of view, selecting suitable positive (i.e., intra-class) training samples within a local range is critical for training the CNN embedding, especially when the data has large intra-class variations.

Person Re-Identification

Paper
Add Code

Learning Discriminative Features with Class Encoder

no code implementations • 9 May 2016 • Hailin Shi, Xiangyu Zhu, Zhen Lei, Shengcai Liao, Stan Z. Li

Deep neural networks usually benefit from unsupervised pre-training, e.g., auto-encoders.

Face Recognition Translation +1

Paper
Add Code

CRAFT Objects from Images

1 code implementation • CVPR 2016 • Bin Yang, Junjie Yan, Zhen Lei, Stan Z. Li

They decompose the object detection problem into two cascaded easier tasks: 1) generating object proposals from images, 2) classifying proposals into various object categories.

Object object-detection +2

Paper
Code

Constrained Deep Metric Learning for Person Re-identification

no code implementations • 24 Nov 2015 • Hailin Shi, Xiangyu Zhu, Shengcai Liao, Zhen Lei, Yang Yang, Stan Z. Li

In this paper, we propose a novel CNN-based method to learn a discriminative metric with good robustness to the over-fitting problem in person re-identification.

Metric Learning Person Re-Identification

Paper
Add Code

Face Alignment Across Large Poses: A 3D Solution

no code implementations • CVPR 2016 • Xiangyu Zhu, Zhen Lei, Xiaoming Liu, Hailin Shi, Stan Z. Li

Face alignment, which fits a face model to an image and extracts the semantic meanings of facial pixels, has been an important topic in the CV community.

Ranked #3 on 3D Face Reconstruction on Florence

3D Face Reconstruction Face Alignment +2

Paper
Add Code

UA-DETRAC: A New Benchmark and Protocol for Multi-Object Detection and Tracking

no code implementations • 13 Nov 2015 • Longyin Wen, Dawei Du, Zhaowei Cai, Zhen Lei, Ming-Ching Chang, Honggang Qi, Jongwoo Lim, Ming-Hsuan Yang, Siwei Lyu

In this work, we perform a comprehensive quantitative study on the effects of object detection accuracy to the overall MOT performance, using the new large-scale University at Albany DETection and tRACking (UA-DETRAC) benchmark dataset.

Multi-Object Tracking Object +2

Paper
Add Code

Object Detection by Labeling Superpixels

no code implementations • CVPR 2015 • Junjie Yan, Yinan Yu, Xiangyu Zhu, Zhen Lei, Stan Z. Li

Object detection is always conducted by object proposal generation and classification sequentially.

General Classification Object +3

Paper
Add Code

High-Fidelity Pose and Expression Normalization for Face Recognition in the Wild

no code implementations • CVPR 2015 • Xiangyu Zhu, Zhen Lei, Junjie Yan, Dong Yi, Stan Z. Li

Pose and expression normalization is a crucial step to recover the canonical view of faces under arbitrary conditions, so as to improve the face recognition performance.

Face Recognition Vocal Bursts Intensity Prediction

Paper
Add Code

JOTS: Joint Online Tracking and Segmentation

no code implementations • CVPR 2015 • Longyin Wen, Dawei Du, Zhen Lei, Stan Z. Li, Ming-Hsuan Yang

We present a novel Joint Online Tracking and Segmentation (JOTS) algorithm which integrates the multi-part tracking and segmentation into a unified energy optimization framework to handle the video segmentation task.

Segmentation Video Segmentation +1

Paper
Add Code

Convolutional Channel Features

1 code implementation • ICCV 2015 • Bin Yang, Junjie Yan, Zhen Lei, Stan Z. Li

With the combination of CNN features and boosting forest, CCF benefits from the richer capacity in feature representation compared with channel features, as well as lower cost in computation and storage compared with end-to-end CNN methods.

Edge Detection Face Detection +2

Paper
Code

Learning Face Representation from Scratch

15 code implementations • 28 Nov 2014 • Dong Yi, Zhen Lei, Shengcai Liao, Stan Z. Li

The current situation in the field of face recognition is that data is more important than algorithm.

Face Recognition

4,166

Paper
Code

Learn Convolutional Neural Network for Face Anti-Spoofing

4 code implementations • 24 Aug 2014 • Jianwei Yang, Zhen Lei, Stan Z. Li

Moreover, the nets trained using combined data from two datasets have less biases between two datasets.

Ranked #2 on Face Anti-Spoofing on CASIA-MFSD

Face Anti-Spoofing

214

Paper
Code

Deep Metric Learning for Practical Person Re-Identification

no code implementations • 18 Jul 2014 • Dong Yi, Zhen Lei, Stan Z. Li

Compared to existing research, the experiments study a more practical setting: training and testing on different datasets (cross-dataset person re-identification).

Metric Learning Person Re-Identification

Paper
Add Code

Aggregate channel features for multi-view face detection

no code implementations • 15 Jul 2014 • Bin Yang, Junjie Yan, Zhen Lei, Stan Z. Li

Face detection has drawn much attention in recent decades since the seminal work by Viola and Jones.

Ranked #37 on Face Detection on WIDER Face (Medium)

Face Detection Re-Ranking

Paper
Add Code

Shared Representation Learning for Heterogeneous Face Recognition

no code implementations • 5 Jun 2014 • Dong Yi, Zhen Lei, Shengcai Liao, Stan Z. Li

For the NIR-VIS problem, we produce new state-of-the-art performance on the CASIA HFB and NIR-VIS 2.0 databases.

Face Recognition Heterogeneous Face Recognition +1

Paper
Add Code

Multiple Target Tracking Based on Undirected Hierarchical Relation Hypergraph

no code implementations • CVPR 2014 • Longyin Wen, Wenbo Li, Junjie Yan, Zhen Lei, Dong Yi, Stan Z. Li

Multi-target tracking is an interesting but challenging task in the computer vision field.

Relation

Paper
Add Code

The Fastest Deformable Part Model for Object Detection

no code implementations • CVPR 2014 • Junjie Yan, Zhen Lei, Longyin Wen, Stan Z. Li

Three prohibitive steps in cascade version of DPM are accelerated, including 2D correlation between root filter and feature map, cascade part pruning and HOG feature extraction.

Face Detection Object +2

Paper
Add Code

Towards Pose Robust Face Recognition

no code implementations • CVPR 2013 • Dong Yi, Zhen Lei, Stan Z. Li

In this paper, we propose a novel method for pose robust face recognition towards practical applications, which is fast, pose robust and can work well under unconstrained environments.

Face Recognition Robust Face Recognition

Paper
Add Code

Robust Multi-resolution Pedestrian Detection in Traffic Scenes

no code implementations • CVPR 2013 • Junjie Yan, Xucong Zhang, Zhen Lei, Shengcai Liao, Stan Z. Li

The model contains resolution aware transformations to map pedestrians in different resolutions to a common space, where a shared detector is constructed to distinguish pedestrians from background.

Pedestrian Detection

Paper
Add Code

Fast Matching by 2 Lines of Code for Large Scale Face Recognition Systems

no code implementations • 28 Feb 2013 • Dong Yi, Zhen Lei, Yang Hu, Stan Z. Li

However, the use of this method is very generic and not limited in face recognition, which can be easily generalized to other biometrics as a post-processing module.

Computational Efficiency Face Recognition

Paper
Add Code
