Search Results for author: Shiguang Shan

Found 135 papers, 51 papers with code

Clothes-Changing Person Re-Identification with Feasibility-Aware Intermediary Matching

no code implementations • 15 Apr 2024 • Jiahe Zhao, Ruibing Hou, Hong Chang, Xinqian Gu, Bingpeng Ma, Shiguang Shan, Xilin Chen

Current clothes-changing person re-identification (re-id) approaches usually perform retrieval based on clothes-irrelevant features, while neglecting the potential of clothes-relevant features.

Clothes Changing Person Re-Identification Retrieval

Paper
Add Code

HPNet: Dynamic Trajectory Forecasting with Historical Prediction Attention

1 code implementation • 9 Apr 2024 • Xiaolong Tang, Meina Kan, Shiguang Shan, Zhilong Ji, Jinfeng Bai, Xilin Chen

The proposed Historical Prediction Attention together with the Agent Attention and Mode Attention is further formulated as the Triple Factorized Attention module, serving as the core design of HPNet. Experiments on the Argoverse and INTERACTION datasets show that HPNet achieves state-of-the-art performance, and generates accurate and stable future trajectories.

Autonomous Driving Trajectory Forecasting

Paper
Code

StylizedGS: Controllable Stylization for 3D Gaussian Splatting

no code implementations • 8 Apr 2024 • Dingxi Zhang, Zhuoxun Chen, Yu-Jie Yuan, Fang-Lue Zhang, Zhenliang He, Shiguang Shan, Lin Gao

With the rapid development of XR, 3D generation and editing are becoming more and more important, among which, stylization is an important tool of 3D appearance editing.

3D Generation Style Transfer

Paper
Add Code

GPT as Psychologist? Preliminary Evaluations for GPT-4V on Visual Affective Computing

2 code implementations • 9 Mar 2024 • Hao Lu, Xuesong Niu, Jiyao Wang, Yin Wang, Qingyong Hu, Jiaqi Tang, Yuting Zhang, Kaishen Yuan, Bin Huang, Zitong Yu, Dengbo He, Shuiguang Deng, Hao Chen, Yingcong Chen, Shiguang Shan

In conclusion, this paper provides valuable insights into the potential applications and challenges of MLLMs in human-centric computing.

Emotion Recognition Facial Action Unit Detection +4

Paper
Code

Contrastive Learning of Person-independent Representations for Facial Action Unit Detection

no code implementations • 6 Mar 2024 • Yong Li, Shiguang Shan

We formulate the self-supervised AU representation learning signals in two-fold: (1) AU representation should be frame-wisely discriminative within a short video clip; (2) Facial frames sampled from different identities but show analogous facial AUs should have consistent AU representations.

Action Unit Detection Contrastive Learning +2

Paper
Add Code

Generalized Face Liveness Detection via De-spoofing Face Generator

no code implementations • 17 Jan 2024 • Xingming Long, Shiguang Shan, Jie Zhang

In this paper, we conduct an Anomalous cue Guided FAS (AG-FAS) method, which leverages real faces for improving model generalization via a De-spoofing Face Generator (DFG).

Face Anti-Spoofing

Paper
Add Code

Collaboratively Self-supervised Video Representation Learning for Action Recognition

no code implementations • 15 Jan 2024 • Jie Zhang, Zhifan Wan, Lanqing Hu, Stephen Lin, Shuzhe Wu, Shiguang Shan

Considering the close connection between action recognition and human pose estimation, we design a Collaboratively Self-supervised Video Representation (CSVR) learning framework specific to action recognition by jointly considering generative pose prediction and discriminative context matching as pretext tasks.

Action Recognition Pose Estimation +2

Paper
Add Code

Pre-trained Model Guided Fine-Tuning for Zero-Shot Adversarial Robustness

1 code implementation • 9 Jan 2024 • Sibo Wang, Jie Zhang, Zheng Yuan, Shiguang Shan

Specifically, PMG-AFT minimizes the distance between the features of adversarial examples in the target model and those in the pre-trained model, aiming to preserve the generalization features already captured by the pre-trained model.

Adversarial Robustness Zero-shot Generalization

Paper
Code

Towards Robust Semantic Segmentation against Patch-based Attack via Attention Refinement

no code implementations • 3 Jan 2024 • Zheng Yuan, Jie Zhang, Yude Wang, Shiguang Shan, Xilin Chen

The attention mechanism has been proven effective on various visual tasks in recent years.

Attribute Segmentation +1

Paper
Add Code

FullLoRA-AT: Efficiently Boosting the Robustness of Pretrained Vision Transformers

no code implementations • 3 Jan 2024 • Zheng Yuan, Jie Zhang, Shiguang Shan

In recent years, the Vision Transformer (ViT) model has gradually become mainstream in various computer vision tasks, and the robustness of the model has received increasing attention.

Adversarial Robustness

Paper
Add Code

Tokenize Anything via Prompting

1 code implementation • 14 Dec 2023 • Ting Pan, Lulu Tang, Xinlong Wang, Shiguang Shan

The semantic token is responsible for learning the semantic priors in a predefined concept space.

Visual Prompting

432

Paper
Code

From Static to Dynamic: Adapting Landmark-Aware Image Models for Facial Expression Recognition in Videos

1 code implementation • 9 Dec 2023 • Yin Chen, Jia Li, Shiguang Shan, Meng Wang, Richang Hong

And the TMAs capture and model the relationships of dynamic changes in facial expressions, effectively extending the pre-trained image model for videos.

Ranked #1 on Facial Expression Recognition (FER) on RAF-DB

Dynamic Facial Expression Recognition Facial Expression Recognition +1

Paper
Code

Cooperative Dual Attention for Audio-Visual Speech Enhancement with Facial Cues

no code implementations • 24 Nov 2023 • Feixiang Wang, Shuang Yang, Shiguang Shan, Xilin Chen

By integrating cooperative dual attention in the visual encoder and audio-visual fusion strategy, our model effectively extracts beneficial speech information from both audio and visual cues for AVSE.

Speech Enhancement

Paper
Add Code

Learning Separable Hidden Unit Contributions for Speaker-Adaptive Lip-Reading

1 code implementation • 8 Oct 2023 • Songtao Luo, Shuang Yang, Shiguang Shan, Xilin Chen

For deep layers where both the speaker's features and the speech content features are all expressed well, we introduce the speaker-adaptive features to learn for suppressing the speech content irrelevant noise for robust lip reading.

Lip Reading

Paper
Code

Dual Compensation Residual Networks for Class Imbalanced Learning

no code implementations • 25 Aug 2023 • Ruibing Hou, Hong Chang, Bingpeng Ma, Shiguang Shan, Xilin Chen

Learning generalizable representation and classifier for class-imbalanced data is challenging for data-driven deep models.

Paper
Add Code

Patch Is Not All You Need

no code implementations • 21 Aug 2023 • Changzhen Li, Jie Zhang, Yang Wei, Zhilong Ji, Jinfeng Bai, Shiguang Shan

Vision Transformers have achieved great success in computer visions, delivering exceptional performance across various tasks.

Paper
Add Code

Triplet Knowledge Distillation

no code implementations • 25 May 2023 • Xijun Wang, Dongyang Liu, Meina Kan, Chunrui Han, Zhongqin Wu, Shiguang Shan

Distillation then begins in an online manner, and the teacher is only allowed to express solutions within the aforementioned subspace.

Face Recognition Image Classification +1

Paper
Add Code

Function-Consistent Feature Distillation

1 code implementation • 24 Apr 2023 • Dongyang Liu, Meina Kan, Shiguang Shan, Xilin Chen

The core idea of FCFD is to make teacher and student features not only numerically similar, but more importantly produce similar outputs when fed to the later part of the same network.

Image Classification object-detection +1

Paper
Code

CCLAP: Controllable Chinese Landscape Painting Generation via Latent Diffusion Model

1 code implementation • 9 Apr 2023 • Zhongqi Wang, Jie Zhang, Zhilong Ji, Jinfeng Bai, Shiguang Shan

While the style aggregator module is to generate paintings of a style corresponding to a reference image.

Chinese Landscape Painting Generation

Paper
Code

Real Face Foundation Representation Learning for Generalized Deepfake Detection

no code implementations • 15 Mar 2023 • Liang Shi, Jie Zhang, Shiguang Shan

In this study, we propose Real Face Foundation Representation Learning (RFFR), which aims to learn a general representation from large-scale real face datasets and detect potential artifacts outside the distribution of RFFR.

DeepFake Detection Face Swapping +1

Paper
Add Code

Diversity-Measurable Anomaly Detection

1 code implementation • CVPR 2023 • Wenrui Liu, Hong Chang, Bingpeng Ma, Shiguang Shan, Xilin Chen

In this paper, to better handle the tradeoff problem, we propose Diversity-Measurable Anomaly Detection (DMAD) framework to enhance reconstruction diversity while avoid the undesired generalization on anomalies.

Ranked #1 on Anomaly Detection on UCSD Ped2

Anomaly Detection In Surveillance Videos Defect Detection +1

Paper
Code

Holistic Label Correction for Noisy Multi-Label Classification

no code implementations • ICCV 2023 • Xiaobo Xia, Jiankang Deng, Wei Bao, Yuxuan Du, Bo Han, Shiguang Shan, Tongliang Liu

The issues are, that we do not understand why label dependence is helpful in the problem, and how to learn and utilize label dependence only using training data with noisy multiple labels.

Classification Memorization +1

Paper
Add Code

DISC: Learning From Noisy Labels via Dynamic Instance-Specific Selection and Correction

1 code implementation • CVPR 2023 • YiFan Li, Hu Han, Shiguang Shan, Xilin Chen

Then we propose a dynamic threshold strategy for each instance, based on the momentum of each instance's memorization strength in previous epochs to select and correct noisy labeled data.

Learning with noisy labels Memorization

Paper
Code

DandelionNet: Domain Composition with Instance Adaptive Classification for Domain Generalization

no code implementations • ICCV 2023 • Lanqing Hu, Meina Kan, Shiguang Shan, Xilin Chen

Domain generalization (DG) attempts to learn a model on source domains that can well generalize to unseen but different domains.

Domain Generalization

Paper
Add Code

Source-Free Adaptive Gaze Estimation by Uncertainty Reduction

1 code implementation • CVPR 2023 • Xin Cai, Jiabei Zeng, Shiguang Shan, Xilin Chen

In light of this, we present an unsupervised source-free domain adaptation approach for gaze estimation, which adapts a source-trained gaze estimator to unlabeled target domains without source data.

Gaze Estimation Source-Free Domain Adaptation

Paper
Code

Hierarchical Compositional Representations for Few-shot Action Recognition

no code implementations • 19 Aug 2022 • Changzhen Li, Jie Zhang, Shuzhe Wu, Xin Jin, Shiguang Shan

Recently action recognition has received more and more attention for its comprehensive and practical applications in intelligent surveillance and human-computer interaction.

Few-Shot action recognition Few Shot Action Recognition

Paper
Add Code

Learning Pseudo Labels for Semi-and-Weakly Supervised Semantic Segmentation

1 code implementation • Pattern Recognition 2022 • Yude Wang, Jie Zhang, Meina Kan, Shiguang Shan

Firstly, we introduce a class-aware cross entropy (CCE) loss for network training.

Ranked #9 on Semi-Supervised Semantic Segmentation on PASCAL VOC 2012 50%

Pseudo Label Semi-Supervised Semantic Segmentation +2

Paper
Code

MAFW: A Large-scale, Multi-modal, Compound Affective Database for Dynamic Facial Expression Recognition in the Wild

no code implementations • 1 Aug 2022 • Yuanyuan Liu, Wei Dai, Chuanxu Feng, Wenbin Wang, Guanghao Yin, Jiabei Zeng, Shiguang Shan

To the best of our knowledge, MAFW is the first in-the-wild multi-modal database annotated with compound emotion annotations and emotion-related captions.

Ranked #9 on Dynamic Facial Expression Recognition on MAFW

Dynamic Facial Expression Recognition Facial Expression Recognition +1

Paper
Add Code

UniCon+: ICTCAS-UCAS Submission to the AVA-ActiveSpeaker Task at ActivityNet Challenge 2022

no code implementations • 22 Jun 2022 • Yuanhang Zhang, Susan Liang, Shuang Yang, Shiguang Shan

This report presents a brief description of our winning solution to the AVA Active Speaker Detection (ASD) task at ActivityNet Challenge 2022.

Ranked #2 on Audio-Visual Active Speaker Detection on AVA-ActiveSpeaker

Audio-Visual Active Speaker Detection

Paper
Add Code

Clothes-Changing Person Re-identification with RGB Modality Only

1 code implementation • CVPR 2022 • Xinqian Gu, Hong Chang, Bingpeng Ma, Shutao Bai, Shiguang Shan, Xilin Chen

In this paper, we propose a Clothes-based Adversarial Loss (CAL) to mine clothes-irrelevant features from the original RGB images by penalizing the predictive power of re-id model w. r. t.

Ranked #1 on Multiview Gait Recognition on CASIA-B

Clothes Changing Person Re-Identification Multiview Gait Recognition

115

Paper
Code

Joint Feature Learning and Relation Modeling for Tracking: A One-Stream Framework

1 code implementation • 22 Mar 2022 • Botao Ye, Hong Chang, Bingpeng Ma, Shiguang Shan, Xilin Chen

The current popular two-stream, two-stage tracking framework extracts the template and the search region features separately and then performs relation modeling, thus the extracted features lack the awareness of the target and have limited target-background discriminability.

Ranked #4 on Visual Object Tracking on UAV123

Relation Visual Object Tracking +1

325

Paper
Code

Enhancing Face Recognition With Self-Supervised 3D Reconstruction

no code implementations • CVPR 2022 • Mingjie He, Jie Zhang, Shiguang Shan, Xilin Chen

In this paper, we propose to enhance face recognition with a bypass of self-supervised 3D reconstruction, which enforces the neural backbone to focus on the identity-related depth and albedo information while neglects the identity-irrelevant pose and illumination information.

3D Face Reconstruction 3D Reconstruction +3

Paper
Add Code

Adaptive Image Transformations for Transfer-based Adversarial Attack

2 code implementations • 27 Nov 2021 • Zheng Yuan, Jie Zhang, Shiguang Shan

Adversarial attacks provide a good way to study the robustness of deep learning models.

Adversarial Attack

135

Paper
Code

Adaptive Perturbation for Adversarial Attack

no code implementations • 27 Nov 2021 • Zheng Yuan, Jie Zhang, Zhaoyan Jiang, Liangliang Li, Shiguang Shan

Instead of using the sign function, we propose to directly utilize the exact gradient direction with a scaling factor for generating adversarial perturbations, which improves the attack success rates of adversarial examples even with fewer perturbations.

Adversarial Attack

Paper
Add Code

Learning Fair Face Representation With Progressive Cross Transformer

no code implementations • 11 Aug 2021 • Yong Li, Yufei Sun, Zhen Cui, Shiguang Shan, Jian Yang

To mitigate racial bias and meantime preserve robust FR, we abstract face identity-related representation as a signal denoising problem and propose a progressive cross transformer (PCT) method for fair face recognition.

Denoising Face Recognition

Paper
Add Code

Meta Gradient Adversarial Attack

1 code implementation • ICCV 2021 • Zheng Yuan, Jie Zhang, Yunpei Jia, Chuanqi Tan, Tao Xue, Shiguang Shan

In recent years, research on adversarial attacks has become a hot spot.

Adversarial Attack Meta-Learning

Paper
Code

UniCon: Unified Context Network for Robust Active Speaker Detection

no code implementations • 5 Aug 2021 • Yuanhang Zhang, Susan Liang, Shuang Yang, Xiao Liu, Zhongqin Wu, Shiguang Shan, Xilin Chen

Our solution is a novel, unified framework that focuses on jointly modeling multiple types of contextual information: spatial context to indicate the position and scale of each candidate's face, relational context to capture the visual relationships among the candidates and contrast audio-visual affinities with each other, and temporal context to aggregate long-term information and smooth out local uncertainties.

Ranked #10 on Audio-Visual Active Speaker Detection on AVA-ActiveSpeaker

Audio-Visual Active Speaker Detection

Paper
Add Code

Locality-aware Channel-wise Dropout for Occluded Face Recognition

no code implementations • 20 Jul 2021 • Mingjie He, Jie Zhang, Shiguang Shan, Xiao Liu, Zhongqin Wu, Xilin Chen

Furthermore, by randomly dropping out several feature channels, our method can well simulate the occlusion of larger area.

Face Recognition

Paper
Add Code

Graph Jigsaw Learning for Cartoon Face Recognition

1 code implementation • 14 Jul 2021 • Yong Li, Lingjie Lao, Zhen Cui, Shiguang Shan, Jian Yang

To mitigate this issue, we propose the GraphJigsaw that constructs jigsaw puzzles at various stages in the classification network and solves the puzzles with the graph convolutional network (GCN) in a progressive manner.

Classification Face Recognition

Paper
Code

Gaze Estimation with an Ensemble of Four Architectures

1 code implementation • 5 Jul 2021 • Xin Cai, BoYu Chen, Jiabei Zeng, Jiajun Zhang, Yunjia Sun, Xiao Wang, Zhilong Ji, Xiao Liu, Xilin Chen, Shiguang Shan

This paper presents a method for gaze estimation according to face images.

Gaze Estimation

Paper
Code

MFR 2021: Masked Face Recognition Competition

no code implementations • 29 Jun 2021 • Fadi Boutros, Naser Damer, Jan Niklas Kolf, Kiran Raja, Florian Kirchbuchner, Raghavendra Ramachandra, Arjan Kuijper, Pengcheng Fang, Chao Zhang, Fei Wang, David Montero, Naiara Aginako, Basilio Sierra, Marcos Nieto, Mustafa Ekrem Erakin, Ugur Demir, Hazim Kemal, Ekenel, Asaki Kataoka, Kohei Ichikawa, Shizuma Kubo, Jie Zhang, Mingjie He, Dan Han, Shiguang Shan, Klemen Grm, Vitomir Štruc, Sachith Seneviratne, Nuran Kasthuriarachchi, Sanka Rasnayaka, Pedro C. Neto, Ana F. Sequeira, Joao Ribeiro Pinto, Mohsen Saffari, Jaime S. Cardoso

These teams successfully submitted 18 valid solutions.

Face Recognition Face Verification +1

Paper
Add Code

Feature Completion for Occluded Person Re-Identification

1 code implementation • 24 Jun 2021 • Ruibing Hou, Bingpeng Ma, Hong Chang, Xinqian Gu, Shiguang Shan, Xilin Chen

Our method significantly outperforms existing methods on the occlusion datasets, while remains top even superior performance on holistic datasets.

Person Re-Identification

Paper
Code

ICTCAS-UCAS-TAL Submission to the AVA-ActiveSpeaker Task at ActivityNet Challenge 2021

no code implementations • The ActivityNet Large-Scale Activity Recognition Challenge Workshop, CVPR 2021 • Yuanhang Zhang, Susan Liang, Shuang Yang, Xiao Liu, Zhongqin Wu, Shiguang Shan

This report presents a brief description of our method for the AVA Active Speaker Detection (ASD) task at ActivityNet Challenge 2021.

Ranked #6 on Audio-Visual Active Speaker Detection on AVA-ActiveSpeaker

Audio-Visual Active Speaker Detection

Paper
Add Code

Meta Auxiliary Learning for Facial Action Unit Detection

no code implementations • 14 May 2021 • Yong Li, Shiguang Shan

The learned sample weights alleviate the negative transfer from two aspects: 1) balance the loss of each task automatically, and 2) suppress the weights of FE samples that have large uncertainties.

Action Unit Detection Auxiliary Learning +4

Paper
Add Code

BiCnet-TKS: Learning Efficient Spatial-Temporal Representation for Video Person Re-Identification

1 code implementation • CVPR 2021 • Ruibing Hou, Hong Chang, Bingpeng Ma, Rui Huang, Shiguang Shan

Detail Branch processes frames at original resolution to preserve the detailed visual clues, and Context Branch with a down-sampling strategy is employed to capture long-range contexts.

Video-Based Person Re-Identification

Paper
Code

EigenGAN: Layer-Wise Eigen-Learning for GANs

1 code implementation • ICCV 2021 • Zhenliang He, Meina Kan, Shiguang Shan

Via generative adversarial training to learn a target distribution, these layer-wise subspaces automatically discover a set of "eigen-dimensions" at each layer corresponding to a set of semantic attributes or interpretable variations.

Attribute Face Generation +1

341

Paper
Code

Continuity-Discrimination Convolutional Neural Network for Visual Object Tracking

no code implementations • 18 Apr 2021 • Shen Li, Bingpeng Ma, Hong Chang, Shiguang Shan, Xilin Chen

This paper proposes a novel model, named Continuity-Discrimination Convolutional Neural Network (CD-CNN), for visual object tracking.

Object Visual Object Tracking

Paper
Add Code

Cross-Encoder for Unsupervised Gaze Representation Learning

1 code implementation • ICCV 2021 • Yunjia Sun, Jiabei Zeng, Shiguang Shan, Xilin Chen

To address the issue that the feature of gaze is always intertwined with the appearance of the eye, Cross-Encoder disentangles the features using a latent-code-swapping mechanism on eye-consistent image pairs and gaze-similar ones.

Gaze Estimation Representation Learning

Paper
Code

Attributes Aware Face Generation with Generative Adversarial Networks

1 code implementation • 3 Dec 2020 • Zheng Yuan, Jie Zhang, Shiguang Shan, Xilin Chen

Recent studies have shown remarkable success in face image generations.

Attribute Face Generation

Paper
Code

Learn an Effective Lip Reading Model without Pains

1 code implementation • 15 Nov 2020 • Dalu Feng, Shuang Yang, Shiguang Shan, Xilin Chen

Considering the non-negligible effects of these strategies and the existing tough status to train an effective lip reading model, we perform a comprehensive quantitative study and comparative analysis, for the first time, to show the effects of several different choices for lip reading.

Ranked #1 on Lipreading on CAS-VSR-W1k (LRW-1000) (using extra training data)

Lipreading Lip Reading +2

141

Paper
Code

IAUnet: Global Context-Aware Feature Learning for Person Re-Identification

1 code implementation • 2 Sep 2020 • Ruibing Hou, Bingpeng Ma, Hong Chang, Xinqian Gu, Shiguang Shan, Xilin Chen

Furthermore, a Channel IAU (CIAU) module is designed to model the semantic contextual interactions between channel features to enhance the feature representation, especially for small-scale visual cues and body parts.

Object Categorization Person Re-Identification

Paper
Code

Temporal Complementary Learning for Video Person Re-Identification

2 code implementations • ECCV 2020 • Ruibing Hou, Hong Chang, Bingpeng Ma, Shiguang Shan, Xilin Chen

This paper proposes a Temporal Complementary Learning Network that extracts complementary features of consecutive video frames for video person re-identification.

Video-Based Person Re-Identification

Paper
Code

Sketching Image Gist: Human-Mimetic Hierarchical Scene Graph Generation

1 code implementation • ECCV 2020 • Wenbin Wang, Ruiping Wang, Shiguang Shan, Xilin Chen

Scene graph aims to faithfully reveal humans' perception of image content.

Graph Generation Scene Graph Generation

Paper
Code

Video-based Remote Physiological Measurement via Cross-verified Feature Disentangling

1 code implementation • ECCV 2020 • Xuesong Niu, Zitong Yu, Hu Han, Xiaobai Li, Shiguang Shan, Guoying Zhao

Remote physiological measurements, e. g., remote photoplethysmography (rPPG) based heart rate (HR), heart rate variability (HRV) and respiration frequency (RF) measuring, are playing more and more important roles under the application scenarios where contact measurement is inconvenient or impossible.

Heart Rate Variability

Paper
Code

PA-GAN: Progressive Attention Generative Adversarial Network for Facial Attribute Editing

3 code implementations • 12 Jul 2020 • Zhenliang He, Meina Kan, Jichao Zhang, Shiguang Shan

Facial attribute editing aims to manipulate attributes on the human face, e. g., adding a mustache or changing the hair color.

Attribute Generative Adversarial Network

Paper
Code

Synchronous Bidirectional Learning for Multilingual Lip Reading

1 code implementation • 8 May 2020 • Mingshuang Luo, Shuang Yang, Xilin Chen, Zitao Liu, Shiguang Shan

Based on this idea, we try to explore the synergized learning of multilingual lip reading in this paper, and further propose a synchronous bidirectional learning (SBL) framework for effective synergy of multilingual lip reading.

Lip Reading

Paper
Code

Single-Side Domain Generalization for Face Anti-Spoofing

1 code implementation • CVPR 2020 • Yunpei Jia, Jie Zhang, Shiguang Shan, Xilin Chen

In this work, we propose an end-to-end single-side domain generalization framework (SSDG) to improve the generalization ability of face anti-spoofing.

Domain Generalization Face Anti-Spoofing

214

Paper
Code

Self-supervised Equivariant Attention Mechanism for Weakly Supervised Semantic Segmentation

2 code implementations • CVPR 2020 • Yude Wang, Jie Zhang, Meina Kan, Shiguang Shan, Xilin Chen

Our method is based on the observation that equivariance is an implicit constraint in fully supervised semantic segmentation, whose pixel-level labels take the same spatial transformation as the input images during data augmentation.

Ranked #69 on Weakly-Supervised Semantic Segmentation on PASCAL VOC 2012 val

Data Augmentation Weakly supervised Semantic Segmentation +1

526

Paper
Code

Cross-domain Face Presentation Attack Detection via Multi-domain Disentangled Representation Learning

no code implementations • CVPR 2020 • Guoqing Wang, Hu Han, Shiguang Shan, Xilin Chen

In light of this, we propose an efficient disentangled representation learning for cross-domain face PAD.

Face Presentation Attack Detection Face Recognition +1

Paper
Add Code

Multi-Modal Graph Neural Network for Joint Reasoning on Vision and Scene Text

1 code implementation • CVPR 2020 • Difei Gao, Ke Li, Ruiping Wang, Shiguang Shan, Xilin Chen

Then, we introduce three aggregators which guide the message passing from one graph to another to utilize the contexts in various modalities, so as to refine the features of nodes.

Question Answering Visual Question Answering (VQA)

Paper
Code

The 1st Challenge on Remote Physiological Signal Sensing (RePSS)

no code implementations • 26 Mar 2020 • Xiaobai Li, Hu Han, Hao Lu, Xuesong Niu, Zitong Yu, Antitza Dantcheva, Guoying Zhao, Shiguang Shan

Remote measurement of physiological signals from videos is an emerging topic.

Paper
Add Code

Mutual Information Maximization for Effective Lip Reading

1 code implementation • 13 Mar 2020 • Xing Zhao, Shuang Yang, Shiguang Shan, Xilin Chen

By combining these two advantages together, the proposed method is expected to be both discriminative and robust for effective lip reading.

Ranked #8 on Lipreading on CAS-VSR-W1k (LRW-1000)

Lipreading Lip Reading

Paper
Code

Deformation Flow Based Two-Stream Network for Lip Reading

1 code implementation • 12 Mar 2020 • Jing-Yun Xiao, Shuang Yang, Yuan-Hang Zhang, Shiguang Shan, Xilin Chen

Observing on the continuity in adjacent frames in the speaking process, and the consistency of the motion patterns among different speakers when they pronounce the same phoneme, we model the lip movements in the speaking process as a sequence of apparent deformations in the lip region.

Ranked #6 on Lipreading on CAS-VSR-W1k (LRW-1000)

Knowledge Distillation Lipreading +2

Paper
Code

Pseudo-Convolutional Policy Gradient for Sequence-to-Sequence Lip-Reading

no code implementations • 9 Mar 2020 • Mingshuang Luo, Shuang Yang, Shiguang Shan, Xilin Chen

On the one hand, we introduce the evaluation metric (refers to the character error rate in this paper) as a form of reward to optimize the model together with the original discriminative target.

Ranked #9 on Lipreading on CAS-VSR-W1k (LRW-1000)

Lipreading Lip Reading +1

Paper
Add Code

Can We Read Speech Beyond the Lips? Rethinking RoI Selection for Deep Visual Speech Recognition

1 code implementation • 6 Mar 2020 • Yuan-Hang Zhang, Shuang Yang, Jing-Yun Xiao, Shiguang Shan, Xilin Chen

Recent advances in deep learning have heightened interest among researchers in the field of visual speech recognition (VSR).

Ranked #2 on Lipreading on GRID corpus (mixed-speech)

Lipreading Lip Reading +3

Paper
Code

Emotion Recognition for In-the-wild Videos

no code implementations • 13 Feb 2020 • Hanyu Liu, Jiabei Zeng, Shiguang Shan, Xilin Chen

This paper is a brief introduction to our submission to the seven basic expression classification track of Affective Behavior Analysis in-the-wild Competition held in conjunction with the IEEE International Conference on Automatic Face and Gesture Recognition (FG) 2020.

Emotion Recognition General Classification +1

Paper
Add Code

$M^3$T: Multi-Modal Continuous Valence-Arousal Estimation in the Wild

1 code implementation • 7 Feb 2020 • Yuan-Hang Zhang, Rulin Huang, Jiabei Zeng, Shiguang Shan, Xilin Chen

This report describes a multi-modal multi-task ($M^3$T) approach underlying our submission to the valence-arousal estimation track of the Affective Behavior Analysis in-the-wild (ABAW) Challenge, held in conjunction with the IEEE International Conference on Automatic Face and Gesture Recognition (FG) 2020.

Arousal Estimation Gesture Recognition

Paper
Code

FCSR-GAN: Joint Face Completion and Super-resolution via Multi-task Learning

1 code implementation • 4 Nov 2019 • Jiancheng Cai, Hu Han, Shiguang Shan, Xilin Chen

Combined variations containing low-resolution and occlusion often present in face images in the wild, e. g., under the scenario of video surveillance.

Face Identification Facial Inpainting +3

Paper
Code

Deep Heterogeneous Hashing for Face Video Retrieval

no code implementations • 4 Nov 2019 • Shishi Qiao, Ruiping Wang, Shiguang Shan, Xilin Chen

To tackle the key challenge of hashing on the manifold, a well-studied Riemannian kernel mapping is employed to project data (i. e. covariance matrices) into Euclidean space and thus enables to embed the two heterogeneous representations into a common Hamming space, where both intra-space discriminability and inter-space compatibility are considered.

Retrieval Video Retrieval

Paper
Add Code

RhythmNet: End-to-end Heart Rate Estimation from Face via Spatial-temporal Representation

no code implementations • 25 Oct 2019 • Xuesong Niu, Shiguang Shan, Hu Han, Xilin Chen

Recently, some methods have been proposed for remote HR estimation from face videos; however, most of them focus on well-controlled scenarios, their generalization ability into less-constrained scenarios (e. g., with head movement, and bad illumination) are not known.

Heart rate estimation

Paper
Add Code

Multi-label Co-regularization for Semi-supervised Facial Action Unit Recognition

1 code implementation • NeurIPS 2019 • Xuesong Niu, Hu Han, Shiguang Shan, Xilin Chen

In this work, we propose a semi-supervised approach for AU recognition utilizing a large number of web face images without AU labels and a relatively small face dataset with AU annotations inspired by the co-training methods.

Emotion Recognition Facial Action Unit Detection

Paper
Code

Cross Attention Network for Few-shot Classification

1 code implementation • NeurIPS 2019 • Ruibing Hou, Hong Chang, Bingpeng Ma, Shiguang Shan, Xilin Chen

The unseen classes and low-data problem make few-shot classification very challenging.

Classification General Classification

203

Paper
Code

Cross-modal Scene Graph Matching for Relationship-aware Image-Text Retrieval

1 code implementation • 11 Oct 2019 • Sijin Wang, Ruiping Wang, Ziwei Yao, Shiguang Shan, Xilin Chen

In the light of recent success of scene graph in many CV and NLP tasks for describing complex natural scenes, we propose to represent image and text with two kinds of scene graphs: visual scene graph (VSG) and textual scene graph (TSG), each of which is exploited to jointly characterize objects and relationships in the corresponding modality.

Graph Matching Retrieval +1

Paper
Code

Hierarchical Disentangle Network for Object Representation Learning

no code implementations • 25 Sep 2019 • Shishi Qiao, Ruiping Wang, Shiguang Shan, Xilin Chen

In this paper, we propose the hierarchical disentangle network (HDN) to exploit the rich hierarchical characteristics among categories to divide the disentangling process in a coarse-to-fine manner, such that each level only focuses on learning the specific representations in its granularity and finally the common and unique representations in all granularities jointly constitute the raw object.

Disentanglement Generative Adversarial Network +1

Paper
Add Code

Self-supervised Scale Equivariant Network for Weakly Supervised Semantic Segmentation

1 code implementation • 9 Sep 2019 • Yude Wang, Jie Zhang, Meina Kan, Shiguang Shan, Xilin Chen

This regularized CAM can be embedded in most recent advanced weakly supervised semantic segmentation framework.

Segmentation Weakly-supervised Learning +2

101

Paper
Code

Transferable Contrastive Network for Generalized Zero-Shot Learning

no code implementations • ICCV 2019 • Huajie Jiang, Ruiping Wang, Shiguang Shan, Xilin Chen

Zero-shot learning (ZSL) is a challenging problem that aims to recognize the target categories without seen data, where semantic information is leveraged to transfer knowledge from some source classes.

Ranked #6 on Zero-Shot Learning on SUN Attribute

Generalized Zero-Shot Learning Transfer Learning

Paper
Add Code

Temporal Knowledge Propagation for Image-to-Video Person Re-identification

1 code implementation • ICCV 2019 • Xinqian Gu, Bingpeng Ma, Hong Chang, Shiguang Shan, Xilin Chen

With back propagation, temporal knowledge can be transferred to enhance the image features and the information asymmetry problem can be alleviated.

Ranked #9 on Person Re-Identification on iLIDS-VID

Image-To-Video Person Re-Identification Video-Based Person Re-Identification

Paper
Code

CRIC: A VQA Dataset for Compositional Reasoning on Vision and Commonsense

no code implementations • 8 Aug 2019 • Difei Gao, Ruiping Wang, Shiguang Shan, Xilin Chen

To comprehensively evaluate such abilities, we propose a VQA benchmark, CRIC, which introduces new types of questions about Compositional Reasoning on vIsion and Commonsense, and an evaluation metric integrating the correctness of answering and commonsense grounding.

Question Answering Visual Question Answering (VQA)

Paper
Add Code

Interaction-and-Aggregation Network for Person Re-identification

1 code implementation • CVPR 2019 • Ruibing Hou, Bingpeng Ma, Hong Chang, Xinqian Gu, Shiguang Shan, Xilin Chen

Person re-identification (reID) benefits greatly from deep convolutional neural networks (CNNs) which learn robust feature embeddings.

Person Re-Identification

Paper
Code

VRSTC: Occlusion-Free Video Person Re-Identification

no code implementations • CVPR 2019 • Ruibing Hou, Bingpeng Ma, Hong Chang, Xinqian Gu, Shiguang Shan, Xilin Chen

For one thing, the spatial structure of a pedestrian frame can be used to predict the occluded body parts from the unoccluded body parts of this frame.

Video-Based Person Re-Identification

Paper
Add Code

Cascade RetinaNet: Maintaining Consistency for Single-Stage Object Detection

no code implementations • 16 Jul 2019 • Hongkai Zhang, Hong Chang, Bingpeng Ma, Shiguang Shan, Xilin Chen

Recent researches attempt to improve the detection performance by adopting the idea of cascade for single-stage detectors.

General Classification Object +2

Paper
Add Code

Multi-Task Learning for Audio Visual Active Speaker Detection

no code implementations • The ActivityNet Large-Scale Activity Recognition Challenge Workshop, CVPR 2019 • Yuanhang Zhang, Jingyun Xiao, Shuang Yang, Shiguang Shan

This report describes the approach underlying our submission to the active speaker detection task (task B-2) of ActivityNet Challenge 2019.

Ranked #17 on Audio-Visual Active Speaker Detection on AVA-ActiveSpeaker (using extra training data)

Audio-Visual Active Speaker Detection Lipreading +2

Paper
Add Code

Pose-adaptive Hierarchical Attention Network for Facial Expression Recognition

no code implementations • 24 May 2019 • Yuanyuan Liu, Jiyao Peng, Jiabei Zeng, Shiguang Shan

Multi-view facial expression recognition (FER) is a challenging task because the appearance of an expression varies in poses.

Facial Expression Recognition Facial Expression Recognition (FER)

Paper
Add Code

Weakly Supervised Object Detection with Segmentation Collaboration

no code implementations • ICCV 2019 • Xiaoyan Li, Meina Kan, Shiguang Shan, Xilin Chen

Weakly supervised object detection aims at learning precise object detectors, given image category labels.

General Classification Image Classification +5

Paper
Add Code

Fully Learnable Group Convolution for Acceleration of Deep Neural Networks

no code implementations • CVPR 2019 • Xijun Wang, Meina Kan, Shiguang Shan, Xilin Chen

Benefitted from its great success on many tasks, deep learning is increasingly used on low-computational-cost devices, e. g. smartphone, embedded devices, etc.

Paper
Add Code

WIDER Face and Pedestrian Challenge 2018: Methods and Results

no code implementations • 19 Feb 2019 • Chen Change Loy, Dahua Lin, Wanli Ouyang, Yuanjun Xiong, Shuo Yang, Qingqiu Huang, Dongzhan Zhou, Wei Xia, Quanquan Li, Ping Luo, Junjie Yan, Jian-Feng Wang, Zuoxin Li, Ye Yuan, Boxun Li, Shuai Shao, Gang Yu, Fangyun Wei, Xiang Ming, Dong Chen, Shifeng Zhang, Cheng Chi, Zhen Lei, Stan Z. Li, Hongkai Zhang, Bingpeng Ma, Hong Chang, Shiguang Shan, Xilin Chen, Wu Liu, Boyan Zhou, Huaxiong Li, Peng Cheng, Tao Mei, Artem Kukharenko, Artem Vasenin, Nikolay Sergievskiy, Hua Yang, Liangqi Li, Qiling Xu, Yuan Hong, Lin Chen, Mingjun Sun, Yirong Mao, Shiying Luo, Yongjun Li, Ruiping Wang, Qiaokang Xie, Ziyang Wu, Lei Lu, Yiheng Liu, Wengang Zhou

This paper presents a review of the 2018 WIDER Challenge on Face and Pedestrian.

Face Detection Pedestrian Detection +2

Paper
Add Code

Tattoo Image Search at Scale: Joint Detection and Compact Representation Learning

no code implementations • 1 Nov 2018 • Hu Han, Jie Li, Anil K. Jain, Shiguang Shan, Xilin Chen

To close the gap, we propose an efficient tattoo search approach that is able to learn tattoo detection and compact representation jointly in a single convolutional neural network (CNN) via multi-task learning.

Image Retrieval Multi-Task Learning +3

Paper
Add Code

LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild

2 code implementations • 16 Oct 2018 • Shuang Yang, Yuan-Hang Zhang, Dalu Feng, Mingmin Yang, Chenhao Wang, Jing-Yun Xiao, Keyu Long, Shiguang Shan, Xilin Chen

It has shown a large variation in this benchmark in several aspects, including the number of samples in each class, video resolution, lighting conditions, and speakers' attributes such as pose, age, gender, and make-up.

Ranked #1 on Lipreading on LRW-1000

Lipreading Lip Reading +2

115

Paper
Code

VIPL-HR: A Multi-modal Database for Pulse Estimation from Less-constrained Face Video

1 code implementation • 11 Oct 2018 • Xuesong Niu, Hu Han, Shiguang Shan, Xilin Chen

We also learn a deep HR estimator (named as RhythmNet) with the proposed spatial-temporal representation, which achieves promising results on both the public-domain and our VIPL-HR HR estimation databases.

Representation Learning

412

Paper
Code

Meta-Learning with Individualized Feature Space for Few-Shot Classification

no code implementations • 27 Sep 2018 • Chunrui Han, Shiguang Shan, Meina Kan, Shuzhe Wu, Xilin Chen

Specifically, we introduce a kernel generator as meta-learner to learn to construct feature embedding for query images.

Classification Meta-Learning +1

Paper
Add Code

Facial Expression Recognition with Inconsistently Annotated Datasets

no code implementations • ECCV 2018 • Jiabei Zeng, Shiguang Shan, Xilin Chen

To address the inconsistency, we propose an Inconsistent Pseudo Annotations to Latent Truth(IPA2LT) framework to train a FER model from multiple inconsistently labeled datasets and large scale unlabeled data.

Facial Expression Recognition Facial Expression Recognition (FER)

Paper
Add Code

Generative Adversarial Network with Spatial Attention for Face Attribute Editing

1 code implementation • ECCV 2018 • Gang Zhang, Meina Kan, Shiguang Shan, Xilin Chen

The generator contains an attribute manipulation network (AMN) to edit the face image, and a spatial attention network (SAN) to localize the attribute-specific region which restricts the alternation of AMN within this region.

Attribute Data Augmentation +2

Paper
Code

Face Recognition with Contrastive Convolution

no code implementations • ECCV 2018 • Chunrui Han, Shiguang Shan, Meina Kan, Shuzhe Wu, Xilin Chen

In current face recognition approaches with convolutional neural network (CNN), a pair of faces to compare are independently fed into the CNN for feature extraction.

Face Recognition Face Verification

Paper
Add Code

Joint Representation and Truncated Inference Learning for Correlation Filter based Tracking

no code implementations • ECCV 2018 • Yingjie Yao, Xiaohe Wu, Lei Zhang, Shiguang Shan, WangMeng Zuo

In existing off-line deep learning models for CF trackers, the model adaptation usually is either abandoned or has closed-form solution to make it feasible to learn deep representation in an end-to-end manner.

Paper
Add Code

Learning Class Prototypes via Structure Alignment for Zero-Shot Recognition

no code implementations • ECCV 2018 • Huajie Jiang, Ruiping Wang, Shiguang Shan, Xilin Chen

Zero-shot learning (ZSL) aims to recognize objects of novel classes without any training samples of specific classes, which is achieved by exploiting the semantic information and auxiliary datasets.

Dictionary Learning Zero-Shot Learning

Paper
Add Code

Structure Inference Net: Object Detection Using Scene-Level Context and Instance-Level Relationships

no code implementations • CVPR 2018 • Yong Liu, Ruiping Wang, Shiguang Shan, Xilin Chen

Context is important for accurate visual recognition.

Object object-detection +1

Paper
Add Code

Duplex Generative Adversarial Network for Unsupervised Domain Adaptation

no code implementations • CVPR 2018 • Lanqing Hu, Meina Kan, Shiguang Shan, Xilin Chen

Following the similar idea of GAN, this work proposes a novel GAN architecture with duplex adversarial discriminators (referred to as DupGAN), which can achieve domain-invariant representation and domain transformation.

Generative Adversarial Network Object Recognition +1

Paper
Add Code

Mean-Variance Loss for Deep Age Estimation From a Face

no code implementations • CVPR 2018 • Hongyu Pan, Hu Han, Shiguang Shan, Xilin Chen

Age estimation has broad application prospects of many fields, such as video surveillance, social networking, and human-computer interaction.

Ranked #3 on Age Estimation on ChaLearn 2016

Age Estimation MORPH

Paper
Add Code

Real-Time Rotation-Invariant Face Detection with Progressive Calibration Networks

1 code implementation • CVPR 2018 • Xuepeng Shi, Shiguang Shan, Meina Kan, Shuzhe Wu, Xilin Chen

Rotation-invariant face detection, i. e. detecting faces with arbitrary rotation-in-plane (RIP) angles, is widely required in unconstrained applications but still remains as a challenging task, due to the large variations of face appearances.

Binary Classification Face Detection

1,074

Paper
Code

Shift-Net: Image Inpainting via Deep Feature Rearrangement

2 code implementations • ECCV 2018 • Zhaoyi Yan, Xiaoming Li, Mu Li, WangMeng Zuo, Shiguang Shan

To this end, the encoder feature of the known region is shifted to serve as an estimation of the missing parts.

Image Inpainting

362

Paper
Code

AttGAN: Facial Attribute Editing by Only Changing What You Want

10 code implementations • 29 Nov 2017 • Zhenliang He, WangMeng Zuo, Meina Kan, Shiguang Shan, Xilin Chen

Based on the encoder-decoder architecture, facial attribute editing is achieved by decoding the latent representation of the given face conditioned on the desired attributes.

Attribute

594

Paper
Code

Recursive Spatial Transformer (ReST) for Alignment-Free Face Recognition

no code implementations • ICCV 2017 • Wanglong Wu, Meina Kan, Xin Liu, Yi Yang, Shiguang Shan, Xilin Chen

The designed ReST has an intrinsic recursive structure and is capable of progressively aligning faces to a canonical one, even those with large variations.

Face Alignment Face Recognition

Paper
Add Code

Learning Discriminative Latent Attributes for Zero-Shot Classification

no code implementations • ICCV 2017 • Huajie Jiang, Ruiping Wang, Shiguang Shan, Yi Yang, Xilin Chen

Zero-shot learning (ZSL) aims to transfer knowledge from observed classes to the unseen classes, based on the assumption that both the seen and unseen classes share a common semantic space, among which attributes enjoy a great popularity.

Attribute Classification +3

Paper
Add Code

Learning Multifunctional Binary Codes for Both Category and Attribute Oriented Retrieval Tasks

no code implementations • CVPR 2017 • Haomiao Liu, Ruiping Wang, Shiguang Shan, Xilin Chen

In this paper we propose a unified framework to address multiple realistic image retrieval tasks concerning both category and attributes.

Attribute Image Retrieval +1

Paper
Add Code

Discriminative Covariance Oriented Representation Learning for Face Recognition With Image Sets

no code implementations • CVPR 2017 • Wen Wang, Ruiping Wang, Shiguang Shan, Xilin Chen

For face recognition with image sets, while most existing works mainly focus on building robust set models with hand-crafted feature, it remains a research gap to learn better image representations which can closely match the subsequent image set modeling and classification.

Face Recognition General Classification +2

Paper
Add Code

Heterogeneous Face Attribute Estimation: A Deep Multi-Task Learning Approach

no code implementations • 3 Jun 2017 • Hu Han, Anil K. Jain, Fang Wang, Shiguang Shan, Xilin Chen

In DMTL, we tackle attribute correlation and heterogeneity with convolutional neural networks (CNNs) consisting of shared feature learning for all the attributes, and category-specific feature learning for heterogeneous attributes.

Ranked #5 on Facial Attribute Classification on LFWA

Attribute Facial Attribute Classification +4

Paper
Add Code

Funnel-Structured Cascade for Multi-View Face Detection with Alignment-Awareness

no code implementations • 23 Sep 2016 • Shuzhe Wu, Meina Kan, Zhenliang He, Shiguang Shan, Xilin Chen

On the other hand, by using a unified MLP cascade to examine proposals of all views in a centralized style, it provides a favorable solution for multi-view face detection with high accuracy and low time-cost.

Face Alignment Face Detection

Paper
Add Code

VIPLFaceNet: An Open Source Deep Face Recognition SDK

no code implementations • 13 Sep 2016 • Xin Liu, Meina Kan, Wanglong Wu, Shiguang Shan, Xilin Chen

Robust face representation is imperative to highly accurate face recognition.

Face Recognition

Paper
Add Code

Geometry-aware Similarity Learning on SPD Manifolds for Visual Recognition

no code implementations • 17 Aug 2016 • Zhiwu Huang, Ruiping Wang, Xianqiu Li, Wenxian Liu, Shiguang Shan, Luc van Gool, Xilin Chen

Specifically, by exploiting the Riemannian geometry of the manifold of fixed-rank Positive Semidefinite (PSD) matrices, we present a new solution to reduce optimizing over the space of column full-rank transformation matrices to optimizing on the PSD manifold which has a well-established Riemannian structure.

Paper
Add Code

Cross Euclidean-to-Riemannian Metric Learning with Application to Face Recognition from Video

no code implementations • 15 Aug 2016 • Zhiwu Huang, Ruiping Wang, Shiguang Shan, Luc van Gool, Xilin Chen

With this mapping, the problem of learning a cross-view metric between the two source heterogeneous spaces can be expressed as learning a single-view Euclidean distance metric in the target common Euclidean space.

Face Recognition Metric Learning

Paper
Add Code

Self-paced Learning for Weakly Supervised Evidence Discovery in Multimedia Event Search

no code implementations • 12 Aug 2016 • Mengyi Liu, Lu Jiang, Shiguang Shan, Alexander G. Hauptmann

Multimedia event detection has been receiving increasing attention in recent years.

Event Detection

Paper
Add Code

Dual Purpose Hashing

no code implementations • 19 Jul 2016 • Haomiao Liu, Ruiping Wang, Shiguang Shan, Xilin Chen

Recent years have seen more and more demand for a unified framework to address multiple realistic image retrieval tasks concerning both category and attributes.

Attribute Image Retrieval +1

Paper
Add Code

Deep Supervised Hashing for Fast Image Retrieval

1 code implementation • CVPR 2016 • Haomiao Liu, Ruiping Wang, Shiguang Shan, Xilin Chen

In this paper, we present a new hashing method to learn compact binary codes for highly efficient image retrieval on large-scale datasets.

Ranked #1 on Image Retrieval on CIFAR-10

Image Retrieval Retrieval

Paper
Code

Multi-View Deep Network for Cross-View Classification

no code implementations • CVPR 2016 • Meina Kan, Shiguang Shan, Xilin Chen

As a result, the representation from the topmost layers of the MvDN network is robust to view discrepancy, and also discriminative.

Classification Face Recognition +1

Paper
Add Code

Occlusion-Free Face Alignment: Deep Regression Networks Coupled With De-Corrupt AutoEncoders

no code implementations • CVPR 2016 • Jie Zhang, Meina Kan, Shiguang Shan, Xilin Chen

Face alignment or facial landmark detection plays an important role in many computer vision applications, e. g., face recognition, facial expression recognition, face animation, etc.

Face Alignment Face Recognition +4

Paper
Add Code

Bi-Shifting Auto-Encoder for Unsupervised Domain Adaptation

no code implementations • ICCV 2015 • Meina Kan, Shiguang Shan, Xilin Chen

To alleviate the discrepancy between source and target domains, we propose a domain adaptation method, named as Bi-shifting Auto-Encoder network (BAE).

Face Recognition Unsupervised Domain Adaptation

Paper
Add Code

Leveraging Datasets With Varying Annotations for Face Alignment via Deep Regression Network

no code implementations • ICCV 2015 • Jie Zhang, Meina Kan, Shiguang Shan, Xilin Chen

Facial landmark detection, as a vital topic in computer vision, has been studied for many decades and lots of datasets have been collected for evaluation.

Face Alignment Facial Landmark Detection +1

Paper
Add Code

Two Birds, One Stone: Jointly Learning Binary Code for Large-Scale Face Image Retrieval and Attributes Prediction

no code implementations • ICCV 2015 • Yan Li, Ruiping Wang, Haomiao Liu, Huajie Jiang, Shiguang Shan, Xilin Chen

In this way, the learned binary codes can be applied to not only fine-grained face image retrieval, but also facial attributes prediction, which is the very innovation of this work, just like killing two birds with one stone.

Face Image Retrieval Retrieval

Paper
Add Code

A Unified Multiplicative Framework for Attribute Learning

no code implementations • ICCV 2015 • Kongming Liang, Hong Chang, Shiguang Shan, Xilin Chen

Attributes are mid-level semantic properties of objects.

Attribute Zero-Shot Learning

Paper
Add Code

Learning Expressionlets via Universal Manifold Model for Dynamic Facial Expression Recognition

no code implementations • 16 Nov 2015 • Mengyi Liu, Shiguang Shan, Ruiping Wang, Xilin Chen

3) the local modes on each STM can be instantiated by fitting to UMM, and the corresponding expressionlet is constructed by modeling the variations in each local mode.

Dynamic Facial Expression Recognition Facial Expression Recognition +1

Paper
Add Code

Learning Mid-level Words on Riemannian Manifold for Action Recognition

no code implementations • 16 Nov 2015 • Mengyi Liu, Ruiping Wang, Shiguang Shan, Xilin Chen

Human action recognition remains a challenging task due to the various sources of video data and large intra-class variations.

Action Recognition Clustering +1

Paper
Add Code

AgeNet: Deeply Learned Regressor and Classifier for Robust Apparent Age Estimation

no code implementations • ICCV Workshop 2015 • Xin Liu, Shaoxin Li, Meina Kan, Jie Zhang, Shuzhe Wu, Wenxian Liu, Hu Han, Shiguang Shan, Xilin Chen

Another key feature of the proposed AgeNet is that, to avoid the problem of over-fitting on small apparent age training set, we exploit a general-to-specific transfer learning scheme.

Ranked #4 on Age Estimation on ChaLearn 2015

Age Estimation Transfer Learning

Paper
Add Code

Cross-pose Face Recognition by Canonical Correlation Analysis

no code implementations • 29 Jul 2015 • Annan Li, Shiguang Shan, Xilin Chen, Bingpeng Ma, Shuicheng Yan, Wen Gao

We argue that one of the diffculties in this problem is the severe misalignment in face images or feature vectors with different poses.

Face Recognition

Paper
Add Code

Shape Driven Kernel Adaptation in Convolutional Neural Network for Robust Facial Traits Recognition

no code implementations • CVPR 2015 • Shaoxin Li, Junliang Xing, Zhiheng Niu, Shiguang Shan, Shuicheng Yan

Comprehensive experiments on WebFace, Morph II and MultiPIE databases well validate the effectiveness of the proposed kernel adaptation method and tree-structured convolutional architecture for facial traits recognition tasks, including identity, age and gender classification.

Age And Gender Classification Gender Classification +1

Paper
Add Code

Projection Metric Learning on Grassmann Manifold With Application to Video Based Face Recognition

no code implementations • CVPR 2015 • Zhiwu Huang, Ruiping Wang, Shiguang Shan, Xilin Chen

In video based face recognition, great success has been made by representing videos as linear subspaces, which typically lie in a special type of non-Euclidean space known as Grassmann manifold.

Dimensionality Reduction Face Recognition +1

Paper
Add Code

Discriminant Analysis on Riemannian Manifold of Gaussian Distributions for Face Recognition With Image Sets

no code implementations • CVPR 2015 • Wen Wang, Ruiping Wang, Zhiwu Huang, Shiguang Shan, Xilin Chen

This paper presents a method named Discriminant Analysis on Riemannian manifold of Gaussian distributions (DARG) to solve the problem of face recognition with image sets.

Face Identification Face Recognition +1

Paper
Add Code

Face Video Retrieval With Image Query via Hashing Across Euclidean Space and Riemannian Manifold

no code implementations • CVPR 2015 • Yan Li, Ruiping Wang, Zhiwu Huang, Shiguang Shan, Xilin Chen

Retrieving videos of a specific person given his/her face image as query becomes more and more appealing for applications like smart movie fast-forwards and suspect searching.

Retrieval Video Retrieval

Paper
Add Code

Self-Paced Learning with Diversity

no code implementations • NeurIPS 2014 • Lu Jiang, Deyu Meng, Shoou-I Yu, Zhenzhong Lan, Shiguang Shan, Alexander Hauptmann

Self-paced learning (SPL) is a recently proposed learning regime inspired by the learning process of humans and animals that gradually incorporates easy to more complex samples into training.

Paper
Add Code

Generalized Unsupervised Manifold Alignment

no code implementations • NeurIPS 2014 • Zhen Cui, Hong Chang, Shiguang Shan, Xilin Chen

In this paper, we propose a generalized Unsupervised Manifold Alignment (GUMA) method to build the connections between different but correlated datasets without any known correspondences.

Paper
Add Code

Stacked Progressive Auto-Encoders (SPAE) for Face Recognition Across Poses

no code implementations • CVPR 2014 • Meina Kan, Shiguang Shan, Hong Chang, Xilin Chen

Identifying subjects with variations caused by poses is one of the most challenging tasks in face recognition, since the difference in appearances caused by poses may be even larger than the difference due to identity.

Face Recognition Pose Estimation

Paper
Add Code

Adaptive Partial Differential Equation Learning for Visual Saliency Detection

no code implementations • CVPR 2014 • Risheng Liu, Junjie Cao, Zhouchen Lin, Shiguang Shan

Then by optimizing a discrete submodular function constrained with this LESD and a uniform matroid, the saliency seeds (i. e., boundary conditions) can be learnt for this image, thus achieving an optimal PDE system to model the evolution of visual saliency.

Saliency Detection

Paper
Add Code

Learning Expressionlets on Spatio-Temporal Manifold for Dynamic Facial Expression Recognition

no code implementations • CVPR 2014 • Mengyi Liu, Shiguang Shan, Ruiping Wang, Xilin Chen

In this paper, we attempt to solve both problems via manifold modeling of videos based on a novel mid-level representation, i. e. expressionlet.

Dynamic Facial Expression Recognition Facial Expression Recognition +1

Paper
Add Code

Learning Euclidean-to-Riemannian Metric for Point-to-Set Classification

no code implementations • CVPR 2014 • Zhiwu Huang, Ruiping Wang, Shiguang Shan, Xilin Chen

Since the points commonly lie in Euclidean space while the sets are typically modeled as elements on Riemannian manifold, they can be treated as Euclidean points and Riemannian points respectively.

Classification General Classification +1

Paper
Add Code

Deeply Coupled Auto-encoder Networks for Cross-view Classification

no code implementations • 10 Feb 2014 • Wen Wang, Zhen Cui, Hong Chang, Shiguang Shan, Xilin Chen

In this paper, we propose a simple but effective coupled neural network, called Deeply Coupled Autoencoder Networks (DCAN), which seeks to build two deep neural networks, coupled with each other in every corresponding layers.

Classification Denoising +2

Paper
Add Code

Fusing Robust Face Region Descriptors via Multiple Metric Learning for Face Recognition in the Wild

no code implementations • CVPR 2013 • Zhen Cui, Wen Li, Dong Xu, Shiguang Shan, Xilin Chen

Spatial-Temporal Face Region Descriptor, STFRD) for images (resp.

Face Recognition Face Verification +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.