Understanding Pixel-level 2D Image Semantics with 3D Keypoint Knowledge Engine

no code implementations • 21 Nov 2021 • Yang You, Chengkun Li, Yujing Lou, Zhoujun Cheng, Liangwei Li, Lizhuang Ma, Weiming Wang, Cewu Lu

Pixel-level 2D object semantic understanding is an important topic in computer vision and could help machines deeply understand objects (e.g., functionality and affordance) in our daily life.

Domain Adaptive Semantic Segmentation with Regional Contrastive Consistency Regularization

no code implementations • 11 Oct 2021 • Qianyu Zhou, Chuyun Zhuang, Xuequan Lu, Lizhuang Ma

Motivated by the above facts, we propose a novel and fully end-to-end trainable approach, called regional contrastive consistency regularization (RCCR) for domain adaptive semantic segmentation.

Spatiotemporal Inconsistency Learning for DeepFake Video Detection

no code implementations • 4 Sep 2021 • Zhihao Gu, Yang Chen, Taiping Yao, Shouhong Ding, Jilin Li, Feiyue Huang, Lizhuang Ma

To address this issue, we term this task as a Spatial-Temporal Inconsistency Learning (STIL) process and instantiate it into a novel STIL block, which consists of a Spatial Inconsistency Module (SIM), a Temporal Inconsistency Module (TIM), and an Information Supplement Module (ISM).

PIT: Position-Invariant Transform for Cross-FoV Domain Adaptation

1 code implementation • ICCV 2021 • Qiqi Gu, Qianyu Zhou, Minghao Xu, Zhengyang Feng, Guangliang Cheng, Xuequan Lu, Jianping Shi, Lizhuang Ma

Extensive experiments demonstrate that our method can soundly boost the performance on both cross-domain object detection and segmentation for state-of-the-art techniques.

Semi-supervised 3D Object Detection via Adaptive Pseudo-Labeling

no code implementations • 15 Aug 2021 • Hongyi Xu, Fengqi Liu, Qianyu Zhou, Jinkun Hao, Zhijie Cao, Zhengyang Feng, Lizhuang Ma

Inspired by this, we propose a novel semi-supervised framework based on pseudo-labeling for outdoor 3D object detection tasks.

Adaptive Normalized Representation Learning for Generalizable Face Anti-Spoofing

no code implementations • 5 Aug 2021 • Shubao Liu, Ke-Yue Zhang, Taiping Yao, Mingwei Bi, Shouhong Ding, Jilin Li, Feiyue Huang, Lizhuang Ma

However, little attention has been paid to the feature extraction process for the FAS task, especially the influence of normalization, which also has a great impact on the generalization of the learned representation.

Dual Reweighting Domain Generalization for Face Presentation Attack Detection

no code implementations • 30 Jun 2021 • Shubao Liu, Ke-Yue Zhang, Taiping Yao, Kekai Sheng, Shouhong Ding, Ying Tai, Jilin Li, Yuan Xie, Lizhuang Ma

Face anti-spoofing approaches based on domain generalization (DG) have drawn growing attention due to their robustness to unseen scenarios.

Novelty Detection via Contrastive Learning with Negative Data Augmentation

no code implementations • 18 Jun 2021 • Chengwei Chen, Yuan Xie, Shaohui Lin, Ruizhi Qiao, Jian Zhou, Xin Tan, Yi Zhang, Lizhuang Ma

Moreover, because it is trained in a non-adversarial manner, our model is more stable to train than other adversarial-based novelty detection methods.

End-to-End Video Object Detection with Spatial-Temporal Transformers

no code implementations • 23 May 2021 • Lu He, Qianyu Zhou, Xiangtai Li, Li Niu, Guangliang Cheng, Xiao Li, Wenxuan Liu, Yunhai Tong, Lizhuang Ma, Liqing Zhang

Recently, DETR and Deformable DETR have been proposed to eliminate the need for many hand-designed components in object detection while performing as well as previous complex hand-crafted detectors.

Contrastive Learning for Compact Single Image Dehazing

2 code implementations • CVPR 2021 • Haiyan Wu, Yanyun Qu, Shaohui Lin, Jian Zhou, Ruizhi Qiao, Zhizhong Zhang, Yuan Xie, Lizhuang Ma

In this paper, we propose a novel contrastive regularization (CR) built upon contrastive learning to exploit both the information of hazy images and clear images as negative and positive samples, respectively.

Farewell to Mutual Information: Variational Distillation for Cross-Modal Person Re-Identification

1 code implementation • CVPR 2021 • Xudong Tian, Zhizhong Zhang, Shaohui Lin, Yanyun Qu, Yuan Xie, Lizhuang Ma

The Information Bottleneck (IB) provides an information-theoretic principle for representation learning by retaining all information relevant for predicting the label while minimizing redundancy.

"Forget" the Forget Gate: Estimating Anomalies in Videos using Self-contained Long Short-Term Memory Networks

no code implementations • 3 Apr 2021 • Habtamu Fanta, Zhiwen Shao, Lizhuang Ma

Abnormal event detection is a challenging task that requires effectively handling intricate features of appearance and motion.

PRIN/SPRIN: On Extracting Point-wise Rotation Invariant Features

2 code implementations • 24 Feb 2021 • Yang You, Yujing Lou, Ruoxi Shi, Qi Liu, Yu-Wing Tai, Lizhuang Ma, Weiming Wang, Cewu Lu

Spherical Voxel Convolution and Point Re-sampling are proposed to extract rotation invariant features for each point.

Boundary-Aware Geometric Encoding for Semantic Segmentation of Point Clouds

no code implementations • 7 Jan 2021 • Jingyu Gong, Jiachen Xu, Xin Tan, Jie Zhou, Yanyun Qu, Yuan Xie, Lizhuang Ma

Boundary information plays a significant role in 2D image segmentation, but it is usually ignored in 3D point cloud segmentation, where ambiguous features might be generated during feature extraction, leading to misclassification in the transition area between two objects.

Weakly-Supervised Saliency Detection via Salient Object Subitizing

no code implementations • 4 Jan 2021 • Xiaoyang Zheng, Xin Tan, Jie Zhou, Lizhuang Ma, Rynson W. H. Lau

This allows the supervision to be aligned with the property of saliency detection, where the salient objects of an image could be from more than one class.

Canonical Voting: Towards Robust Oriented Bounding Box Detection in 3D Scenes

1 code implementation • 24 Nov 2020 • Yang You, Zelin Ye, Yujing Lou, Chengkun Li, Yong-Lu Li, Lizhuang Ma, Weiming Wang, Cewu Lu

In this work, we disentangle the direct offset into Local Canonical Coordinates (LCC), box scales, and box orientations.

Brain Tumor Anomaly Detection via Latent Regularized Adversarial Network

no code implementations • 9 Jul 2020 • Nan Wang, Chengwei Chen, Yuan Xie, Lizhuang Ma

The brain structure in the collected data is complicated; hence, doctors must spend considerable effort when diagnosing brain abnormalities.

Spoof Face Detection Via Semi-Supervised Adversarial Training

no code implementations • 22 May 2020 • Chengwei Chen, Wang Yuan, Xuequan Lu, Lizhuang Ma

To capture the underlying structure of live face data in the latent representation space, we propose to train on live face data only, with a convolutional Encoder-Decoder network acting as a Generator.

Fine-Grained Expression Manipulation via Structured Latent Space

no code implementations • 21 Apr 2020 • Junshu Tang, Zhiwen Shao, Lizhuang Ma

Most existing expression manipulation methods resort to discrete expression labels, which mainly edit global expressions and ignore the manipulation of fine details.

Semantic Correspondence via 2D-3D-2D Cycle

1 code implementation • 20 Apr 2020 • Yang You, Chengkun Li, Yujing Lou, Zhoujun Cheng, Lizhuang Ma, Cewu Lu, Weiming Wang

Visual semantic correspondence is an important topic in computer vision and could help machines understand objects in our daily life.

SiTGRU: Single-Tunnelled Gated Recurrent Unit for Abnormality Detection

no code implementations • 30 Mar 2020 • Habtamu Fanta, Zhiwen Shao, Lizhuang Ma

In this paper, we propose a novel version of the Gated Recurrent Unit (GRU), called the Single-Tunnelled GRU, for abnormality detection.

JÂA-Net: Joint Facial Action Unit Detection and Face Alignment via Adaptive Attention

1 code implementation • 18 Mar 2020 • Zhiwen Shao, Zhilei Liu, Jianfei Cai, Lizhuang Ma

Moreover, to extract precise local features, we propose an adaptive attention learning module to refine the attention map of each AU adaptively.

Night-time Scene Parsing with a Large Real Dataset

no code implementations • 15 Mar 2020 • Xin Tan, Ke Xu, Ying Cao, Yiheng Zhang, Lizhuang Ma, Rynson W. H. Lau

Although huge progress has been made on scene analysis in recent years, most existing works assume that the input images are taken in the daytime under good lighting conditions.

Anomaly Detection by One Class Latent Regularized Networks

no code implementations • 5 Feb 2020 • Chengwei Chen, Pan Chen, Haichuan Song, Yiqing Tao, Yuan Xie, Shouhong Ding, Lizhuang Ma

Anomaly detection is a fundamental problem in computer vision with many real-world applications.

Novelty Detection via Non-Adversarial Generative Network

no code implementations • 3 Feb 2020 • Chengwei Chen, Wang Yuan, Yuan Xie, Yanyun Qu, Yiqing Tao, Haichuan Song, Lizhuang Ma

One-class novelty detection is the process of determining if a query example differs from the training examples (the target class).

SceneEncoder: Scene-Aware Semantic Segmentation of Point Clouds with A Learnable Scene Descriptor

1 code implementation • 24 Jan 2020 • Jiachen Xu, Jingyu Gong, Jie Zhou, Xin Tan, Yuan Xie, Lizhuang Ma

Besides local features, global information plays an essential role in semantic segmentation, yet recent works usually fail to explicitly extract meaningful global information and make full use of it.

Spatio-Temporal Relation and Attention Learning for Facial Action Unit Detection

no code implementations • 5 Jan 2020 • Zhiwen Shao, Lixin Zou, Jianfei Cai, Yunsheng Wu, Lizhuang Ma

Specifically, we introduce a spatio-temporal graph convolutional network to capture both spatial and temporal relations from dynamic AUs, in which the AU relations are formulated as a spatio-temporal graph whose edge weights are adaptively learned instead of predefined.

Explicit Facial Expression Transfer via Fine-Grained Representations

no code implementations • 6 Sep 2019 • Zhiwen Shao, Hengliang Zhu, Junshu Tang, Xuequan Lu, Lizhuang Ma

Instead of using an intermediate estimated guidance, we propose to explicitly transfer facial expression by directly mapping two unpaired input images to two synthesized images with swapped expressions.

FVNet: 3D Front-View Proposal Generation for Real-Time Object Detection from Point Clouds

no code implementations • 26 Mar 2019 • Jie Zhou, Xin Tan, Zhiwei Shao, Lizhuang Ma

We then introduce a proposal generation network to predict 3D region proposals from the generated maps and further extrude objects of interest from the whole point cloud.

Unconstrained Facial Action Unit Detection via Latent Feature Domain

1 code implementation • 25 Mar 2019 • Zhiwen Shao, Jianfei Cai, Tat-Jen Cham, Xuequan Lu, Lizhuang Ma

Due to the combination of source AU-related information and target AU-free information, the latent feature domain with transferred source label can be learned by maximizing the target-domain AU detection performance.

Efficient Super Resolution Using Binarized Neural Network

no code implementations • 16 Dec 2018 • Yinglan Ma, Hongyu Xiong, Zhe Hu, Lizhuang Ma

As a way to significantly reduce model size and computation time, binarized neural networks have only been shown to excel on semantic-level tasks such as image classification and recognition.

Pointwise Rotation-Invariant Network with Adaptive Sampling and 3D Spherical Voxel Convolution

1 code implementation • 23 Nov 2018 • Yang You, Yujing Lou, Qi Liu, Yu-Wing Tai, Lizhuang Ma, Cewu Lu, Weiming Wang

Point cloud analysis without pose priors is very challenging in real applications, as the orientations of point clouds are often unknown.

Facial Action Unit Detection Using Attention and Relation Learning

no code implementations • 10 Aug 2018 • Zhiwen Shao, Zhilei Liu, Jianfei Cai, Yunsheng Wu, Lizhuang Ma

By finding the region of interest of each AU with the attention mechanism, AU-related local features can be captured.

Deep Multi-Center Learning for Face Alignment

1 code implementation • 5 Aug 2018 • Zhiwen Shao, Hengliang Zhu, Xin Tan, Yangyang Hao, Lizhuang Ma

Most existing deep learning methods use only one fully-connected layer, called the shape prediction layer, to estimate the locations of facial landmarks.

DRPose3D: Depth Ranking in 3D Human Pose Estimation

no code implementations • 23 May 2018 • Min Wang, Xipeng Chen, Wentao Liu, Chen Qian, Liang Lin, Lizhuang Ma

In this paper, we propose a two-stage depth-ranking-based method (DRPose3D) to tackle the problem of 3D human pose estimation.

Mask-aware Photorealistic Face Attribute Manipulation

no code implementations • 24 Apr 2018 • Ruoqi Sun, Chen Huang, Jianping Shi, Lizhuang Ma

The task of face attribute manipulation has found increasing applications, but still remains challenging with the requirement of editing the attributes of a face image while preserving its unique details.

Deep Adaptive Attention for Joint Facial Action Unit Detection and Face Alignment

1 code implementation • ECCV 2018 • Zhiwen Shao, Zhilei Liu, Jianfei Cai, Lizhuang Ma

Facial action unit (AU) detection and face alignment are two highly correlated tasks since facial landmarks can provide precise AU locations to facilitate the extraction of meaningful local features for AU detection.

Multi-Scale Video Frame-Synthesis Network with Transitive Consistency Loss

no code implementations • 7 Dec 2017 • Zhe Hu, Yinglan Ma, Lizhuang Ma

Traditional approaches to interpolate/extrapolate frames in a video sequence require accurate pixel correspondences between images, e.g., using optical flow.

Learning deep representation from coarse to fine for face alignment

no code implementations • 31 Jul 2016 • Zhiwen Shao, Shouhong Ding, Yiru Zhao, Qinchuan Zhang, Lizhuang Ma

In this paper, we propose a novel face alignment method that trains a deep convolutional network from coarse to fine.

