Search Results for author: Lizhuang Ma

Found 99 papers, 49 papers with code

Learning deep representation from coarse to fine for face alignment

no code implementations • 31 Jul 2016 • Zhiwen Shao, Shouhong Ding, Yiru Zhao, Qinchuan Zhang, Lizhuang Ma

In this paper, we propose a novel face alignment method that trains deep convolutional network from coarse to fine.

Multi-Scale Video Frame-Synthesis Network with Transitive Consistency Loss

no code implementations • 7 Dec 2017 • Zhe Hu, Yinglan Ma, Lizhuang Ma

Traditional approaches to interpolate/extrapolate frames in a video sequence require accurate pixel correspondences between images, e.g., using optical flow.

Optical Flow Estimation

Deep Adaptive Attention for Joint Facial Action Unit Detection and Face Alignment

1 code implementation • ECCV 2018 • Zhiwen Shao, Zhilei Liu, Jianfei Cai, Lizhuang Ma

Facial action unit (AU) detection and face alignment are two highly correlated tasks since facial landmarks can provide precise AU locations to facilitate the extraction of meaningful local features for AU detection.

Ranked #5 on Facial Action Unit Detection on DISFA

Action Unit Detection Face Alignment +1

Mask-aware Photorealistic Face Attribute Manipulation

no code implementations • 24 Apr 2018 • Ruoqi Sun, Chen Huang, Jianping Shi, Lizhuang Ma

The task of face attribute manipulation has found increasing applications, but still remains challenging with the requirement of editing the attributes of a face image while preserving its unique details.

Attribute Face Recognition +1

DRPose3D: Depth Ranking in 3D Human Pose Estimation

no code implementations • 23 May 2018 • Min Wang, Xipeng Chen, Wentao Liu, Chen Qian, Liang Lin, Lizhuang Ma

In this paper, we propose a two-stage depth ranking based method (DRPose3D) to tackle the problem of 3D human pose estimation.

3D Human Pose Estimation 3D Pose Estimation

Deep attention-guided fusion network for lesion segmentation

no code implementations • 23 Jul 2018 • Hengliang Zhu, Yangyang Hao, Lizhuang Ma, Ruixing Li, Hua Wang

We participated in Task 1: Lesion Segmentation.

Deep Attention Lesion Segmentation +1

Deep Multi-Center Learning for Face Alignment

1 code implementation • 5 Aug 2018 • Zhiwen Shao, Hengliang Zhu, Xin Tan, Yangyang Hao, Lizhuang Ma

Most of the existing deep learning methods only use one fully-connected layer called shape prediction layer to estimate the locations of facial landmarks.

Ranked #3 on Face Alignment on AFLW2000

Face Alignment

Facial Action Unit Detection Using Attention and Relation Learning

no code implementations • 10 Aug 2018 • Zhiwen Shao, Zhilei Liu, Jianfei Cai, Yunsheng Wu, Lizhuang Ma

By finding the region of interest of each AU with the attention mechanism, AU-related local features can be captured.

Action Unit Detection Facial Action Unit Detection +1

Pointwise Rotation-Invariant Network with Adaptive Sampling and 3D Spherical Voxel Convolution

1 code implementation • 23 Nov 2018 • Yang You, Yujing Lou, Qi Liu, Yu-Wing Tai, Lizhuang Ma, Cewu Lu, Weiming Wang

Point cloud analysis without pose priors is very challenging in real applications, as the orientations of point clouds are often unknown.

3D Feature Matching Data Augmentation

Efficient Super Resolution Using Binarized Neural Network

no code implementations • 16 Dec 2018 • Yinglan Ma, Hongyu Xiong, Zhe Hu, Lizhuang Ma

As a way to significantly reduce model size and computation time, binarized neural network has only been shown to excel on semantic-level tasks such as image classification and recognition.

Binarization Image Classification +3

Unconstrained Facial Action Unit Detection via Latent Feature Domain

1 code implementation • 25 Mar 2019 • Zhiwen Shao, Jianfei Cai, Tat-Jen Cham, Xuequan Lu, Lizhuang Ma

Due to the combination of source AU-related information and target AU-free information, the latent feature domain with transferred source label can be learned by maximizing the target-domain AU detection performance.

Action Unit Detection Domain Adaptation +2

FVNet: 3D Front-View Proposal Generation for Real-Time Object Detection from Point Clouds

no code implementations • 26 Mar 2019 • Jie Zhou, Xin Tan, Zhiwen Shao, Lizhuang Ma

We then introduce a proposal generation network to predict 3D region proposals from the generated maps and further extrude objects of interest from the whole point cloud.

3D Object Detection Object +2

Explicit Facial Expression Transfer via Fine-Grained Representations

no code implementations • 6 Sep 2019 • Zhiwen Shao, Hengliang Zhu, Junshu Tang, Xuequan Lu, Lizhuang Ma

Instead of using an intermediate estimated guidance, we propose to explicitly transfer facial expression by directly mapping two unpaired input images to two synthesized images with swapped expressions.

Multi-class Classification

Human Correspondence Consensus for 3D Object Semantic Understanding

1 code implementation • ECCV 2020 • Yujing Lou, Yang You, Chengkun Li, Zhoujun Cheng, Liangwei Li, Lizhuang Ma, Weiming Wang, Cewu Lu

Semantic understanding of 3D objects is crucial in many applications such as object manipulation.

3D Feature Matching 3D Point Cloud Matching +1

SceneEncoder: Scene-Aware Semantic Segmentation of Point Clouds with A Learnable Scene Descriptor

1 code implementation • 24 Jan 2020 • Jiachen Xu, Jingyu Gong, Jie Zhou, Xin Tan, Yuan Xie, Lizhuang Ma

Besides local features, global information plays an essential role in semantic segmentation, while recent works usually fail to explicitly extract the meaningful global information and make full use of it.

Segmentation Semantic Segmentation

Novelty Detection via Non-Adversarial Generative Network

no code implementations • 3 Feb 2020 • Chengwei Chen, Wang Yuan, Yuan Xie, Yanyun Qu, Yiqing Tao, Haichuan Song, Lizhuang Ma

One-class novelty detection is the process of determining if a query example differs from the training examples (the target class).

Image Reconstruction Novelty Detection

Acoustic anomaly detection via latent regularized gaussian mixture generative adversarial networks

no code implementations • 4 Feb 2020 • Chengwei Chen, Pan Chen, Lingyu Yang, Jinyuan Mo, Haichuan Song, Yuan Xie, Lizhuang Ma

Acoustic anomaly detection aims at distinguishing abnormal acoustic signals from the normal ones.

Anomaly Detection Generative Adversarial Network

Anomaly Detection by One Class Latent Regularized Networks

no code implementations • 5 Feb 2020 • Chengwei Chen, Pan Chen, Haichuan Song, Yiqing Tao, Yuan Xie, Shouhong Ding, Lizhuang Ma

Anomaly detection is a fundamental problem in the computer vision area with many real-world applications.

Anomaly Detection

KeypointNet: A Large-scale 3D Keypoint Dataset Aggregated from Numerous Human Annotations

1 code implementation • CVPR 2020 • Yang You, Yujing Lou, Chengkun Li, Zhoujun Cheng, Liangwei Li, Lizhuang Ma, Weiming Wang, Cewu Lu

Detecting 3D object keypoints is of great interest to the areas of both graphics and computer vision.

Night-time Scene Parsing with a Large Real Dataset

no code implementations • 15 Mar 2020 • Xin Tan, Ke Xu, Ying Cao, Yiheng Zhang, Lizhuang Ma, Rynson W. H. Lau

Although huge progress has been made on scene analysis in recent years, most existing works assume the input images to be in day-time with good lighting conditions.

Scene Parsing Semantic Segmentation

J$\hat{\text{A}}$A-Net: Joint Facial Action Unit Detection and Face Alignment via Adaptive Attention

1 code implementation • 18 Mar 2020 • Zhiwen Shao, Zhilei Liu, Jianfei Cai, Lizhuang Ma

Moreover, to extract precise local features, we propose an adaptive attention learning module to refine the attention map of each AU adaptively.

Action Unit Detection Face Alignment +1

Monocular Human Pose and Shape Reconstruction using Part Differentiable Rendering

no code implementations • 24 Mar 2020 • Min Wang, Feng Qiu, Wentao Liu, Chen Qian, Xiaowei Zhou, Lizhuang Ma

In this paper, we introduce body part segmentation as critical supervision.

Ranked #88 on 3D Human Pose Estimation on Human3.6M (PA-MPJPE metric)

3D Human Pose Estimation 3D Pose Estimation +3

SiTGRU: Single-Tunnelled Gated Recurrent Unit for Abnormality Detection

no code implementations • 30 Mar 2020 • Habtamu Fanta, Zhiwen Shao, Lizhuang Ma

In this paper, we propose a novel version of Gated Recurrent Unit (GRU), called Single Tunnelled GRU for abnormality detection.

Anomaly Detection

DMT: Dynamic Mutual Training for Semi-Supervised Learning

1 code implementation • 18 Apr 2020 • Zhengyang Feng, Qianyu Zhou, Qiqi Gu, Xin Tan, Guangliang Cheng, Xuequan Lu, Jianping Shi, Lizhuang Ma

Instead, leveraging inter-model disagreement between different models is key to locating pseudo-label errors.

Ranked #3 on Semi-Supervised Semantic Segmentation on Pascal VOC 2012 1% labeled

Pseudo Label Semi-Supervised Image Classification +1

Uncertainty-Aware Consistency Regularization for Cross-Domain Semantic Segmentation

no code implementations • 19 Apr 2020 • Qianyu Zhou, Zhengyang Feng, Qiqi Gu, Guangliang Cheng, Xuequan Lu, Jianping Shi, Lizhuang Ma

Guided by this mask, we propose a ClassOut strategy to realize effective regional consistency in a fine-grained manner.

Semantic Segmentation Unsupervised Domain Adaptation

Semantic Correspondence via 2D-3D-2D Cycle

1 code implementation • 20 Apr 2020 • Yang You, Chengkun Li, Yujing Lou, Zhoujun Cheng, Lizhuang Ma, Cewu Lu, Weiming Wang

Visual semantic correspondence is an important topic in computer vision and could help machines understand objects in our daily life.

Semantic correspondence

Fine-Grained Expression Manipulation via Structured Latent Space

1 code implementation • 21 Apr 2020 • Junshu Tang, Zhiwen Shao, Lizhuang Ma

Most existing expression manipulation methods resort to discrete expression labels, which mainly edit global expressions and ignore the manipulation of fine details.

Generative Adversarial Network

NTIRE 2020 Challenge on NonHomogeneous Dehazing

no code implementations • 7 May 2020 • Codruta O. Ancuti, Cosmin Ancuti, Florin-Alexandru Vasluianu, Radu Timofte, Jing Liu, Haiyan Wu, Yuan Xie, Yanyun Qu, Lizhuang Ma, Ziling Huang, Qili Deng, Ju-Chin Chao, Tsung-Shan Yang, Peng-Wen Chen, Po-Min Hsu, Tzu-Yi Liao, Chung-En Sun, Pei-Yuan Wu, Jeonghyeok Do, Jongmin Park, Munchurl Kim, Kareem Metwaly, Xuelu Li, Tiantong Guo, Vishal Monga, Mingzhao Yu, Venkateswararao Cherukuri, Shiue-Yuan Chuang, Tsung-Nan Lin, David Lee, Jerome Chang, Zhan-Han Wang, Yu-Bang Chang, Chang-Hong Lin, Yu Dong, Hong-Yu Zhou, Xiangzhen Kong, Sourya Dipta Das, Saikat Dutta, Xuan Zhao, Bing Ouyang, Dennis Estrada, Meiqi Wang, Tianqi Su, Siyi Chen, Bangyong Sun, Vincent Whannou de Dravo, Zhe Yu, Pratik Narang, Aryan Mehra, Navaneeth Raghunath, Murari Mandal

We focus on the proposed solutions and their results evaluated on NH-Haze, a novel dataset consisting of 55 pairs of real haze-free and nonhomogeneous hazy images recorded outdoors.

Image Dehazing

Spoof Face Detection Via Semi-Supervised Adversarial Training

no code implementations • 22 May 2020 • Chengwei Chen, Wang Yuan, Xuequan Lu, Lizhuang Ma

To capture the underlying structure of live face data in latent representation space, we propose to train on live face data only, with a convolutional Encoder-Decoder network acting as a Generator.

Face Detection Face Presentation Attack Detection +4

Brain Tumor Anomaly Detection via Latent Regularized Adversarial Network

no code implementations • 9 Jul 2020 • Nan Wang, Chengwei Chen, Yuan Xie, Lizhuang Ma

The brain structure in the collected data is complicated; hence, doctors must spend considerable effort when diagnosing brain abnormalities.

Semi-supervised Anomaly Detection Supervised Anomaly Detection

Face Anti-Spoofing Via Disentangled Representation Learning

no code implementations • ECCV 2020 • Ke-Yue Zhang, Taiping Yao, Jian Zhang, Ying Tai, Shouhong Ding, Jilin Li, Feiyue Huang, Haichuan Song, Lizhuang Ma

Face anti-spoofing is crucial to security of face recognition systems.

Disentanglement Face Anti-Spoofing +1

Canonical Voting: Towards Robust Oriented Bounding Box Detection in 3D Scenes

1 code implementation • CVPR 2022 • Yang You, Zelin Ye, Yujing Lou, Chengkun Li, Yong-Lu Li, Lizhuang Ma, Weiming Wang, Cewu Lu

In this work, we disentangle the direct offset into Local Canonical Coordinates (LCC), box scales and box orientations.

3D Object Detection object-detection

Weakly-Supervised Saliency Detection via Salient Object Subitizing

no code implementations • 4 Jan 2021 • Xiaoyang Zheng, Xin Tan, Jie Zhou, Lizhuang Ma, Rynson W. H. Lau

This allows the supervision to be aligned with the property of saliency detection, where the salient objects of an image could be from more than one class.

Object object-detection +4

Boundary-Aware Geometric Encoding for Semantic Segmentation of Point Clouds

no code implementations • 7 Jan 2021 • Jingyu Gong, Jiachen Xu, Xin Tan, Jie Zhou, Yanyun Qu, Yuan Xie, Lizhuang Ma

Boundary information plays a significant role in 2D image segmentation, while usually being ignored in 3D point cloud segmentation where ambiguous features might be generated in feature extraction, leading to misclassification in the transition area between two objects.

Image Segmentation Point Cloud Segmentation +2

Learn from Concepts: Towards the Purified Memory for Few-shot Learning

no code implementations • Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence 2021 • Xuncheng Liu, Xudong Tian, Shaohui Lin, Yanyun Qu, Lizhuang Ma, Wang Yuan, Zhizhong Zhang, Yuan Xie

In this paper, we present a novel purified memory mechanism that simulates the recognition process of human beings.

Few-Shot Learning

PRIN/SPRIN: On Extracting Point-wise Rotation Invariant Features

2 code implementations • 24 Feb 2021 • Yang You, Yujing Lou, Ruoxi Shi, Qi Liu, Yu-Wing Tai, Lizhuang Ma, Weiming Wang, Cewu Lu

Spherical Voxel Convolution and Point Re-sampling are proposed to extract rotation invariant features for each point.

3D Feature Matching Data Augmentation

"Forget" the Forget Gate: Estimating Anomalies in Videos using Self-contained Long Short-Term Memory Networks

no code implementations • 3 Apr 2021 • Habtamu Fanta, Zhiwen Shao, Lizhuang Ma

Abnormal event detection is a challenging task that requires effectively handling intricate features of appearance and motion.

Anomaly Detection Computational Efficiency +2

Farewell to Mutual Information: Variational Distillation for Cross-Modal Person Re-Identification

3 code implementations • CVPR 2021 • Xudong Tian, Zhizhong Zhang, Shaohui Lin, Yanyun Qu, Yuan Xie, Lizhuang Ma

The Information Bottleneck (IB) provides an information theoretic principle for representation learning, by retaining all information relevant for predicting label while minimizing the redundancy.

Cross-Modality Person Re-identification Cross-Modal Person Re-Identification +3

Contrastive Learning for Compact Single Image Dehazing

7 code implementations • CVPR 2021 • Haiyan Wu, Yanyun Qu, Shaohui Lin, Jian Zhou, Ruizhi Qiao, Zhizhong Zhang, Yuan Xie, Lizhuang Ma

In this paper, we propose a novel contrastive regularization (CR) built upon contrastive learning to exploit both the information of hazy images and clear images as negative and positive samples, respectively.

Ranked #5 on Image Dehazing on RS-Haze

Contrastive Learning Image Dehazing +1

Omni-supervised Point Cloud Segmentation via Gradual Receptive Field Component Reasoning

2 code implementations • CVPR 2021 • Jingyu Gong, Jiachen Xu, Xin Tan, Haichuan Song, Yanyun Qu, Yuan Xie, Lizhuang Ma

Our method can significantly improve the backbones in all three datasets.

Ranked #2 on Semantic Segmentation on Semantic3D

Point Cloud Segmentation Semantic Segmentation

End-to-End Video Object Detection with Spatial-Temporal Transformers

1 code implementation • 23 May 2021 • Lu He, Qianyu Zhou, Xiangtai Li, Li Niu, Guangliang Cheng, Xiao Li, Wenxuan Liu, Yunhai Tong, Lizhuang Ma, Liqing Zhang

Recently, DETR and Deformable DETR have been proposed to eliminate the need for many hand-designed components in object detection while demonstrating performance as good as previous complex hand-crafted detectors.

Object object-detection +2

Novelty Detection via Contrastive Learning with Negative Data Augmentation

no code implementations • 18 Jun 2021 • Chengwei Chen, Yuan Xie, Shaohui Lin, Ruizhi Qiao, Jian Zhou, Xin Tan, Yi Zhang, Lizhuang Ma

Moreover, our model is more stable for training in a non-adversarial manner, compared to other adversarial based novelty detection methods.

Clustering Contrastive Learning +4

Dual Reweighting Domain Generalization for Face Presentation Attack Detection

no code implementations • 30 Jun 2021 • Shubao Liu, Ke-Yue Zhang, Taiping Yao, Kekai Sheng, Shouhong Ding, Ying Tai, Jilin Li, Yuan Xie, Lizhuang Ma

Face anti-spoofing approaches based on domain generalization (DG) have drawn growing attention due to their robustness for unseen scenarios.

Domain Generalization Face Anti-Spoofing +1

Adaptive Normalized Representation Learning for Generalizable Face Anti-Spoofing

no code implementations • 5 Aug 2021 • Shubao Liu, Ke-Yue Zhang, Taiping Yao, Mingwei Bi, Shouhong Ding, Jilin Li, Feiyue Huang, Lizhuang Ma

However, little attention has been paid to the feature extraction process for the FAS task, especially the influence of normalization, which also has a great impact on the generalization of the learned representation.

Domain Generalization Face Anti-Spoofing +1

Context-Aware Mixup for Domain Adaptive Semantic Segmentation

1 code implementation • 8 Aug 2021 • Qianyu Zhou, Zhengyang Feng, Qiqi Gu, Jiangmiao Pang, Guangliang Cheng, Xuequan Lu, Jianping Shi, Lizhuang Ma

The generated contextual mask is critical in this work and will guide the context-aware domain mixup on three different levels.

Ranked #5 on Image-to-Image Translation on SYNTHIA-to-Cityscapes

Semantic Segmentation Synthetic-to-Real Translation +1

Self-Adversarial Disentangling for Specific Domain Adaptation

no code implementations • 8 Aug 2021 • Qianyu Zhou, Qiqi Gu, Jiangmiao Pang, Xuequan Lu, Lizhuang Ma

In this paper, we study a practical setting called Specific Domain Adaptation (SDA) that aligns the source and target domains in a demanded-specific dimension.

Ranked #10 on Unsupervised Domain Adaptation on Cityscapes to Foggy Cityscapes

Image-to-Image Translation on Cityscapes-to-Foggy Cityscapes object-detection +3

Semi-supervised 3D Object Detection via Adaptive Pseudo-Labeling

no code implementations • 15 Aug 2021 • Hongyi Xu, Fengqi Liu, Qianyu Zhou, Jinkun Hao, Zhijie Cao, Zhengyang Feng, Lizhuang Ma

Inspired by this, we propose a novel semi-supervised framework based on pseudo-labeling for outdoor 3D object detection tasks.

3D Object Detection Object +1

PIT: Position-Invariant Transform for Cross-FoV Domain Adaptation

1 code implementation • ICCV 2021 • Qiqi Gu, Qianyu Zhou, Minghao Xu, Zhengyang Feng, Guangliang Cheng, Xuequan Lu, Jianping Shi, Lizhuang Ma

Extensive experiments demonstrate that our method can soundly boost the performance on both cross-domain object detection and segmentation for state-of-the-art techniques.

Domain Adaptation object-detection +4

Spatiotemporal Inconsistency Learning for DeepFake Video Detection

no code implementations • 4 Sep 2021 • Zhihao Gu, Yang Chen, Taiping Yao, Shouhong Ding, Jilin Li, Feiyue Huang, Lizhuang Ma

To address this issue, we term this task as a Spatial-Temporal Inconsistency Learning (STIL) process and instantiate it into a novel STIL block, which consists of a Spatial Inconsistency Module (SIM), a Temporal Inconsistency Module (TIM), and an Information Supplement Module (ISM).

Binary Classification Face Swapping

Domain Adaptive Semantic Segmentation via Regional Contrastive Consistency Regularization

1 code implementation • 11 Oct 2021 • Qianyu Zhou, Chuyun Zhuang, Ran Yi, Xuequan Lu, Lizhuang Ma

In this paper, we propose a novel and fully end-to-end trainable approach, called regional contrastive consistency regularization (RCCR) for domain adaptive semantic segmentation.

Ranked #31 on Synthetic-to-Real Translation on GTAV-to-Cityscapes Labels

Semantic Segmentation Synthetic-to-Real Translation +1

Understanding Pixel-level 2D Image Semantics with 3D Keypoint Knowledge Engine

no code implementations • 21 Nov 2021 • Yang You, Chengkun Li, Yujing Lou, Zhoujun Cheng, Liangwei Li, Lizhuang Ma, Weiming Wang, Cewu Lu

Pixel-level 2D object semantic understanding is an important topic in computer vision and could help machines deeply understand objects (e.g., functionality and affordance) in our daily life.

Exploring Versatile Prior for Human Motion via Motion Frequency Guidance

1 code implementation • 25 Nov 2021 • Jiachen Xu, Min Wang, Jingyu Gong, Wentao Liu, Chen Qian, Yuan Xie, Lizhuang Ma

Prior plays an important role in providing the plausible constraint on human motion.

Denoising Representation Learning

Feature Generation and Hypothesis Verification for Reliable Face Anti-Spoofing

1 code implementation • 30 Dec 2021 • Shice Liu, Shitao Lu, Hongyi Xu, Jing Yang, Shouhong Ding, Lizhuang Ma

However, the improvement is still limited by two issues: 1) It is difficult to perfectly map all faces to a shared feature space.

Disentanglement Domain Generalization +1

HybridCR: Weakly-Supervised 3D Point Cloud Semantic Segmentation via Hybrid Contrastive Regularization

1 code implementation • CVPR 2022 • Mengtian Li, Yuan Xie, Yunhang Shen, Bo Ke, Ruizhi Qiao, Bo Ren, Shaohui Lin, Lizhuang Ma

To address the huge labeling cost in large-scale point cloud semantic segmentation, we propose a novel hybrid contrastive regularization (HybridCR) framework in the weakly-supervised setting, which obtains competitive performance compared to its fully-supervised counterpart.

Semantic Segmentation Semantic Similarity +1

ISDNet: Integrating Shallow and Deep Networks for Efficient Ultra-High Resolution Segmentation

1 code implementation • CVPR 2022 • Shaohua Guo, Liang Liu, Zhenye Gan, Yabiao Wang, Wuhao Zhang, Chengjie Wang, Guannan Jiang, Wei Zhang, Ran Yi, Lizhuang Ma, Ke Xu

The huge computation and memory burdens are two obstacles in ultra-high resolution image segmentation.

Image Segmentation Segmentation +1

TransVOD: End-to-End Video Object Detection with Spatial-Temporal Transformers

3 code implementations • 13 Jan 2022 • Qianyu Zhou, Xiangtai Li, Lu He, Yibo Yang, Guangliang Cheng, Yunhai Tong, Lizhuang Ma, Dacheng Tao

Detection Transformer (DETR) and Deformable DETR have been proposed to eliminate the need for many hand-designed components in object detection while demonstrating performance as good as previous complex hand-crafted detectors.

Ranked #4 on Video Object Detection on ImageNet VID (using extra training data)

Object object-detection +2

Rethinking Efficient Lane Detection via Curve Modeling

1 code implementation • CVPR 2022 • Zhengyang Feng, Shaohua Guo, Xin Tan, Ke Xu, Min Wang, Lizhuang Ma

This paper presents a novel parametric curve-based method for lane detection in RGB images.

Ranked #2 on Lane Detection on LLAMAS

Lane Detection

CtlGAN: Few-shot Artistic Portraits Generation with Contrastive Transfer Learning

no code implementations • 16 Mar 2022 • Yue Wang, Ran Yi, Luying Li, Ying Tai, Chengjie Wang, Lizhuang Ma

We propose a new encoder that embeds real faces into Z+ space, and a dual-path training strategy to better cope with the adapted decoder and eliminate the artifacts.

Image-to-Image Translation Transfer Learning

LAKe-Net: Topology-Aware Point Cloud Completion by Localizing Aligned Keypoints

1 code implementation • CVPR 2022 • Junshu Tang, Zhijun Gong, Ran Yi, Yuan Xie, Lizhuang Ma

An asymmetric keypoint locator, including an unsupervised multi-scale keypoint detector and a complete keypoint generator, is proposed for localizing aligned keypoints from complete and partial point clouds.

Point Cloud Completion

MISSU: 3D Medical Image Segmentation via Self-distilling TransUNet

1 code implementation • 2 Jun 2022 • Nan Wang, Shaohui Lin, Xiaoxiao Li, Ke Li, Yunhang Shen, Yue Gao, Lizhuang Ma

U-Nets have achieved tremendous success in medical image segmentation.

Image Segmentation Medical Image Segmentation +2

Variational Distillation for Multi-View Learning

3 code implementations • 20 Jun 2022 • Xudong Tian, Zhizhong Zhang, Cong Wang, Wensheng Zhang, Yanyun Qu, Lizhuang Ma, Zongze Wu, Yuan Xie, Dacheng Tao

Information Bottleneck (IB) based multi-view learning provides an information theoretic principle for seeking shared information contained in heterogeneous data descriptions.

Multi-View Learning Representation Learning

Adaptive Mixture of Experts Learning for Generalizable Face Anti-Spoofing

no code implementations • 20 Jul 2022 • Qianyu Zhou, Ke-Yue Zhang, Taiping Yao, Ran Yi, Shouhong Ding, Lizhuang Ma

Existing DG-based FAS approaches always capture the domain-invariant features for generalizing on the various unseen domains.

Domain Generalization Face Anti-Spoofing +1

Generative Domain Adaptation for Face Anti-Spoofing

no code implementations • 20 Jul 2022 • Qianyu Zhou, Ke-Yue Zhang, Taiping Yao, Ran Yi, Kekai Sheng, Shouhong Ding, Lizhuang Ma

Most existing UDA FAS methods typically fit the trained models to the target domain via aligning the distribution of semantic high-level features.

Domain Adaptation Face Anti-Spoofing

Boosting Night-time Scene Parsing with Learnable Frequency

1 code implementation • 30 Aug 2022 • Zhifeng Xie, Sen Wang, Ke Xu, Zhizhong Zhang, Xin Tan, Yuan Xie, Lizhuang Ma

Based on this, we propose to exploit the image frequency distributions for night-time scene parsing.

Autonomous Driving Scene Parsing

Prototype-Aware Heterogeneous Task for Point Cloud Completion

no code implementations • 5 Sep 2022 • Junshu Tang, Jiachen Xu, Jingyu Gong, Haichuan Song, Yuan Xie, Lizhuang Ma

Moreover, for effective training, we consider a difficulty-based sampling strategy to encourage the network to pay more attention to partial point clouds with less geometric information.

Point Cloud Completion

3DFaceShop: Explicitly Controllable 3D-Aware Portrait Generation

1 code implementation • 12 Sep 2022 • Junshu Tang, Bo Zhang, Binxin Yang, Ting Zhang, Dong Chen, Lizhuang Ma, Fang Wen

In contrast to the traditional avatar creation pipeline which is a costly process, contemporary generative approaches directly learn the data distribution from photographs.

3D Face Animation Disentanglement +3

Image Understands Point Cloud: Weakly Supervised 3D Semantic Segmentation via Association Learning

no code implementations • 16 Sep 2022 • Tianfang Sun, Zhizhong Zhang, Xin Tan, Yanyun Qu, Yuan Xie, Lizhuang Ma

In this paper, we propose a novel cross-modality weakly supervised method for 3D segmentation, incorporating complementary information from unlabeled images.

3D Semantic Segmentation Pseudo Label +2

Rethinking Implicit Neural Representations for Vision Learners

no code implementations • 22 Nov 2022 • Yiran Song, Qianyu Zhou, Lizhuang Ma

Existing INR methods suffer from two problems: 1) narrow theoretical definitions of INRs are inapplicable to high-level tasks; 2) lack of representation capabilities to deep networks.

Image Classification Image Generation +6

DCS-RISR: Dynamic Channel Splitting for Efficient Real-world Image Super-Resolution

no code implementations • 15 Dec 2022 • Junbo Qiao, Shaohui Lin, Yunlun Zhang, Wei Li, Jie Hu, Gaoqi He, Changbo Wang, Lizhuang Ma

Real-world image super-resolution (RISR) has received increased focus for improving the quality of SR images under unknown complex degradation.

Image Super-Resolution SSIM

Rethinking Gradient Projection Continual Learning: Stability / Plasticity Feature Space Decoupling

no code implementations • CVPR 2023 • Zhen Zhao, Zhizhong Zhang, Xin Tan, Jun Liu, Yanyun Qu, Yuan Xie, Lizhuang Ma

In this paper, we propose a space decoupling (SD) algorithm to decouple the feature space into a pair of complementary subspaces, i.e., the stability space I and the plasticity space R. I is established by conducting space intersection between the historic and current feature space, and thus I contains more task-shared bases.

Continual Learning

Remembering Normality: Memory-guided Knowledge Distillation for Unsupervised Anomaly Detection

1 code implementation • ICCV 2023 • Zhihao Gu, Liang Liu, Xu Chen, Ran Yi, Jiangning Zhang, Yabiao Wang, Chengjie Wang, Annan Shu, Guannan Jiang, Lizhuang Ma

Specifically, we first propose a normality recall memory (NR Memory) to strengthen the normality of student-generated features by recalling the stored normal information.

Ranked #11 on Anomaly Detection on MVTec AD

Knowledge Distillation Unsupervised Anomaly Detection

CRIN: Rotation-Invariant Point Cloud Analysis and Rotation Estimation via Centrifugal Reference Frame

1 code implementation • 6 Mar 2023 • Yujing Lou, Zelin Ye, Yang You, Nianjuan Jiang, Jiangbo Lu, Weiming Wang, Lizhuang Ma, Cewu Lu

CRIN directly takes the coordinates of points as input and transforms local points into rotation-invariant representations via centrifugal reference frames.

Make-It-3D: High-Fidelity 3D Creation from A Single Image with Diffusion Prior

2 code implementations • ICCV 2023 • Junshu Tang, Tengfei Wang, Bo Zhang, Ting Zhang, Ran Yi, Lizhuang Ma, Dong Chen

In this work, we investigate the problem of creating high-fidelity 3D content from only a single image.

Text to 3D

Instance-Aware Domain Generalization for Face Anti-Spoofing

1 code implementation • CVPR 2023 • Qianyu Zhou, Ke-Yue Zhang, Taiping Yao, Xuequan Lu, Ran Yi, Shouhong Ding, Lizhuang Ma

To address these issues, we propose a novel perspective for DG FAS that aligns features on the instance level without the need for domain labels.

Domain Generalization Face Anti-Spoofing +1

Re-thinking Data Availablity Attacks Against Deep Neural Networks

no code implementations • 18 May 2023 • Bin Fang, Bo Li, Shuang Wu, Ran Yi, Shouhong Ding, Lizhuang Ma

The unauthorized use of personal data for commercial purposes and the clandestine acquisition of private data for training machine learning models continue to raise concerns.

Towards Generalizable Data Protection With Transferable Unlearnable Examples

no code implementations • 18 May 2023 • Bin Fang, Bo Li, Shuang Wu, Tianyi Zheng, Shouhong Ding, Ran Yi, Lizhuang Ma

One of the crucial factors contributing to this success has been the access to an abundance of high-quality data for constructing machine learning models.

RFENet: Towards Reciprocal Feature Evolution for Glass Segmentation

1 code implementation • 12 Jul 2023 • Ke Fan, Changan Wang, Yabiao Wang, Chengjie Wang, Ran Yi, Lizhuang Ma

Glass-like objects are widespread in daily life but remain difficult for most existing methods to segment.

Semantic Segmentation

LiDAR-Camera Panoptic Segmentation via Geometry-Consistent and Semantic-Aware Alignment

1 code implementation • ICCV 2023 • Zhiwei Zhang, Zhizhong Zhang, Qian Yu, Ran Yi, Yuan Xie, Lizhuang Ma

3D panoptic segmentation is a challenging perception task that requires both semantic segmentation and instance segmentation.

Instance Segmentation Panoptic Segmentation +1

Phasic Content Fusing Diffusion Model with Directional Distribution Consistency for Few-Shot Model Adaption

1 code implementation • ICCV 2023 • Teng Hu, Jiangning Zhang, Liang Liu, Ran Yi, Siqi Kou, Haokun Zhu, Xu Chen, Yabiao Wang, Chengjie Wang, Lizhuang Ma

To address these problems, we propose a novel phasic content fusing few-shot diffusion model with directional distribution consistency loss, which targets different learning objectives at distinct training stages of the diffusion model.

Domain Adaptation

Stroke-based Neural Painting and Stylization with Dynamically Predicted Painting Region

2 code implementations • 7 Sep 2023 • Teng Hu, Ran Yi, Haokun Zhu, Liang Liu, Jinlong Peng, Yabiao Wang, Chengjie Wang, Lizhuang Ma

To solve the problem, we propose Compositional Neural Painter, a novel stroke-based rendering framework which dynamically predicts the next painting region based on the current canvas, instead of dividing the image plane uniformly into painting regions.

Style Transfer

Contrastive Pseudo Learning for Open-World DeepFake Attribution

1 code implementation • ICCV 2023 • Zhimin Sun, Shen Chen, Taiping Yao, Bangjie Yin, Ran Yi, Shouhong Ding, Lizhuang Ma

The challenge of source attribution for forged faces has gained widespread attention due to the rapid development of generative techniques.

DeepFake Detection Face Swapping +1

Rethinking Domain Generalization: Discriminability and Generalizability

no code implementations • 28 Sep 2023 • Shaocong Long, Qianyu Zhou, Chenhao Ying, Lizhuang Ma, Yuan Luo

On the one hand, the simultaneous attainment of generalizability and discriminability of features presents a complex challenge, often entailing inherent contradictions.

Domain Generalization

Diverse Target and Contribution Scheduling for Domain Generalization

no code implementations • 28 Sep 2023 • Shaocong Long, Qianyu Zhou, Chenhao Ying, Lizhuang Ma, Yuan Luo

Specifically, DTS employs distinct soft labels as training targets to account for various feature distributions across domains, thereby mitigating gradient conflicts, while DCB dynamically balances the contributions of source domains by ensuring a fair decline in the losses of different source domains.

Domain Generalization Scheduling

Generalized Category Discovery in Semantic Segmentation

1 code implementation • 20 Nov 2023 • Zhengyuan Peng, Qijian Tian, Jianqing Xu, Yizhang Jin, Xuequan Lu, Xin Tan, Yuan Xie, Lizhuang Ma

This paper explores a novel setting called Generalized Category Discovery in Semantic Segmentation (GCDSS), aiming to segment unlabeled images given prior knowledge from a labeled set of base classes.

Segmentation Semantic Segmentation

COTR: Compact Occupancy TRansformer for Vision-based 3D Occupancy Prediction

1 code implementation • 4 Dec 2023 • Qihang Ma, Xin Tan, Yanyun Qu, Lizhuang Ma, Zhizhong Zhang, Yuan Xie

The autonomous driving community has shown significant interest in 3D occupancy prediction, driven by its exceptional geometric perception and general object recognition capabilities.

Autonomous Driving Object Recognition

A Theory of Non-Acyclic Generative Flow Networks

no code implementations • 23 Dec 2023 • Leo Maxime Brunswic, Yinchuan Li, Yushun Xu, Shangling Jui, Lizhuang Ma

GFlowNets is a novel flow-based method for learning a stochastic policy to generate objects via a sequence of actions and with probability proportional to a given positive reward.

BA-SAM: Scalable Bias-Mode Attention Mask for Segment Anything Model

1 code implementation • 4 Jan 2024 • Yiran Song, Qianyu Zhou, Xiangtai Li, Deng-Ping Fan, Xuequan Lu, Lizhuang Ma

To this end, we propose Scalable Bias-Mode Attention Mask (BA-SAM) to enhance SAM's adaptability to varying image resolutions while eliminating the need for structure modifications.

Class-Imbalanced Semi-Supervised Learning for Large-Scale Point Cloud Semantic Segmentation via Decoupling Optimization

no code implementations • 13 Jan 2024 • Mengtian Li, Shaohui Lin, Zihan Wang, Yunhang Shen, Baochang Zhang, Lizhuang Ma

Semi-supervised learning (SSL), thanks to the significant reduction of data annotation costs, has been an active research topic for large-scale 3D scene understanding.

Pseudo Label Representation Learning +2

Continuous Piecewise-Affine Based Motion Model for Image Animation

1 code implementation • 17 Jan 2024 • Hexiang Wang, Fengqi Liu, Qianyu Zhou, Ran Yi, Xin Tan, Lizhuang Ma

To address this issue, we propose to model motion from the source image to the driving frame in highly-expressive diffeomorphism spaces.

Image Animation

SimAda: A Simple Unified Framework for Adapting Segment Anything Model in Underperformed Scenes

1 code implementation • 31 Jan 2024 • Yiran Song, Qianyu Zhou, Xuequan Lu, Zhiwen Shao, Lizhuang Ma

In this paper, we aim to investigate the impact of the general vision modules on finetuning SAM and enable them to generalize across all downstream tasks.

DEMOS: Dynamic Environment Motion Synthesis in 3D Scenes via Local Spherical-BEV Perception

no code implementations • 4 Mar 2024 • Jingyu Gong, Min Wang, Wentao Liu, Chen Qian, Zhizhong Zhang, Yuan Xie, Lizhuang Ma

To handle this problem, we propose the first Dynamic Environment MOtion Synthesis framework (DEMOS) to predict future motion instantly according to the current scene, and use it to dynamically update the latent motion for final motion synthesis.

motion prediction Motion Synthesis

Exploring Safety Generalization Challenges of Large Language Models via Code

no code implementations • 12 Mar 2024 • Qibing Ren, Chang Gao, Jing Shao, Junchi Yan, Xin Tan, Yu Qiao, Wai Lam, Lizhuang Ma

The rapid advancement of Large Language Models (LLMs) has brought about remarkable generative capabilities but also raised concerns about their potential misuse.

Code Completion

Real-IAD: A Real-World Multi-View Dataset for Benchmarking Versatile Industrial Anomaly Detection

1 code implementation • 19 Mar 2024 • Chengjie Wang, Wenbing Zhu, Bin-Bin Gao, Zhenye Gan, Jianning Zhang, Zhihao Gu, Shuguang Qian, Mingang Chen, Lizhuang Ma

Finally, we report the results of popular IAD methods on the Real-IAD dataset, providing a highly challenging benchmark to promote the development of the IAD field.

Benchmarking Unsupervised Anomaly Detection

Make-It-Vivid: Dressing Your Animatable Biped Cartoon Characters from Text

no code implementations • 25 Mar 2024 • Junshu Tang, Yanhong Zeng, Ke Fan, Xuheng Wang, Bo Dai, Kai Chen, Lizhuang Ma

Creating and animating 3D biped cartoon characters is crucial and valuable in various applications.

Question Answering Texture Synthesis

Test-Time Domain Generalization for Face Anti-Spoofing

no code implementations • 28 Mar 2024 • Qianyu Zhou, Ke-Yue Zhang, Taiping Yao, Xuequan Lu, Shouhong Ding, Lizhuang Ma

Our method, consisting of Test-Time Style Projection (TTSP) and Diverse Style Shifts Simulation (DSSS), effectively projects the unseen data to the seen domain space.

Domain Generalization Face Anti-Spoofing

SDPose: Tokenized Pose Estimation via Circulation-Guide Self-Distillation

1 code implementation • 4 Apr 2024 • Sichen Chen, Yingyi Zhang, Siming Huang, Ran Yi, Ke Fan, Ruixin Zhang, Peixian Chen, Jun Wang, Shouhong Ding, Lizhuang Ma

To mitigate the problem of under-fitting, we design a transformer module named Multi-Cycled Transformer (MCT) based on multiple-cycled forwards to more fully exploit the potential of small model parameters.

Edge-computing Pose Estimation

Learning Topology Uniformed Face Mesh by Volume Rendering for Multi-view Reconstruction

no code implementations • 8 Apr 2024 • Yating Wang, Ran Yi, Ke Fan, Jinkun Hao, Jiangbo Lu, Lizhuang Ma

Our goal is to bring the strengths of neural volume rendering to multi-view reconstruction of face meshes with consistent topology.

3D Reconstruction Face Reconstruction +1

PromptAD: Learning Prompts with only Normal Samples for Few-Shot Anomaly Detection

2 code implementations • 8 Apr 2024 • Xiaofan Li, Zhizhong Zhang, Xin Tan, Chengwei Chen, Yanyun Qu, Yuan Xie, Lizhuang Ma

The vision-language model has brought great improvement to few-shot industrial anomaly detection, which usually requires designing hundreds of prompts through prompt engineering.

Anomaly Detection Language Modelling +1

DGMamba: Domain Generalization via Generalized State Space Model

2 code implementations • 11 Apr 2024 • Shaocong Long, Qianyu Zhou, Xiangtai Li, Xuequan Lu, Chenhao Ying, Yuan Luo, Lizhuang Ma, Shuicheng Yan

SPR strives to encourage the model to concentrate more on objects rather than context, consisting of two designs: Prior-Free Scanning (PFS) and Domain Context Interchange (DCI).

Domain Generalization
