Search Results for author: Lizhuang Ma

Found 99 papers, 49 papers with code

Learning deep representation from coarse to fine for face alignment

no code implementations 31 Jul 2016 Zhiwen Shao, Shouhong Ding, Yiru Zhao, Qinchuan Zhang, Lizhuang Ma

In this paper, we propose a novel face alignment method that trains deep convolutional network from coarse to fine.

Face Alignment

Multi-Scale Video Frame-Synthesis Network with Transitive Consistency Loss

no code implementations 7 Dec 2017 Zhe Hu, Yinglan Ma, Lizhuang Ma

Traditional approaches to interpolate/extrapolate frames in a video sequence require accurate pixel correspondences between images, e.g., using optical flow.

Optical Flow Estimation

Deep Adaptive Attention for Joint Facial Action Unit Detection and Face Alignment

1 code implementation ECCV 2018 Zhiwen Shao, Zhilei Liu, Jianfei Cai, Lizhuang Ma

Facial action unit (AU) detection and face alignment are two highly correlated tasks since facial landmarks can provide precise AU locations to facilitate the extraction of meaningful local features for AU detection.

Action Unit Detection Face Alignment +1

Mask-aware Photorealistic Face Attribute Manipulation

no code implementations 24 Apr 2018 Ruoqi Sun, Chen Huang, Jianping Shi, Lizhuang Ma

The task of face attribute manipulation has found increasing applications, but still remains challenging with the requirement of editing the attributes of a face image while preserving its unique details.

Attribute Face Recognition +1

DRPose3D: Depth Ranking in 3D Human Pose Estimation

no code implementations 23 May 2018 Min Wang, Xipeng Chen, Wentao Liu, Chen Qian, Liang Lin, Lizhuang Ma

In this paper, we propose a two-stage depth ranking based method (DRPose3D) to tackle the problem of 3D human pose estimation.

3D Human Pose Estimation 3D Pose Estimation

Deep Multi-Center Learning for Face Alignment

1 code implementation 5 Aug 2018 Zhiwen Shao, Hengliang Zhu, Xin Tan, Yangyang Hao, Lizhuang Ma

Most of the existing deep learning methods only use one fully-connected layer called shape prediction layer to estimate the locations of facial landmarks.

Face Alignment

Facial Action Unit Detection Using Attention and Relation Learning

no code implementations 10 Aug 2018 Zhiwen Shao, Zhilei Liu, Jianfei Cai, Yunsheng Wu, Lizhuang Ma

By finding the region of interest of each AU with the attention mechanism, AU-related local features can be captured.

Action Unit Detection Facial Action Unit Detection +1

Pointwise Rotation-Invariant Network with Adaptive Sampling and 3D Spherical Voxel Convolution

1 code implementation 23 Nov 2018 Yang You, Yujing Lou, Qi Liu, Yu-Wing Tai, Lizhuang Ma, Cewu Lu, Weiming Wang

Point cloud analysis without pose priors is very challenging in real applications, as the orientations of point clouds are often unknown.

3D Feature Matching Data Augmentation

Efficient Super Resolution Using Binarized Neural Network

no code implementations 16 Dec 2018 Yinglan Ma, Hongyu Xiong, Zhe Hu, Lizhuang Ma

As a way to significantly reduce model size and computation time, binarized neural network has only been shown to excel on semantic-level tasks such as image classification and recognition.

Binarization Image Classification +3

Unconstrained Facial Action Unit Detection via Latent Feature Domain

1 code implementation 25 Mar 2019 Zhiwen Shao, Jianfei Cai, Tat-Jen Cham, Xuequan Lu, Lizhuang Ma

Due to the combination of source AU-related information and target AU-free information, the latent feature domain with transferred source label can be learned by maximizing the target-domain AU detection performance.

Action Unit Detection Domain Adaptation +2

FVNet: 3D Front-View Proposal Generation for Real-Time Object Detection from Point Clouds

no code implementations 26 Mar 2019 Jie Zhou, Xin Tan, Zhiwei Shao, Lizhuang Ma

We then introduce a proposal generation network to predict 3D region proposals from the generated maps and further extrude objects of interest from the whole point cloud.

3D Object Detection Object +2

Explicit Facial Expression Transfer via Fine-Grained Representations

no code implementations 6 Sep 2019 Zhiwen Shao, Hengliang Zhu, Junshu Tang, Xuequan Lu, Lizhuang Ma

Instead of using an intermediate estimated guidance, we propose to explicitly transfer facial expression by directly mapping two unpaired input images to two synthesized images with swapped expressions.

Multi-class Classification

SceneEncoder: Scene-Aware Semantic Segmentation of Point Clouds with A Learnable Scene Descriptor

1 code implementation 24 Jan 2020 Jiachen Xu, Jingyu Gong, Jie Zhou, Xin Tan, Yuan Xie, Lizhuang Ma

Besides local features, global information plays an essential role in semantic segmentation, while recent works usually fail to explicitly extract the meaningful global information and make full use of it.

Segmentation Semantic Segmentation

Novelty Detection via Non-Adversarial Generative Network

no code implementations 3 Feb 2020 Chengwei Chen, Wang Yuan, Yuan Xie, Yanyun Qu, Yiqing Tao, Haichuan Song, Lizhuang Ma

One-class novelty detection is the process of determining if a query example differs from the training examples (the target class).

Image Reconstruction Novelty Detection

Anomaly Detection by One Class Latent Regularized Networks

no code implementations 5 Feb 2020 Chengwei Chen, Pan Chen, Haichuan Song, Yiqing Tao, Yuan Xie, Shouhong Ding, Lizhuang Ma

Anomaly detection is a fundamental problem in computer vision area with many real-world applications.

Anomaly Detection

Night-time Scene Parsing with a Large Real Dataset

no code implementations 15 Mar 2020 Xin Tan, Ke Xu, Ying Cao, Yiheng Zhang, Lizhuang Ma, Rynson W. H. Lau

Although huge progress has been made on scene analysis in recent years, most existing works assume the input images to be in day-time with good lighting conditions.

Scene Parsing Semantic Segmentation

J$\hat{\text{A}}$A-Net: Joint Facial Action Unit Detection and Face Alignment via Adaptive Attention

1 code implementation 18 Mar 2020 Zhiwen Shao, Zhilei Liu, Jianfei Cai, Lizhuang Ma

Moreover, to extract precise local features, we propose an adaptive attention learning module to refine the attention map of each AU adaptively.

Action Unit Detection Face Alignment +1

SiTGRU: Single-Tunnelled Gated Recurrent Unit for Abnormality Detection

no code implementations 30 Mar 2020 Habtamu Fanta, Zhiwen Shao, Lizhuang Ma

In this paper, we propose a novel version of Gated Recurrent Unit (GRU), called Single Tunnelled GRU for abnormality detection.

Anomaly Detection

Semantic Correspondence via 2D-3D-2D Cycle

1 code implementation 20 Apr 2020 Yang You, Chengkun Li, Yujing Lou, Zhoujun Cheng, Lizhuang Ma, Cewu Lu, Weiming Wang

Visual semantic correspondence is an important topic in computer vision and could help machine understand objects in our daily life.

Semantic correspondence

Fine-Grained Expression Manipulation via Structured Latent Space

1 code implementation 21 Apr 2020 Junshu Tang, Zhiwen Shao, Lizhuang Ma

Most existing expression manipulation methods resort to discrete expression labels, which mainly edit global expressions and ignore the manipulation of fine details.

Generative Adversarial Network

Spoof Face Detection Via Semi-Supervised Adversarial Training

no code implementations 22 May 2020 Chengwei Chen, Wang Yuan, Xuequan Lu, Lizhuang Ma

To capture the underlying structure of live faces data in latent representation space, we propose to train the live face data only, with a convolutional Encoder-Decoder network acting as a Generator.

Face Detection Face Presentation Attack Detection +4

Brain Tumor Anomaly Detection via Latent Regularized Adversarial Network

no code implementations 9 Jul 2020 Nan Wang, Chengwei Chen, Yuan Xie, Lizhuang Ma

The brain structure in the collected data is complicated, thence, doctors are required to spend plentiful energy when diagnosing brain abnormalities.

Semi-supervised Anomaly Detection Supervised Anomaly Detection

Weakly-Supervised Saliency Detection via Salient Object Subitizing

no code implementations 4 Jan 2021 Xiaoyang Zheng, Xin Tan, Jie Zhou, Lizhuang Ma, Rynson W. H. Lau

This allows the supervision to be aligned with the property of saliency detection, where the salient objects of an image could be from more than one class.

Object object-detection +4

Boundary-Aware Geometric Encoding for Semantic Segmentation of Point Clouds

no code implementations 7 Jan 2021 Jingyu Gong, Jiachen Xu, Xin Tan, Jie Zhou, Yanyun Qu, Yuan Xie, Lizhuang Ma

Boundary information plays a significant role in 2D image segmentation, while usually being ignored in 3D point cloud segmentation where ambiguous features might be generated in feature extraction, leading to misclassification in the transition area between two objects.

Image Segmentation Point Cloud Segmentation +2

PRIN/SPRIN: On Extracting Point-wise Rotation Invariant Features

2 code implementations 24 Feb 2021 Yang You, Yujing Lou, Ruoxi Shi, Qi Liu, Yu-Wing Tai, Lizhuang Ma, Weiming Wang, Cewu Lu

Spherical Voxel Convolution and Point Re-sampling are proposed to extract rotation invariant features for each point.

3D Feature Matching Data Augmentation

Farewell to Mutual Information: Variational Distillation for Cross-Modal Person Re-Identification

3 code implementations CVPR 2021 Xudong Tian, Zhizhong Zhang, Shaohui Lin, Yanyun Qu, Yuan Xie, Lizhuang Ma

The Information Bottleneck (IB) provides an information theoretic principle for representation learning, by retaining all information relevant for predicting label while minimizing the redundancy.

Cross-Modality Person Re-identification Cross-Modal Person Re-Identification +3

Contrastive Learning for Compact Single Image Dehazing

7 code implementations CVPR 2021 Haiyan Wu, Yanyun Qu, Shaohui Lin, Jian Zhou, Ruizhi Qiao, Zhizhong Zhang, Yuan Xie, Lizhuang Ma

In this paper, we propose a novel contrastive regularization (CR) built upon contrastive learning to exploit both the information of hazy images and clear images as negative and positive samples, respectively.

Contrastive Learning Image Dehazing +1

End-to-End Video Object Detection with Spatial-Temporal Transformers

1 code implementation 23 May 2021 Lu He, Qianyu Zhou, Xiangtai Li, Li Niu, Guangliang Cheng, Xiao Li, Wenxuan Liu, Yunhai Tong, Lizhuang Ma, Liqing Zhang

Recently, DETR and Deformable DETR have been proposed to eliminate the need for many hand-designed components in object detection while demonstrating good performance as previous complex hand-crafted detectors.

Object object-detection +2

Novelty Detection via Contrastive Learning with Negative Data Augmentation

no code implementations 18 Jun 2021 Chengwei Chen, Yuan Xie, Shaohui Lin, Ruizhi Qiao, Jian Zhou, Xin Tan, Yi Zhang, Lizhuang Ma

Moreover, our model is more stable for training in a non-adversarial manner, compared to other adversarial based novelty detection methods.

Clustering Contrastive Learning +4

Dual Reweighting Domain Generalization for Face Presentation Attack Detection

no code implementations 30 Jun 2021 Shubao Liu, Ke-Yue Zhang, Taiping Yao, Kekai Sheng, Shouhong Ding, Ying Tai, Jilin Li, Yuan Xie, Lizhuang Ma

Face anti-spoofing approaches based on domain generalization (DG) have drawn growing attention due to their robustness for unseen scenarios.

Domain Generalization Face Anti-Spoofing +1

Adaptive Normalized Representation Learning for Generalizable Face Anti-Spoofing

no code implementations 5 Aug 2021 Shubao Liu, Ke-Yue Zhang, Taiping Yao, Mingwei Bi, Shouhong Ding, Jilin Li, Feiyue Huang, Lizhuang Ma

However, little attention has been paid to the feature extraction process for the FAS task, especially the influence of normalization, which also has a great impact on the generalization of the learned representation.

Domain Generalization Face Anti-Spoofing +1

Semi-supervised 3D Object Detection via Adaptive Pseudo-Labeling

no code implementations 15 Aug 2021 Hongyi Xu, Fengqi Liu, Qianyu Zhou, Jinkun Hao, Zhijie Cao, Zhengyang Feng, Lizhuang Ma

Inspired by this, we propose a novel semi-supervised framework based on pseudo-labeling for outdoor 3D object detection tasks.

3D Object Detection Object +1

PIT: Position-Invariant Transform for Cross-FoV Domain Adaptation

1 code implementation ICCV 2021 Qiqi Gu, Qianyu Zhou, Minghao Xu, Zhengyang Feng, Guangliang Cheng, Xuequan Lu, Jianping Shi, Lizhuang Ma

Extensive experiments demonstrate that our method can soundly boost the performance on both cross-domain object detection and segmentation for state-of-the-art techniques.

Domain Adaptation object-detection +4

Spatiotemporal Inconsistency Learning for DeepFake Video Detection

no code implementations 4 Sep 2021 Zhihao Gu, Yang Chen, Taiping Yao, Shouhong Ding, Jilin Li, Feiyue Huang, Lizhuang Ma

To address this issue, we term this task as a Spatial-Temporal Inconsistency Learning (STIL) process and instantiate it into a novel STIL block, which consists of a Spatial Inconsistency Module (SIM), a Temporal Inconsistency Module (TIM), and an Information Supplement Module (ISM).

Binary Classification Face Swapping

Domain Adaptive Semantic Segmentation via Regional Contrastive Consistency Regularization

1 code implementation 11 Oct 2021 Qianyu Zhou, Chuyun Zhuang, Ran Yi, Xuequan Lu, Lizhuang Ma

In this paper, we propose a novel and fully end-to-end trainable approach, called regional contrastive consistency regularization (RCCR) for domain adaptive semantic segmentation.

Semantic Segmentation Synthetic-to-Real Translation +1

Understanding Pixel-level 2D Image Semantics with 3D Keypoint Knowledge Engine

no code implementations 21 Nov 2021 Yang You, Chengkun Li, Yujing Lou, Zhoujun Cheng, Liangwei Li, Lizhuang Ma, Weiming Wang, Cewu Lu

Pixel-level 2D object semantic understanding is an important topic in computer vision and could help machine deeply understand objects (e.g., functionality and affordance) in our daily life.

Feature Generation and Hypothesis Verification for Reliable Face Anti-Spoofing

1 code implementation 30 Dec 2021 Shice Liu, Shitao Lu, Hongyi Xu, Jing Yang, Shouhong Ding, Lizhuang Ma

However, the improvement is still limited by two issues: 1) It is difficult to perfectly map all faces to a shared feature space.

Disentanglement Domain Generalization +1

HybridCR: Weakly-Supervised 3D Point Cloud Semantic Segmentation via Hybrid Contrastive Regularization

1 code implementation CVPR 2022 Mengtian Li, Yuan Xie, Yunhang Shen, Bo Ke, Ruizhi Qiao, Bo Ren, Shaohui Lin, Lizhuang Ma

To address the huge labeling cost in large-scale point cloud semantic segmentation, we propose a novel hybrid contrastive regularization (HybridCR) framework in weakly-supervised setting, which obtains competitive performance compared to its fully-supervised counterpart.

Semantic Segmentation Semantic Similarity +1

TransVOD: End-to-End Video Object Detection with Spatial-Temporal Transformers

3 code implementations 13 Jan 2022 Qianyu Zhou, Xiangtai Li, Lu He, Yibo Yang, Guangliang Cheng, Yunhai Tong, Lizhuang Ma, DaCheng Tao

Detection Transformer (DETR) and Deformable DETR have been proposed to eliminate the need for many hand-designed components in object detection while demonstrating good performance as previous complex hand-crafted detectors.

Ranked #4 on Video Object Detection on ImageNet VID (using extra training data)

Object object-detection +2

CtlGAN: Few-shot Artistic Portraits Generation with Contrastive Transfer Learning

no code implementations 16 Mar 2022 Yue Wang, Ran Yi, Luying Li, Ying Tai, Chengjie Wang, Lizhuang Ma

We propose a new encoder which embeds real faces into Z+ space and proposes a dual-path training strategy to better cope with the adapted decoder and eliminate the artifacts.

Image-to-Image Translation Transfer Learning

LAKe-Net: Topology-Aware Point Cloud Completion by Localizing Aligned Keypoints

1 code implementation CVPR 2022 Junshu Tang, Zhijun Gong, Ran Yi, Yuan Xie, Lizhuang Ma

An asymmetric keypoint locator, including an unsupervised multi-scale keypoint detector and a complete keypoint generator, is proposed for localizing aligned keypoints from complete and partial point clouds.

Point Cloud Completion

Variational Distillation for Multi-View Learning

3 code implementations 20 Jun 2022 Xudong Tian, Zhizhong Zhang, Cong Wang, Wensheng Zhang, Yanyun Qu, Lizhuang Ma, Zongze Wu, Yuan Xie, DaCheng Tao

Information Bottleneck (IB) based multi-view learning provides an information theoretic principle for seeking shared information contained in heterogeneous data descriptions.

Multi-View Learning Representation Learning

Adaptive Mixture of Experts Learning for Generalizable Face Anti-Spoofing

no code implementations 20 Jul 2022 Qianyu Zhou, Ke-Yue Zhang, Taiping Yao, Ran Yi, Shouhong Ding, Lizhuang Ma

Existing DG-based FAS approaches always capture the domain-invariant features for generalizing on the various unseen domains.

Domain Generalization Face Anti-Spoofing +1

Generative Domain Adaptation for Face Anti-Spoofing

no code implementations 20 Jul 2022 Qianyu Zhou, Ke-Yue Zhang, Taiping Yao, Ran Yi, Kekai Sheng, Shouhong Ding, Lizhuang Ma

Most existing UDA FAS methods typically fit the trained models to the target domain via aligning the distribution of semantic high-level features.

Domain Adaptation Face Anti-Spoofing

Boosting Night-time Scene Parsing with Learnable Frequency

1 code implementation 30 Aug 2022 Zhifeng Xie, Sen Wang, Ke Xu, Zhizhong Zhang, Xin Tan, Yuan Xie, Lizhuang Ma

Based on this, we propose to exploit the image frequency distributions for night-time scene parsing.

Autonomous Driving Scene Parsing

Prototype-Aware Heterogeneous Task for Point Cloud Completion

no code implementations 5 Sep 2022 Junshu Tang, Jiachen Xu, Jingyu Gong, Haichuan Song, Yuan Xie, Lizhuang Ma

Moreover, for effective training, we consider difficulty-based sampling strategy to encourage the network to pay more attention to some partial point clouds with fewer geometric information.

Point Cloud Completion

3DFaceShop: Explicitly Controllable 3D-Aware Portrait Generation

1 code implementation 12 Sep 2022 Junshu Tang, Bo Zhang, Binxin Yang, Ting Zhang, Dong Chen, Lizhuang Ma, Fang Wen

In contrast to the traditional avatar creation pipeline which is a costly process, contemporary generative approaches directly learn the data distribution from photographs.

3D Face Animation Disentanglement +3

Image Understands Point Cloud: Weakly Supervised 3D Semantic Segmentation via Association Learning

no code implementations 16 Sep 2022 Tianfang Sun, Zhizhong Zhang, Xin Tan, Yanyun Qu, Yuan Xie, Lizhuang Ma

In this paper, we propose a novel cross-modality weakly supervised method for 3D segmentation, incorporating complementary information from unlabeled images.

3D Semantic Segmentation Pseudo Label +2

Rethinking Implicit Neural Representations for Vision Learners

no code implementations 22 Nov 2022 Yiran Song, Qianyu Zhou, Lizhuang Ma

Existing INRs methods suffer from two problems: 1) narrow theoretical definitions of INRs are inapplicable to high-level tasks; 2) lack of representation capabilities to deep networks.

Image Classification Image Generation +6

DCS-RISR: Dynamic Channel Splitting for Efficient Real-world Image Super-Resolution

no code implementations 15 Dec 2022 Junbo Qiao, Shaohui Lin, Yunlun Zhang, Wei Li, Jie Hu, Gaoqi He, Changbo Wang, Lizhuang Ma

Real-world image super-resolution (RISR) has received increased focus for improving the quality of SR images under unknown complex degradation.

Image Super-Resolution SSIM

Rethinking Gradient Projection Continual Learning: Stability / Plasticity Feature Space Decoupling

no code implementations CVPR 2023 Zhen Zhao, Zhizhong Zhang, Xin Tan, Jun Liu, Yanyun Qu, Yuan Xie, Lizhuang Ma

In this paper, we propose a space decoupling (SD) algorithm to decouple the feature space into a pair of complementary subspaces, i.e., the stability space I and the plasticity space R. I is established by conducting space intersection between the historic and current feature space, and thus I contains more task-shared bases.

Continual Learning

CRIN: Rotation-Invariant Point Cloud Analysis and Rotation Estimation via Centrifugal Reference Frame

1 code implementation 6 Mar 2023 Yujing Lou, Zelin Ye, Yang You, Nianjuan Jiang, Jiangbo Lu, Weiming Wang, Lizhuang Ma, Cewu Lu

CRIN directly takes the coordinates of points as input and transforms local points into rotation-invariant representations via centrifugal reference frames.

Instance-Aware Domain Generalization for Face Anti-Spoofing

1 code implementation CVPR 2023 Qianyu Zhou, Ke-Yue Zhang, Taiping Yao, Xuequan Lu, Ran Yi, Shouhong Ding, Lizhuang Ma

To address these issues, we propose a novel perspective for DG FAS that aligns features on the instance level without the need for domain labels.

Domain Generalization Face Anti-Spoofing +1

Re-thinking Data Availablity Attacks Against Deep Neural Networks

no code implementations 18 May 2023 Bin Fang, Bo Li, Shuang Wu, Ran Yi, Shouhong Ding, Lizhuang Ma

The unauthorized use of personal data for commercial purposes and the clandestine acquisition of private data for training machine learning models continue to raise concerns.

Towards Generalizable Data Protection With Transferable Unlearnable Examples

no code implementations 18 May 2023 Bin Fang, Bo Li, Shuang Wu, Tianyi Zheng, Shouhong Ding, Ran Yi, Lizhuang Ma

One of the crucial factors contributing to this success has been the access to an abundance of high-quality data for constructing machine learning models.

RFENet: Towards Reciprocal Feature Evolution for Glass Segmentation

1 code implementation 12 Jul 2023 Ke Fan, Changan Wang, Yabiao Wang, Chengjie Wang, Ran Yi, Lizhuang Ma

Glass-like objects are widespread in daily life but remain intractable to be segmented for most existing methods.

Semantic Segmentation

Phasic Content Fusing Diffusion Model with Directional Distribution Consistency for Few-Shot Model Adaption

1 code implementation ICCV 2023 Teng Hu, Jiangning Zhang, Liang Liu, Ran Yi, Siqi Kou, Haokun Zhu, Xu Chen, Yabiao Wang, Chengjie Wang, Lizhuang Ma

To address these problems, we propose a novel phasic content fusing few-shot diffusion model with directional distribution consistency loss, which targets different learning objectives at distinct training stages of the diffusion model.

Domain Adaptation

Stroke-based Neural Painting and Stylization with Dynamically Predicted Painting Region

2 code implementations 7 Sep 2023 Teng Hu, Ran Yi, Haokun Zhu, Liang Liu, Jinlong Peng, Yabiao Wang, Chengjie Wang, Lizhuang Ma

To solve the problem, we propose Compositional Neural Painter, a novel stroke-based rendering framework which dynamically predicts the next painting region based on the current canvas, instead of dividing the image plane uniformly into painting regions.

Style Transfer

Contrastive Pseudo Learning for Open-World DeepFake Attribution

1 code implementation ICCV 2023 Zhimin Sun, Shen Chen, Taiping Yao, Bangjie Yin, Ran Yi, Shouhong Ding, Lizhuang Ma

The challenge in sourcing attribution for forgery faces has gained widespread attention due to the rapid development of generative techniques.

DeepFake Detection Face Swapping +1

Rethinking Domain Generalization: Discriminability and Generalizability

no code implementations 28 Sep 2023 Shaocong Long, Qianyu Zhou, Chenhao Ying, Lizhuang Ma, Yuan Luo

On the one hand, the simultaneous attainment of generalizability and discriminability of features presents a complex challenge, often entailing inherent contradictions.

Domain Generalization

Diverse Target and Contribution Scheduling for Domain Generalization

no code implementations 28 Sep 2023 Shaocong Long, Qianyu Zhou, Chenhao Ying, Lizhuang Ma, Yuan Luo

In specific, DTS employs distinct soft labels as training targets to account for various feature distributions across domains and thereby mitigates the gradient conflicts, and DCB dynamically balances the contributions of source domains by ensuring a fair decline in losses of different source domains.

Domain Generalization Scheduling

Generalized Category Discovery in Semantic Segmentation

1 code implementation 20 Nov 2023 Zhengyuan Peng, Qijian Tian, Jianqing Xu, Yizhang Jin, Xuequan Lu, Xin Tan, Yuan Xie, Lizhuang Ma

This paper explores a novel setting called Generalized Category Discovery in Semantic Segmentation (GCDSS), aiming to segment unlabeled images given prior knowledge from a labeled set of base classes.

Segmentation Semantic Segmentation

COTR: Compact Occupancy TRansformer for Vision-based 3D Occupancy Prediction

1 code implementation 4 Dec 2023 Qihang Ma, Xin Tan, Yanyun Qu, Lizhuang Ma, Zhizhong Zhang, Yuan Xie

The autonomous driving community has shown significant interest in 3D occupancy prediction, driven by its exceptional geometric perception and general object recognition capabilities.

Autonomous Driving Object Recognition

A Theory of Non-Acyclic Generative Flow Networks

no code implementations 23 Dec 2023 Leo Maxime Brunswic, Yinchuan Li, Yushun Xu, Shangling Jui, Lizhuang Ma

GFlowNets is a novel flow-based method for learning a stochastic policy to generate objects via a sequence of actions and with probability proportional to a given positive reward.

BA-SAM: Scalable Bias-Mode Attention Mask for Segment Anything Model

1 code implementation 4 Jan 2024 Yiran Song, Qianyu Zhou, Xiangtai Li, Deng-Ping Fan, Xuequan Lu, Lizhuang Ma

To this end, we propose Scalable Bias-Mode Attention Mask (BA-SAM) to enhance SAM's adaptability to varying image resolutions while eliminating the need for structure modifications.

Class-Imbalanced Semi-Supervised Learning for Large-Scale Point Cloud Semantic Segmentation via Decoupling Optimization

no code implementations 13 Jan 2024 Mengtian Li, Shaohui Lin, Zihan Wang, Yunhang Shen, Baochang Zhang, Lizhuang Ma

Semi-supervised learning (SSL), thanks to the significant reduction of data annotation costs, has been an active research topic for large-scale 3D scene understanding.

Pseudo Label Representation Learning +2

Continuous Piecewise-Affine Based Motion Model for Image Animation

1 code implementation 17 Jan 2024 Hexiang Wang, Fengqi Liu, Qianyu Zhou, Ran Yi, Xin Tan, Lizhuang Ma

To address this issue, we propose to model motion from the source image to the driving frame in highly-expressive diffeomorphism spaces.

Image Animation

SimAda: A Simple Unified Framework for Adapting Segment Anything Model in Underperformed Scenes

1 code implementation 31 Jan 2024 Yiran Song, Qianyu Zhou, Xuequan Lu, Zhiwen Shao, Lizhuang Ma

In this paper, we aim to investigate the impact of the general vision modules on finetuning SAM and enable them to generalize across all downstream tasks.

DEMOS: Dynamic Environment Motion Synthesis in 3D Scenes via Local Spherical-BEV Perception

no code implementations 4 Mar 2024 Jingyu Gong, Min Wang, Wentao Liu, Chen Qian, Zhizhong Zhang, Yuan Xie, Lizhuang Ma

To handle this problem, we propose the first Dynamic Environment MOtion Synthesis framework (DEMOS) to predict future motion instantly according to the current scene, and use it to dynamically update the latent motion for final motion synthesis.

motion prediction Motion Synthesis

Exploring Safety Generalization Challenges of Large Language Models via Code

no code implementations 12 Mar 2024 Qibing Ren, Chang Gao, Jing Shao, Junchi Yan, Xin Tan, Yu Qiao, Wai Lam, Lizhuang Ma

The rapid advancement of Large Language Models (LLMs) has brought about remarkable generative capabilities but also raised concerns about their potential misuse.

Code Completion

Real-IAD: A Real-World Multi-View Dataset for Benchmarking Versatile Industrial Anomaly Detection

1 code implementation 19 Mar 2024 Chengjie Wang, Wenbing Zhu, Bin-Bin Gao, Zhenye Gan, Jianning Zhang, Zhihao Gu, Shuguang Qian, Mingang Chen, Lizhuang Ma

Finally, we report the results of popular IAD methods on the Real-IAD dataset, providing a highly challenging benchmark to promote the development of the IAD field.

Benchmarking Unsupervised Anomaly Detection

Test-Time Domain Generalization for Face Anti-Spoofing

no code implementations 28 Mar 2024 Qianyu Zhou, Ke-Yue Zhang, Taiping Yao, Xuequan Lu, Shouhong Ding, Lizhuang Ma

Our method, consisting of Test-Time Style Projection (TTSP) and Diverse Style Shifts Simulation (DSSS), effectively projects the unseen data to the seen domain space.

Domain Generalization Face Anti-Spoofing

SDPose: Tokenized Pose Estimation via Circulation-Guide Self-Distillation

1 code implementation 4 Apr 2024 Sichen Chen, Yingyi Zhang, Siming Huang, Ran Yi, Ke Fan, Ruixin Zhang, Peixian Chen, Jun Wang, Shouhong Ding, Lizhuang Ma

To mitigate the problem of under-fitting, we design a transformer module named Multi-Cycled Transformer (MCT) based on multiple-cycled forwards to more fully exploit the potential of small model parameters.

Edge-computing Pose Estimation

Learning Topology Uniformed Face Mesh by Volume Rendering for Multi-view Reconstruction

no code implementations 8 Apr 2024 Yating Wang, Ran Yi, Ke Fan, Jinkun Hao, Jiangbo Lu, Lizhuang Ma

Our goal is to leverage the superiority of neural volume rendering into multi-view reconstruction of face mesh with consistent topology.

3D Reconstruction Face Reconstruction +1

PromptAD: Learning Prompts with only Normal Samples for Few-Shot Anomaly Detection

2 code implementations 8 Apr 2024 Xiaofan Li, Zhizhong Zhang, Xin Tan, Chengwei Chen, Yanyun Qu, Yuan Xie, Lizhuang Ma

The vision-language model has brought great improvement to few-shot industrial anomaly detection, which usually requires designing hundreds of prompts through prompt engineering.

Anomaly Detection Language Modelling +1

DGMamba: Domain Generalization via Generalized State Space Model

2 code implementations 11 Apr 2024 Shaocong Long, Qianyu Zhou, Xiangtai Li, Xuequan Lu, Chenhao Ying, Yuan Luo, Lizhuang Ma, Shuicheng Yan

SPR strives to encourage the model to concentrate more on objects rather than context, consisting of two designs: Prior-Free Scanning (PFS) and Domain Context Interchange (DCI).

Domain Generalization
