Search Results for author: Bowen Ma

Found 16 papers, 6 papers with code

Structure Modeling Activation Free Fourier Network for Spacecraft Image Denoising

1 code implementation11 Sep 2024 Jingfan Yang, Hu Gao, Ying Zhang, Bowen Ma, Depeng Dang

We present AFFB and utilize an improved Fast Fourier block to extract repetitive periodic features and long-range information in noisy spacecraft image.

Image Denoising

Norface: Improving Facial Expression Analysis by Identity Normalization

1 code implementation22 Jul 2024 Hanwei Liu, Rudong An, Zhimeng Zhang, Bowen Ma, Wei zhang, Yan Song, Yujing Hu, Wei Chen, Yu Ding

First, the carefully designed normalization network struggles to directly remove the above task-irrelevant noise, by maintaining facial expression consistency but normalizing all original images to a common identity with consistent pose, and background.

Classification Facial Action Unit Detection +3

Open-Vocabulary X-ray Prohibited Item Detection via Fine-tuning CLIP

no code implementations16 Jun 2024 Shuyang Lin, Tong Jia, Hao Wang, Bowen Ma, Mingyuan Li, Dongyue Chen

To address aforementioned challenges, in this paper, we introduce distillation-based open-vocabulary object detection (OVOD) task into X-ray security inspection domain by extending CLIP to learn visual representations in our specific X-ray domain, aiming to detect novel prohibited item categories beyond base categories on which the detector is trained.

object-detection Open-vocabulary object detection +1

MMCL: Boosting Deformable DETR-Based Detectors with Multi-Class Min-Margin Contrastive Learning for Superior Prohibited Item Detection

1 code implementation5 Jun 2024 Mingyuan Li, Tong Jia, Hui Lu, Bowen Ma, Hao Wang, Dongyue Chen

Prohibited Item detection in X-ray images is one of the most effective security inspection methods. However, differing from natural light images, the unique overlapping phenomena in X-ray images lead to the coupling of foreground and background features, thereby lowering the accuracy of general object detectors. Therefore, we propose a Multi-Class Min-Margin Contrastive Learning (MMCL) method that, by clarifying the category semantic information of content queries under the deformable DETR architecture, aids the model in extracting specific category foreground information from coupled features. Specifically, after grouping content queries by the number of categories, we employ the Multi-Class Inter-Class Exclusion (MIE) loss to push apart content queries from different groups.

Contrastive Learning

Emphasizing Crucial Features for Efficient Image Restoration

1 code implementation19 May 2024 Hu Gao, Bowen Ma, Ying Zhang, Jingfan Yang, Jing Yang, Depeng Dang

SFAM consists of two modules: the spatial domain attention module (SDAM) and the frequency domain attention module (FDAM).

Image Restoration

AO-DETR: Anti-Overlapping DETR for X-Ray Prohibited Items Detection

1 code implementation7 Mar 2024 Mingyuan Li, Tong Jia, Hao Wang, Bowen Ma, Shuyang Lin, Da Cai, Dongyue Chen

Considering the significant overlapping phenomenon in X-ray prohibited item images, we propose an Anti-Overlapping DETR (AO-DETR) based on one of the state-of-the-art general object detectors, DINO.

Decoder

FlowFace++: Explicit Semantic Flow-supervised End-to-End Face Swapping

no code implementations22 Jun 2023 Yu Zhang, Hao Zeng, Bowen Ma, Wei zhang, Zhimeng Zhang, Yu Ding, Tangjie Lv, Changjie Fan

The discriminator is shape-aware and relies on a semantic flow-guided operation to explicitly calculate the shape discrepancies between the target and source faces, thus optimizing the face swapping network to generate highly realistic results.

Decoder Face Swapping

TalkCLIP: Talking Head Generation with Text-Guided Expressive Speaking Styles

no code implementations1 Apr 2023 Yifeng Ma, Suzhen Wang, Yu Ding, Bowen Ma, Tangjie Lv, Changjie Fan, Zhipeng Hu, Zhidong Deng, Xin Yu

Leveraging the proposed dataset, we introduce a CLIP-based style encoder that projects natural language-based descriptions to the representations of expressions.

2D Semantic Segmentation task 3 (25 classes) Talking Head Generation

Multi-modal Facial Affective Analysis based on Masked Autoencoder

no code implementations20 Mar 2023 Wei zhang, Bowen Ma, Feng Qiu, Yu Ding

The CVPR 2023 Competition on Affective Behavior Analysis in-the-wild (ABAW) is dedicated to providing high-quality and large-scale Aff-wild2 for the recognition of commonly used emotion representations, such as Action Units (AU), basic expression categories(EXPR), and Valence-Arousal (VA).

FlowFace: Semantic Flow-guided Shape-aware Face Swapping

no code implementations6 Dec 2022 Hao Zeng, Wei zhang, Changjie Fan, Tangjie Lv, Suzhen Wang, Zhimeng Zhang, Bowen Ma, Lincheng Li, Yu Ding, Xin Yu

Unlike most previous methods that focus on transferring the source inner facial features but neglect facial contours, our FlowFace can transfer both of them to a target face, thus leading to more realistic face swapping.

Face Swapping

Facial Action Unit Detection and Intensity Estimation from Self-supervised Representation

no code implementations28 Oct 2022 Bowen Ma, Rudong An, Wei zhang, Yu Ding, Zeng Zhao, Rongsheng Zhang, Tangjie Lv, Changjie Fan, Zhipeng Hu

As a fine-grained and local expression behavior measurement, facial action unit (FAU) analysis (e. g., detection and intensity estimation) has been documented for its time-consuming, labor-intensive, and error-prone annotation.

Action Unit Detection Facial Action Unit Detection

Transformer-based Multimodal Information Fusion for Facial Expression Analysis

no code implementations23 Mar 2022 Wei zhang, Feng Qiu, Suzhen Wang, Hao Zeng, Zhimeng Zhang, Rudong An, Bowen Ma, Yu Ding

Then, we introduce a transformer-based fusion module that integrates the static vision features and the dynamic multimodal features.

Action Unit Detection Arousal Estimation +2

Enhancing Identification of Structure Function of Academic Articles Using Contextual Information

1 code implementation28 Nov 2021 Bowen Ma, Chengzhi Zhang, Yuzhuo Wang, Sanhong Deng

In the research on identifying the structure function of chapters in academic articles, only a few studies used the deep learning model and explored the optimization for feature input.

Deep Learning text-classification +2

Cannot find the paper you are looking for? You can Submit a new open access paper.