Search Results for author: Zuheng Ming

Found 15 papers, 2 papers with code

MMFormer: Multimodal Transformer Using Multiscale Self-Attention for Remote Sensing Image Classification

no code implementations23 Mar 2023 Bo Zhang, Zuheng Ming, Wei Feng, Yaqian Liu, Liang He, Kaixing Zhao

To benefit the complementary information between heterogeneous data, we introduce a new Multimodal Transformer (MMFormer) for Remote Sensing (RS) image classification using Hyperspectral Image (HSI) accompanied by another source of data such as Light Detection and Ranging (LiDAR).

Image Classification Remote Sensing Image Classification

Identity Documents Authentication based on Forgery Detection of Guilloche Pattern

no code implementations22 Jun 2022 Musab Al-Ghadi, Zuheng Ming, Petra Gomez-Krämer, Jean-Christophe Burie

In this work, these two steps are combined together to achieve two objectives: (i) extracted features should have good anticollision (discriminative) capabilities to distinguish between a pair of identity documents belonging to different classes, (ii) checking out the conformity of the guilloche pattern of a given identity document and its similarity to the guilloche pattern of an authentic version of the same country.

VLCDoC: Vision-Language Contrastive Pre-Training Model for Cross-Modal Document Classification

no code implementations24 May 2022 Souhail Bakkali, Zuheng Ming, Mickael Coustaty, Marçal Rusiñol, Oriol Ramos Terrades

Multimodal learning from document data has achieved great success lately as it allows to pre-train semantically meaningful features as a prior into a learnable downstream task.

Document Classification Document Image Classification

ViTransPAD: Video Transformer using convolution and self-attention for Face Presentation Attack Detection

no code implementations3 Mar 2022 Zuheng Ming, Zitong Yu, Musab Al-Ghadi, Muriel Visani, Muhammad MuzzamilLuqman, Jean-Christophe Burie

Instead of using coarse image patches with single-scale as in ViT, we propose the Multi-scale Multi-Head Self-Attention (MsMHSA) architecture to accommodate multi-scale patch partitions of Q, K, V feature maps to the heads of transformer in a coarse-to-fine manner, which enables to learn a fine-grained representation to perform pixel-level discrimination for face PAD.

Binary Classification Face Presentation Attack Detection

Exploring Multi-Tasking Learning in Document Attribute Classification

no code implementations30 Aug 2021 Tanmoy Mondal, Abhijit Das, Zuheng Ming

In this work, we adhere to explore a Multi-Tasking learning (MTL) based network to perform document attribute classification such as the font type, font size, font emphasis and scanning resolution classification of a document image.

Attribute Classification

MIDV-2020: A Comprehensive Benchmark Dataset for Identity Document Analysis

no code implementations1 Jul 2021 Konstantin Bulatov, Ekaterina Emelianova, Daniil Tropin, Natalya Skoryukina, Yulia Chernyshova, Alexander Sheshkus, Sergey Usilin, Zuheng Ming, Jean-Christophe Burie, Muhammad Muzzamil Luqman, Vladimir V. Arlazarov

Identity documents recognition is an important sub-field of document analysis, which deals with tasks of robust document detection, type identification, text fields recognition, as well as identity fraud prevention and document authenticity validation given photos, scans, or video frames of an identity document capture.

Face Detection

A Survey On Anti-Spoofing Methods For Face Recognition with RGB Cameras of Generic Consumer Devices

no code implementations8 Oct 2020 Zuheng Ming, Muriel Visani, Muhammad Muzzamil Luqman, Jean-Christophe Burie

The widespread deployment of face recognition-based biometric systems has made face Presentation Attack Detection (face anti-spoofing) an increasingly critical issue.

Face Anti-Spoofing Face Presentation Attack Detection +1

Cross-modal Multi-task Learning for Graphic Recognition of Caricature Face

no code implementations10 Mar 2020 Zuheng Ming, Jean-Christophe Burie, Muhammad Muzzamil Luqman

Face recognition of realistic visual images has been well studied and made a significant progress in the recent decade.

Caricature Face Recognition +1

Dynamic Deep Multi-task Learning for Caricature-Visual Face Recognition

1 code implementation8 Nov 2019 Zuheng Ming, Jean-Christophe Burie, Muhammad Muzzamil Luqman

Rather than the visual images, the face recognition of the caricatures is far from the performance of the visual images.

Caricature Face Recognition +1

Dynamic Multi-Task Learning for Face Recognition with Facial Expression

1 code implementation8 Nov 2019 Zuheng Ming, Junshi Xia, Muhammad Muzzamil Luqman, Jean-Christophe Burie, Kaixing Zhao

This multi-task learning with dynamic weights also boosts of the performance on the different tasks comparing to the state-of-art methods with single-task learning.

Face Recognition Face Verification +3

Face Detection in Camera Captured Images of Identity Documents under Challenging Conditions

no code implementations8 Nov 2019 Souhail Bakkali, Zuheng Ming, Muhammad Muzzamil Luqman, Jean-Christophe Burie

Benefiting from the advance of deep convolutional neural network approaches (CNNs), many face detection algorithms have achieved state-of-the-art performance in terms of accuracy and very high speed in unconstrained applications.

Face Detection

FaceLiveNet+: A Holistic Networks For Face Authentication Based On Dynamic Multi-task Convolutional Neural Networks

no code implementations28 Feb 2019 Zuheng Ming, Junshi Xia, Muhammad Muzzamil Luqman, Jean-Christophe Burie, Kaixing Zhao

This paper proposes a holistic multi-task Convolutional Neural Networks (CNNs) with the dynamic weights of the tasks, namely FaceLiveNet+, for face authentication.

Face Verification Facial Expression Recognition +2

Cannot find the paper you are looking for? You can Submit a new open access paper.