no code implementations • 1 Mar 2024 • Musab Al-Ghadi, Joris Voerman, Souhail Bakkali, Mickaël Coustaty, Nicolas Sidere, Xavier St-Georges
The increasing use of digital technologies and mobile-based registration procedures highlights the vital role of personal identity documents (IDs) in verifying users and safeguarding sensitive information.
no code implementations • 11 Sep 2023 • Souhail Bakkali, Sanket Biswas, Zuheng Ming, Mickael Coustaty, Marçal Rusiñol, Oriol Ramos Terrades, Josep Lladós
The field of visual document understanding has witnessed a rapid growth in emerging challenges and powerful multi-modal strategies.
Ranked #19 on Document Image Classification on RVL-CDIP
no code implementations • IJDAR 2021 • Souhail Bakkali, Ziheng Ming, Mickael Coustaty, Marçal Rusiñol
To the best of our knowledge, this is the first time to leverage a mutual learning approach along with a self-attention-based fusion module to perform document image classification.
Ranked #1 on Document Image Classification on RVL-CDIP
no code implementations • 24 May 2022 • Souhail Bakkali, Zuheng Ming, Mickael Coustaty, Marçal Rusiñol, Oriol Ramos Terrades
Multimodal learning from document data has achieved great success lately as it allows to pre-train semantically meaningful features as a prior into a learnable downstream task.
Ranked #18 on Document Image Classification on RVL-CDIP
no code implementations • CVPRW 2020 • Souhail Bakkali, Ziheng Ming, Mickael Coustaty, Marçal Rusiñol
Moreover, a joint feature learning approach that combines image features and text embeddings is introduced as a late fusion methodology.
Ranked #2 on Document Image Classification on RVL-CDIP
no code implementations • 8 Nov 2019 • Souhail Bakkali, Zuheng Ming, Muhammad Muzzamil Luqman, Jean-Christophe Burie
Benefiting from the advance of deep convolutional neural network approaches (CNNs), many face detection algorithms have achieved state-of-the-art performance in terms of accuracy and very high speed in unconstrained applications.