Search Results for author: Yousri Kessentini

Found 12 papers, 6 papers with code

FATURA: A Multi-Layout Invoice Image Dataset for Document Analysis and Understanding

no code implementations20 Nov 2023 Mahmoud Limam, Marwa Dhiaf, Yousri Kessentini

In this paper, we introduce FATURA, a pivotal resource for researchers in the field of document analysis and understanding.

CSSL-MHTR: Continual Self-Supervised Learning for Scalable Multi-script Handwritten Text Recognition

no code implementations16 Mar 2023 Marwa Dhiaf, Mohamed Ali Souibgui, Kai Wang, Yuyang Liu, Yousri Kessentini, Alicia Fornés, Ahmed Cheikh Rouhou

In this paper, we explore the potential of continual self-supervised learning to alleviate the catastrophic forgetting problem in handwritten text recognition, as an example of sequence recognition.

Handwritten Text Recognition Self-Supervised Learning

Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement

1 code implementation9 Mar 2022 Mohamed Ali Souibgui, Sanket Biswas, Andres Mafla, Ali Furkan Biten, Alicia Fornés, Yousri Kessentini, Josep Lladós, Lluis Gomez, Dimosthenis Karatzas

In this paper, we propose a Text-Degradation Invariant Auto Encoder (Text-DIAE), a self-supervised model designed to tackle two tasks, text recognition (handwritten or scene-text) and document image enhancement.

Document Enhancement Scene Text Recognition

Transformer-Based Approach for Joint Handwriting and Named Entity Recognition in Historical documents

no code implementations8 Dec 2021 Ahmed Cheikh Rouhoua, Marwa Dhiaf, Yousri Kessentini, Sinda Ben Salem

Second, it allows the model to exploit larger bi-dimensional context information to identify the semantic categories, reaching a higher final prediction accuracy.

Language Modelling named-entity-recognition +2

Few Shots Are All You Need: A Progressive Few Shot Learning Approach for Low Resource Handwritten Text Recognition

1 code implementation21 Jul 2021 Mohamed Ali Souibgui, Alicia Fornés, Yousri Kessentini, Beáta Megyesi

Since this retraining would require annotation of thousands of handwritten symbols together with their bounding boxes, we propose to avoid such human effort through an unsupervised progressive learning approach that automatically assigns pseudo-labels to the non-annotated data.

Few-Shot Learning Handwriting Recognition +1

Enhance to Read Better: A Multi-Task Adversarial Network for Handwritten Document Image Enhancement

1 code implementation26 May 2021 Sana Khamekhem Jemni, Mohamed Ali Souibgui, Yousri Kessentini, Alicia Fornés

Unlike the most well-known document binarization methods, which try to improve the visual quality of the degraded document, the proposed architecture integrates a handwritten text recognizer that promotes the generated document image to be more readable.

Binarization Handwritten Text Recognition +2

One-shot Compositional Data Generation for Low Resource Handwritten Text Recognition

no code implementations11 May 2021 Mohamed Ali Souibgui, Ali Furkan Biten, Sounak Dey, Alicia Fornés, Yousri Kessentini, Lluis Gomez, Dimosthenis Karatzas, Josep Lladós

Low resource Handwritten Text Recognition (HTR) is a hard problem due to the scarce annotated data and the very limited linguistic information (dictionaries and language models).

Handwritten Text Recognition HTR

DE-GAN: A Conditional Generative Adversarial Network for Document Enhancement

4 code implementations17 Oct 2020 Mohamed Ali Souibgui, Yousri Kessentini

Documents often exhibit various forms of degradation, which make it hard to be read and substantially deteriorate the performance of an OCR system.

Binarization Deblurring +3

Cannot find the paper you are looking for? You can Submit a new open access paper.