Arabic Text Diacritization

In this paper, we propose an approach to tackle the problem of the automatic restoration of Arabic diacritics that includes three components stacked in a pipeline: a deep learning model which is a multi-layer recurrent neural network with LSTM and Dense layers, a character-level rule-based corrector which applies deterministic operations to prevent some errors, and a word-level statistical corrector which uses the context and the distance information to fix some diacritization issues.

Paper
Code

CAMeL Tools: An Open Source Python Toolkit for Arabic Natural Language Processing

CAMeL-Lab/camel_tools • • LREC 2020

We present CAMeL Tools, a collection of open-source tools for Arabic natural language processing in Python.

Paper
Code

Deep Diacritization: Efficient Hierarchical Recurrence for Improved Arabic Diacritization

BKHMSI/deep-diacritization • • COLING (WANLP) 2020

We propose a novel architecture for labelling character sequences that achieves state-of-the-art results on the Tashkeela Arabic diacritization benchmark.

Paper
Code

Effective Deep Learning Models for Automatic Diacritization of Arabic Text

almodhfer/Arabic_Diacritization • • 1 Nov 2020

We propose three deep learning models to recover Arabic text diacritics based on our work in a text-to-speech synthesis system using deep learning.

Paper
Code

Arabic Text Diacritization

Benchmarks Add a Result

Datasets

Most implemented papers

Arabic Text Diacritization Using Deep Neural Networks

Neural Arabic Text Diacritization: State of the Art Results and a Novel Approach for Machine Translation

Multi-components System for Automatic Arabic Diacritization

CAMeL Tools: An Open Source Python Toolkit for Arabic Natural Language Processing

Deep Diacritization: Efficient Hierarchical Recurrence for Improved Arabic Diacritization

Effective Deep Learning Models for Automatic Diacritization of Arabic Text

Content

Benchmarks

Add a Result