Search Results for author: Majdi Sawalha

Found 4 papers, 0 papers with code

Construction and Annotation of the Jordan Comprehensive Contemporary Arabic Corpus (JCCA)

no code implementations • WS 2019 • Majdi Sawalha, Faisal Al-Shargi, Abdallah AlShdaifat, Sane Yagi, Mohammad A. Qudah

To compile a modern dictionary that catalogues the words in currency, and to study linguistic patterns in the contemporary language, it is necessary to have a corpus of authentic texts that reflect current usage of the language.

TAG

Paper
Add Code

Tools for Arabic Natural Language Processing: a case study in qalqalah prosody

no code implementations • LREC 2014 • Claire Brierley, Majdi Sawalha, Eric Atwell

In this paper, we focus on the prosodic effect of qalqalah or {``}vibration{''} applied to a subset of Arabic consonants under certain constraints during correct Qur{'}anic recitation or ta{\c{C}}{\S}w{\=\i}d, using our Boundary-Annotated QurÂ’an dataset of 77430 words (Brierley et al 2012; Sawalha et al 2014).

Keyword Extraction

Paper
Add Code

Predicting Phrase Breaks in Classical and Modern Standard Arabic Text

no code implementations • LREC 2012 • Majdi Sawalha, Claire Brierley, Eric Atwell

We train and test two probabilistic taggers for Arabic phrase break prediction on a purpose-built, gold standard, boundary-annotated and PoS-tagged Qur'an corpus of 77430 words and 8230 sentences.

Chunking Human Parsing +3

Paper
Add Code

Open-Source Boundary-Annotated Corpus for Arabic Speech and Language Processing

no code implementations • LREC 2012 • Claire Brierley, Majdi Sawalha, Eric Atwell

We take a novel approach to phrase break prediction for Arabic, deriving our prosodic annotation scheme from Tajw{\=\i}d (recitation) mark-up in the Qur'an which we then interpret as additional text-based data for computational analysis.

Chunking Descriptive +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.