no code implementations • AMTA 2016 • Sawsan Alqahtani, Mahmoud Ghoneim, Mona Diab
The absence of these diacritics naturally leads to significant word ambiguity to top the inherent ambiguity present in fully diacritized words.
no code implementations • LREC 2016 • Mona Diab, Mahmoud Ghoneim, Abdelati Hawwari, Fahad AlGhamdi, Nada Almarwani, Mohamed Al-Badrashiny
We present our effort to create a large Multi-Layered representational repository of Linguistic Code-Switched Arabic data.
no code implementations • WS 2016 • Giovanni Molina, Fahad AlGhamdi, Mahmoud Ghoneim, Abdelati Hawwari, Nicolas Rey-Villamizar, Mona Diab, Thamar Solorio
We present an overview of the second shared task on language identification in code-switched data.
no code implementations • WS 2016 • Wajdi Zaghouani, Abdelati Hawwari, Sawsan Alqahtani, Houda Bouamor, Mahmoud Ghoneim, Mona Diab, Kemal Oflazer
Arabic writing is typically underspecified for short vowels and other markups, referred to as diacritics.
no code implementations • WS 2016 • Mohamed Al-Badrashiny, Abdelati Hawwari, Mahmoud Ghoneim, Mona Diab
We propose an automated method that identifies the morphological and syntactic flexibility of Arabic Verbal Multiword Expressions (AVMWE).
no code implementations • LREC 2016 • Abdelati Hawwari, Mohammed Attia, Mahmoud Ghoneim, Mona Diab
Identifying the various types of the Idafa construction (IC) is of importance to Natural Language processing (NLP) applications.
no code implementations • LREC 2016 • Wajdi Zaghouani, Houda Bouamor, Abdelati Hawwari, Mona Diab, Ossama Obeid, Mahmoud Ghoneim, Sawsan Alqahtani, Kemal Oflazer
This paper presents the annotation guidelines developed as part of an effort to create a large scale manually diacritized corpus for various Arabic text genres.