no code implementations • LREC 2022 • Elisa Gugliotta, Marco Dinarelli
In this paper we present the final result of a project focused on Tunisian Arabic encoded in Arabizi, the Latin-based writing system for digital conversations.
no code implementations • 11 Jul 2022 • Elisa Gugliotta, Marco Dinarelli
In this paper we present the final result of a project on Tunisian Arabic encoded in Arabizi, the Latin-based writing system for digital conversations.
1 code implementation • COLING (WANLP) 2020 • Elisa Gugliotta, Marco Dinarelli, Olivier Kraif
We show also how we used the system in order to annotate a Tunisian Arabizi corpus, which has been afterwards manually corrected and used to further evaluate sequence models on Tunisian data.
no code implementations • JEPTALNRECITAL 2020 • Elisa Gugliotta, Marco Dinarelli
TArC : Incrementally and Semi-Automatically Collecting a Tunisian arabish Corpus This article describes the collection process of the first morpho-syntactically annotated Tunisian arabish Corpus (TArC).
no code implementations • LREC 2020 • Elisa Gugliotta, Marco Dinarelli
This article describes the constitution process of the first morpho-syntactically annotated Tunisian Arabish Corpus (TArC).