1 code implementation • EACL (WANLP) 2021 • Wissam Antoun, Fady Baly, Hazem Hajj
In this paper, we develop the first advanced Arabic language generation model, AraGPT2, trained from scratch on a large Arabic corpus of internet text and news articles.
1 code implementation • EACL (WANLP) 2021 • Wissam Antoun, Fady Baly, Hazem Hajj
Advances in English language representation enabled a more sample-efficient pre-training task by Efficiently Learning an Encoder that Classifies Token Replacements Accurately (ELECTRA).
no code implementations • LREC 2020 • Marc Djandji, Fady Baly, Wissam Antoun, Hazem Hajj
The shared task on Offensive Language Detection at OSACT4 aimed to achieve state-of-the-art profane language detection methods for Arabic social media.
3 code implementations • LREC 2020 • Wissam Antoun, Fady Baly, Hazem Hajj
Recently, with the surge of transformer-based models, language-specific BERT-based models have proven to be very efficient at language understanding, provided they are pre-trained on a very large corpus.
Ranked #1 on Sentiment Analysis on AJGT