Search Results for author: AbdelRahim Elmadany

Found 29 papers, 10 papers with code

Cheetah: Natural Language Generation for 517 African Languages

no code implementations2 Jan 2024 Ife Adebara, AbdelRahim Elmadany, Muhammad Abdul-Mageed

The findings of this study contribute to advancing NLP research in low-resource settings, enabling greater accessibility and inclusion for African languages in a rapidly expanding digital landscape.

Language Modelling Text Generation

Octopus: A Multitask Model and Toolkit for Arabic Natural Language Generation

no code implementations24 Oct 2023 AbdelRahim Elmadany, El Moatez Billah Nagoudi, Muhammad Abdul-Mageed

While many researchers have proposed models and solutions for individual problems, there is an acute shortage of a comprehensive Arabic natural language generation toolkit that is capable of handling a wide range of tasks.

Text Generation

On the Robustness of Arabic Speech Dialect Identification

no code implementations1 Jun 2023 Peter Sullivan, AbdelRahim Elmadany, Muhammad Abdul-Mageed

As these pipelines require application of ADI tools to potentially out-of-domain data, we aim to investigate how vulnerable the tools may be to this domain shift.

Dialect Identification Self-Supervised Learning +3

Dolphin: A Challenging and Diverse Benchmark for Arabic NLG

no code implementations24 May 2023 El Moatez Billah Nagoudi, AbdelRahim Elmadany, Ahmed El-Shangiti, Muhammad Abdul-Mageed

We present Dolphin, a novel benchmark that addresses the need for a natural language generation (NLG) evaluation framework dedicated to the wide collection of Arabic languages and varieties.

Dialogue Generation Machine Translation +3

UBC-DLNLP at SemEval-2023 Task 12: Impact of Transfer Learning on African Sentiment Analysis

no code implementations21 Apr 2023 Gagan Bhatia, Ife Adebara, AbdelRahim Elmadany, Muhammad Abdul-Mageed

We describe our contribution to the SemEVAl 2023 AfriSenti-SemEval shared task, where we tackle the task of sentiment analysis in 14 different African languages.

Sentiment Analysis Transfer Learning

JASMINE: Arabic GPT Models for Few-Shot Learning

no code implementations21 Dec 2022 El Moatez Billah Nagoudi, Muhammad Abdul-Mageed, AbdelRahim Elmadany, Alcides Alcoba Inciarte, Md Tawkat Islam Khondaker

Scholarship on generative pretraining (GPT) remains acutely Anglocentric, leaving serious gaps in our understanding of the whole class of autoregressive models.

Few-Shot Learning

ORCA: A Challenging Benchmark for Arabic Language Understanding

no code implementations21 Dec 2022 AbdelRahim Elmadany, El Moatez Billah Nagoudi, Muhammad Abdul-Mageed

Due to their crucial role in all NLP, several benchmarks have been proposed to evaluate pretrained language models.

SERENGETI: Massively Multilingual Language Models for Africa

no code implementations21 Dec 2022 Ife Adebara, AbdelRahim Elmadany, Muhammad Abdul-Mageed, Alcides Alcoba Inciarte

Multilingual pretrained language models (mPLMs) acquire valuable, generalizable linguistic information during pretraining and have advanced the state of the art on task-specific finetuning.

Language Modelling Natural Language Understanding

ARBERT \& MARBERT: Deep Bidirectional Transformers for Arabic

no code implementations ACL 2021 Muhammad Abdul-Mageed, AbdelRahim Elmadany, El Moatez Billah Nagoudi

To evaluate our models, we also introduce ARLUE, a new benchmark for multi-dialectal Arabic language understanding evaluation.

XLM-R

NADI 2021: The Second Nuanced Arabic Dialect Identification Shared Task

1 code implementation EACL (WANLP) 2021 Muhammad Abdul-Mageed, Chiyu Zhang, AbdelRahim Elmadany, Houda Bouamor, Nizar Habash

This Shared Task includes four subtasks: country-level Modern Standard Arabic (MSA) identification (Subtask 1. 1), country-level dialect identification (Subtask 1. 2), province-level MSA identification (Subtask 2. 1), and province-level sub-dialect identification (Subtask 2. 2).

Dialect Identification

ARBERT & MARBERT: Deep Bidirectional Transformers for Arabic

2 code implementations27 Dec 2020 Muhammad Abdul-Mageed, AbdelRahim Elmadany, El Moatez Billah Nagoudi

To evaluate our models, we also introduce ARLUE, a new benchmark for multi-dialectal Arabic language understanding evaluation.

XLM-R

Machine Generation and Detection of Arabic Manipulated and Fake News

1 code implementation COLING (WANLP) 2020 El Moatez Billah Nagoudi, AbdelRahim Elmadany, Muhammad Abdul-Mageed, Tariq Alhindi, Hasan Cavusoglu

Finally, we develop the first models for detecting manipulated Arabic news and achieve state-of-the-art results on Arabic fake news detection (macro F1=70. 06).

Fake News Detection POS

Toward Micro-Dialect Identification in Diaglossic and Code-Switched Environments

1 code implementation EMNLP 2020 Muhammad Abdul-Mageed, Chiyu Zhang, AbdelRahim Elmadany, Lyle Ungar

Although the prediction of dialects is an important language processing task, with a wide range of applications, existing work is largely limited to coarse-grained varieties.

Dialect Identification Language Modelling +1

Leveraging Affective Bidirectional Transformers for Offensive Language Detection

no code implementations LREC 2020 AbdelRahim Elmadany, Chiyu Zhang, Muhammad Abdul-Mageed, Azadeh Hashemi

Social media are pervasive in our life, making it necessary to ensure safe online experiences by detecting and removing offensive and hate speech.

Data Augmentation Feature Engineering +1

Sentence-Level BERT and Multi-Task Learning of Age and Gender in Social Media

no code implementations2 Nov 2019 Muhammad Abdul-Mageed, Chiyu Zhang, Arun Rajendran, AbdelRahim Elmadany, Michael Przystupa, Lyle Ungar

In this work we exploit a newly-created Arabic dataset with ground truth age and gender labels to learn these attributes both individually and in a multi-task setting at the sentence level.

Multi-Task Learning Sentence

Cannot find the paper you are looking for? You can Submit a new open access paper.