Search Results for author: El Moatez Billah Nagoudi

Found 32 papers, 10 papers with code

Peacock: A Family of Arabic Multimodal Large Language Models and Benchmarks

1 code implementation1 Mar 2024 Fakhraddin Alwajih, El Moatez Billah Nagoudi, Gagan Bhatia, Abdelrahman Mohamed, Muhammad Abdul-Mageed

Multimodal large language models (MLLMs) have proven effective in a wide range of tasks requiring complex reasoning and linguistic comprehension.

Visual Reasoning

FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models

no code implementations16 Feb 2024 Gagan Bhatia, El Moatez Billah Nagoudi, Hasan Cavusoglu, Muhammad Abdul-Mageed

We introduce FinTral, a suite of state-of-the-art multimodal large language models (LLMs) built upon the Mistral-7b model and tailored for financial analysis.

Decision Making Retrieval

Beyond English: Evaluating LLMs for Arabic Grammatical Error Correction

no code implementations13 Dec 2023 Sang Yun Kwon, Gagan Bhatia, El Moatez Billah Nagoudi, Muhammad Abdul-Mageed

Our best model achieves a new SOTA on Arabic GEC, with $73. 29$ and $73. 26$ F$_{1}$ on the 2014 and 2015 QALB datasets, respectively, compared to peer-reviewed published baselines.

Few-Shot Learning Grammatical Error Correction +1

Octopus: A Multitask Model and Toolkit for Arabic Natural Language Generation

no code implementations24 Oct 2023 AbdelRahim Elmadany, El Moatez Billah Nagoudi, Muhammad Abdul-Mageed

While many researchers have proposed models and solutions for individual problems, there is an acute shortage of a comprehensive Arabic natural language generation toolkit that is capable of handling a wide range of tasks.

Text Generation

GPTAraEval: A Comprehensive Evaluation of ChatGPT on Arabic NLP

no code implementations24 May 2023 Md Tawkat Islam Khondaker, Abdul Waheed, El Moatez Billah Nagoudi, Muhammad Abdul-Mageed

Although we further explore and confirm the utility of employing GPT-4 as a potential alternative for human evaluation, our work adds to a growing body of research underscoring the limitations of ChatGPT.

Natural Language Understanding

Dolphin: A Challenging and Diverse Benchmark for Arabic NLG

no code implementations24 May 2023 El Moatez Billah Nagoudi, AbdelRahim Elmadany, Ahmed El-Shangiti, Muhammad Abdul-Mageed

We present Dolphin, a novel benchmark that addresses the need for a natural language generation (NLG) evaluation framework dedicated to the wide collection of Arabic languages and varieties.

Dialogue Generation Machine Translation +3

ORCA: A Challenging Benchmark for Arabic Language Understanding

no code implementations21 Dec 2022 AbdelRahim Elmadany, El Moatez Billah Nagoudi, Muhammad Abdul-Mageed

Due to their crucial role in all NLP, several benchmarks have been proposed to evaluate pretrained language models.

JASMINE: Arabic GPT Models for Few-Shot Learning

no code implementations21 Dec 2022 El Moatez Billah Nagoudi, Muhammad Abdul-Mageed, AbdelRahim Elmadany, Alcides Alcoba Inciarte, Md Tawkat Islam Khondaker

Scholarship on generative pretraining (GPT) remains acutely Anglocentric, leaving serious gaps in our understanding of the whole class of autoregressive models.

Few-Shot Learning

Decay No More: A Persistent Twitter Dataset for Learning Social Meaning

1 code implementation10 Apr 2022 Chiyu Zhang, Muhammad Abdul-Mageed, El Moatez Billah Nagoudi

With the proliferation of social media, many studies resort to social media to construct datasets for developing social meaning understanding systems.

ARBERT \& MARBERT: Deep Bidirectional Transformers for Arabic

no code implementations ACL 2021 Muhammad Abdul-Mageed, AbdelRahim Elmadany, El Moatez Billah Nagoudi

To evaluate our models, we also introduce ARLUE, a new benchmark for multi-dialectal Arabic language understanding evaluation.

XLM-R

ARBERT & MARBERT: Deep Bidirectional Transformers for Arabic

2 code implementations27 Dec 2020 Muhammad Abdul-Mageed, AbdelRahim Elmadany, El Moatez Billah Nagoudi

To evaluate our models, we also introduce ARLUE, a new benchmark for multi-dialectal Arabic language understanding evaluation.

XLM-R

Translating Similar Languages: Role of Mutual Intelligibility in Multilingual Transformers

no code implementations WMT (EMNLP) 2020 Ife Adebara, El Moatez Billah Nagoudi, Muhammad Abdul Mageed

We investigate different approaches to translate between similar languages under low resource conditions, as part of our contribution to the WMT 2020 Similar Languages Translation Shared Task.

Translation

Machine Generation and Detection of Arabic Manipulated and Fake News

1 code implementation COLING (WANLP) 2020 El Moatez Billah Nagoudi, AbdelRahim Elmadany, Muhammad Abdul-Mageed, Tariq Alhindi, Hasan Cavusoglu

Finally, we develop the first models for detecting manipulated Arabic news and achieve state-of-the-art results on Arabic fake news detection (macro F1=70. 06).

Fake News Detection POS

Growing Together: Modeling Human Language Learning With n-Best Multi-Checkpoint Machine Translation

no code implementations WS 2020 El Moatez Billah Nagoudi, Muhammad Abdul-Mageed, Hasan Cavusoglu

We describe our submission to the 2020 Duolingo Shared Task on Simultaneous Translation And Paraphrase for Language Education (STAPLE) (Mayhew et al., 2020).

Machine Translation Translation

ArbEngVec : Arabic-English Cross-Lingual Word Embedding Model

no code implementations WS 2019 Raki Lachraf, El Moatez Billah Nagoudi, Youcef Ayachi, Ahmed Abdelali, Didier Schwab

Word Embeddings (WE) are getting increasingly popular and widely applied in many Natural Language Processing (NLP) applications due to their effectiveness in capturing semantic properties of words; Machine Translation (MT), Information Retrieval (IR) and Information Extraction (IE) are among such areas.

Information Retrieval Machine Translation +6

Semantic Similarity of Arabic Sentences with Word Embeddings

no code implementations WS 2017 El Moatez Billah Nagoudi, Didier Schwab

Semantic textual similarity is the basis of countless applications and plays an important role in diverse areas, such as information retrieval, plagiarism detection, information extraction and machine translation.

Descriptive Information Retrieval +10

Cannot find the paper you are looking for? You can Submit a new open access paper.