Search Results for author: Mustafa Jarrar

Found 30 papers, 3 papers with code

SinaTools: Open Source Toolkit for Arabic Natural Language Processing

no code implementations · 3 Nov 2024 · Tymaa Hammouda, Mustafa Jarrar, Mohammed Khalilia

We introduce SinaTools, an open-source Python package for Arabic natural language processing and understanding.

Benchmarking Lemmatization +8

Event-Arguments Extraction Corpus and Modeling using BERT for Arabic

no code implementations · 30 Jul 2024 · Alaa Aljabari, Lina Duaibes, Mustafa Jarrar, Mohammed Khalilia

Additionally, we propose a novel method for event relation extraction using BERT, in which we treat the task as text entailment.

Event Argument Extraction Event Relation Extraction +2

ArabicNLU 2024: The First Arabic Natural Language Understanding Shared Task

no code implementations · 30 Jul 2024 · Mohammed Khalilia, Sanad Malaysha, Reem Suwaileh, Mustafa Jarrar, Alaa Aljabari, Tamer Elsayed, Imed Zitouni

This paper presents an overview of the Arabic Natural Language Understanding (ArabicNLU 2024) shared task, focusing on two subtasks: Word Sense Disambiguation (WSD) and Location Mention Disambiguation (LMD).

Natural Language Understanding Word Sense Disambiguation

Sina at FigNews 2024: Multilingual Datasets Annotated with Bias and Propaganda

no code implementations · 12 Jul 2024 · Lina Duaibes, Areej Jaber, Mustafa Jarrar, Ahmad Qadi, Mais Qandeel

The proliferation of bias and propaganda on social media is an increasingly significant concern, leading to the development of techniques for automatic detection.

Qabas: An Open-Source Arabic Lexicographic Database

no code implementations · 6 Jun 2024 · Mustafa Jarrar, Tymaa Hammouda

Compared with other lexicons, Qabas stands as the most extensive Arabic lexicon, encompassing about 58K lemmas (45K nominal lemmas, 12.5K verbal lemmas, and 473 functional-word lemmas).

SALMA: Arabic Sense-Annotated Corpus and WSD Benchmarks

no code implementations · 29 Oct 2023 · Mustafa Jarrar, Sanad Malaysha, Tymaa Hammouda, Mohammed Khalilia

To establish a Word Sense Disambiguation baseline using our SALMA corpus, we developed an end-to-end Word Sense Disambiguation system using Target Sense Verification.

Word Sense Disambiguation

Arabic Fine-Grained Entity Recognition

no code implementations · 26 Oct 2023 · Haneen Liqreina, Mustafa Jarrar, Mohammed Khalilia, Ahmed Oumar El-Shangiti, Muhammad Abdul-Mageed

To compute the baselines of WojoodFine, we fine-tune three pre-trained Arabic BERT encoders in three settings: flat NER, nested NER, and nested NER with subtypes, and achieve F1 scores of 0.920, 0.866, and 0.885, respectively.

NER

Nabra: Syrian Arabic Dialects with Morphological Annotations

no code implementations · 26 Oct 2023 · Amal Nayouf, Tymaa Hammouda, Mustafa Jarrar, Fadi Zaraket, Mohamad-Bassam Kurdy

This paper presents Nabra, a corpus of Syrian Arabic dialects with morphological annotations.

Sentence

Offensive Hebrew Corpus and Detection using BERT

1 code implementation · 6 Sep 2023 · Nagham Hamad, Mustafa Jarrar, Mohammad Khalilia, Nadim Nashif

Fine-tuning AlephBERT on our data and testing on D_OLaH yields 69% accuracy, while fine-tuning on D_OLaH and testing on our data yields 57% accuracy, which may indicate the generalizability our data offers.

Context-Gloss Augmentation for Improving Arabic Target Sense Verification

no code implementations · 6 Feb 2023 · Sanad Malaysha, Mustafa Jarrar, Mohammed Khalilia

The most common semantically labeled dataset for Arabic is ArabGlossBERT, a relatively small dataset that consists of 167K context-gloss pairs (about 60K positive and 107K negative pairs), collected from Arabic dictionaries.

POS

A Benchmark and Scoring Algorithm for Enriching Arabic Synonyms

1 code implementation · 4 Feb 2023 · Sana Ghanem, Mustafa Jarrar, Radi Jarrar, Ibrahim Bounhas

Our proposed algorithm extracts synonyms from existing lexicons and computes a fuzzy value for each candidate.

Current Trends and Approaches in Synonyms Extraction: Potential Adaptation to Arabic

no code implementations · 20 May 2022 · Eman Naser-Karajah, Nabil Arman, Mustafa Jarrar

The third approach is to construct new WordNets by exploring synonymy graphs, and the fourth approach is to find similar words from corpora using Deep Learning methods, such as word embeddings and recently BERT models.

Translation Word Embeddings

Curras + Baladi: Towards a Levantine Corpus

no code implementations · LREC 2022 · Karim El Haff, Mustafa Jarrar, Tymaa Hammouda, Fadi Zaraket

This is due to many factors, including the complex and rich morphology of Arabic, its high degree of ambiguity, and the presence of several regional varieties that need to be processed while taking into account their unique characteristics.

ArabGlossBERT: Fine-Tuning BERT on Context-Gloss Pairs for WSD

no code implementations · RANLP 2021 · Moustafa Al-Hajj, Mustafa Jarrar

First, we constructed a dataset of labeled Arabic context-gloss pairs (~167k pairs) that we extracted from the Arabic Ontology and the large lexicographic database available at Birzeit University.

Binary Classification Sentence +1

The Arabic Ontology -- An Arabic Wordnet with Ontologically Clean Content

no code implementations · 19 May 2022 · Mustafa Jarrar

The ontology provides a formal representation of the concepts that the Arabic terms convey, and its content was built with ontological analysis in mind, and benchmarked to scientific advances and rigorous knowledge sources as much as this is possible, rather than to only speakers' beliefs as lexicons typically are.

LU-BZU at SemEval-2021 Task 2: Word2Vec and Lemma2Vec performance in Arabic Word-in-Context disambiguation

no code implementations · SEMEVAL 2021 · Moustafa Al-Hajj, Mustafa Jarrar

This paper presents a set of experiments to evaluate and compare the performance of CBOW Word2Vec and Lemma2Vec models for Arabic Word-in-Context (WiC) disambiguation without using sense inventories or sense embeddings.

2k LEMMA +2

Extracting Synonyms from Bilingual Dictionaries

no code implementations · EACL (GWC) 2021 · Mustafa Jarrar, Eman Karajah, Muhammad Khalifa, Khaled Shaalan

We present our progress in developing a novel algorithm to extract synonyms from bilingual dictionaries.

Translation
