Search Results for author: Mustafa Jarrar

Found 19 papers, 3 papers with code

SALMA: Arabic Sense-Annotated Corpus and WSD Benchmarks

no code implementations29 Oct 2023 Mustafa Jarrar, Sanad Malaysha, Tymaa Hammouda, Mohammed Khalilia

To establish a Word Sense Disambiguation baseline using our SALMA corpus, we developed an end-to-end Word Sense Disambiguation system using Target Sense Verification.

Word Sense Disambiguation

Nabra: Syrian Arabic Dialects with Morphological Annotations

no code implementations26 Oct 2023 Amal Nayouf, Tymaa Hammouda, Mustafa Jarrar, Fadi Zaraket, Mohamad-Bassam Kurdy

This paper presents Nabra, a corpora of Syrian Arabic dialects with morphological annotations.

Sentence

Arabic Fine-Grained Entity Recognition

no code implementations26 Oct 2023 Haneen Liqreina, Mustafa Jarrar, Mohammed Khalilia, Ahmed Oumar El-Shangiti, Muhammad Abdul-Mageed

To compute the baselines of WojoodF ine, we fine-tune three pre-trained Arabic BERT encoders in three settings: flat NER, nested NER and nested NER with subtypes and achieved F1 score of 0. 920, 0. 866, and 0. 885, respectively.

NER

Offensive Hebrew Corpus and Detection using BERT

1 code implementation6 Sep 2023 Nagham Hamad, Mustafa Jarrar, Mohammad Khalilia, Nadim Nashif

Fine-tuning AlephBERT on our data and testing on D_OLaH yields 69% accuracy, while fine-tuning on D_OLaH and testing on our data yields 57% accuracy, which may be an indication to the generalizability our data offers.

Context-Gloss Augmentation for Improving Arabic Target Sense Verification

no code implementations6 Feb 2023 Sanad Malaysha, Mustafa Jarrar, Mohammed Khalilia

The most common semantically-labeled dataset for Arabic is the ArabGlossBERT, a relatively small dataset that consists of 167K context-gloss pairs (about 60K positive and 107K negative pairs), collected from Arabic dictionaries.

POS

A Benchmark and Scoring Algorithm for Enriching Arabic Synonyms

1 code implementation4 Feb 2023 Sana Ghanem, Mustafa Jarrar, Radi Jarrar, Ibrahim Bounhas

Our proposed algorithm extracts synonyms from existing lexicons and computes a fuzzy value for each candidate.

Current Trends and Approaches in Synonyms Extraction: Potential Adaptation to Arabic

no code implementations20 May 2022 Eman Naser-Karajah, Nabil Arman, Mustafa Jarrar

The third approach is to construct new WordNets by exploring synonymy graphs, and the fourth approach is to find similar words from corpora using Deep Learning methods, such as word embeddings and recently BERT models.

Translation Word Embeddings

The Arabic Ontology -- An Arabic Wordnet with Ontologically Clean Content

no code implementations19 May 2022 Mustafa Jarrar

The ontology provides a formal representation of the concepts that the Arabic terms convey, and its content was built with ontological analysis in mind, and benchmarked to scientific advances and rigorous knowledge sources as much as this is possible, rather than to only speakers' beliefs as lexicons typically are.

ArabGlossBERT: Fine-Tuning BERT on Context-Gloss Pairs for WSD

no code implementations RANLP 2021 Moustafa Al-Hajj, Mustafa Jarrar

First, we constructed a dataset of labeled Arabic context-gloss pairs (~167k pairs) we extracted from the Arabic Ontology and the large lexicographic database available at Birzeit University.

Binary Classification Sentence +1

Curras + Baladi: Towards a Levantine Corpus

no code implementations LREC 2022 Karim El Haff, Mustafa Jarrar, Tymaa Hammouda, Fadi Zaraket

This is due to many factors, including the complex and rich morphology of Arabic, its high degree of ambiguity, and the presence of several regional varieties that need to be processed while taking into account their unique characteristics.

LU-BZU at SemEval-2021 Task 2: Word2Vec and Lemma2Vec performance in Arabic Word-in-Context disambiguation

no code implementations SEMEVAL 2021 Moustafa Al-Hajj, Mustafa Jarrar

This paper presents a set of experiments to evaluate and compare between the performance of using CBOW Word2Vec and Lemma2Vec models for Arabic Word-in-Context (WiC) disambiguation without using sense inventories or sense embeddings.

2k LEMMA +2

Extracting Synonyms from Bilingual Dictionaries

no code implementations EACL (GWC) 2021 Mustafa Jarrar, Eman Karajah, Muhammad Khalifa, Khaled Shaalan

We present our progress in developing a novel algorithm to extract synonyms from bilingual dictionaries.

Translation

Cannot find the paper you are looking for? You can Submit a new open access paper.