Search Results for author: Amr Keleg

Found 7 papers, 5 papers with code

Automatically Discarding Straplines to Improve Data Quality for Abstractive News Summarization

no code implementations nlppower (ACL) 2022 Amr Keleg, Matthias Lindemann, Danyang Liu, Wanqiu Long, Bonnie L. Webber

Automatic evaluation indicates that removing straplines and noise from the training data of a news summarizer results in higher quality summaries, with improvements as high as 7 points ROUGE score.

News Summarization

SMASH at Qur’an QA 2022: Creating Better Faithful Data Splits for Low-resourced Question Answering Scenarios

1 code implementation OSACT (LREC) 2022 Amr Keleg, Walid Magdy

The Qur’an QA 2022 shared task aims at assessing the possibility of building systems that can extract answers to religious questions given relevant passages from the Holy Qur’an.

Language Modelling Question Answering

Arabic Dialect Identification under Scrutiny: Limitations of Single-label Classification

1 code implementation20 Oct 2023 Amr Keleg, Walid Magdy

Automatic Arabic Dialect Identification (ADI) of text has gained great popularity since it was introduced in the early 2010s.

Dialect Identification Multi-Label Classification

ALDi: Quantifying the Arabic Level of Dialectness of Text

1 code implementation20 Oct 2023 Amr Keleg, Sharon Goldwater, Walid Magdy

Transcribed speech and user-generated text in Arabic typically contain a mixture of Modern Standard Arabic (MSA), the standardized language taught in schools, and Dialectal Arabic (DA), used in daily communications.

Dialect Identification Sentence

DLAMA: A Framework for Curating Culturally Diverse Facts for Probing the Knowledge of Pretrained Language Models

1 code implementation8 Jun 2023 Amr Keleg, Walid Magdy

A new benchmark DLAMA-v1 is built of factual triples from three pairs of contrasting cultures having a total of 78, 259 triples from 20 relation predicates.

Benchmarking Fairness

ASU\_OPTO at OSACT4 - Offensive Language Detection for Arabic text

no code implementations LREC 2020 Amr Keleg, Samhaa R. El-Beltagy, Mahmoud Khalil

In the past years, toxic comments and offensive speech are polluting the internet and manual inspection of these comments is becoming a tiresome task to manage.

An Unsupervised Method for Weighting Finite-state Morphological Analyzers

2 code implementations LREC 2020 Amr Keleg, Francis Tyers, Nick Howell, Tommi Pirinen

In this paper, we have developed a method for weighting a morphological analyzer built using finite state transducers in order to disambiguate its results.

Morphological Analysis

Cannot find the paper you are looking for? You can Submit a new open access paper.