Search Results for author: Eric Atwell

Found 22 papers, 0 papers with code

Quranic Verses Semantic Relatedness Using AraBERT

no code implementations EACL (WANLP) 2021 Abdullah Alsaleh, Eric Atwell, Abdulrahman Altahhan

Bidirectional Encoder Representations from Transformers (BERT) has gained popularity in recent years producing state-of-the-art performances across Natural Language Processing tasks.

Language Modelling

Automatic Hadith Segmentation using PPM Compression

no code implementations ICON 2020 Taghreed Tarmom, Eric Atwell, Mohammad Alsalka

In this paper we explore the use of Prediction by partial matching (PPM) compression based to segment Hadith into its two main components (Isnad and Matan).

Segmentation

WEKA in Forensic Authorship Analysis: A corpus-based approach of Saudi Authors

no code implementations ICON 2020 Mashael AlAmr, Eric Atwell

This is a pilot study that aims to explore the potential of using WEKA in forensic authorship analysis.

Constructing a Bilingual Hadith Corpus Using a Segmentation Tool

no code implementations LREC 2020 Shatha Altammami, Eric Atwell, Ammar Alsalka

This article describes the process of gathering and constructing a bilingual parallel corpus of Islamic Hadith, which is the set of narratives reporting different aspects of the prophet Muhammad{'}s life.

Multi-Level Analysis and Annotation of Arabic Corpora for Text-to-Sign Language MT

no code implementations24 May 2016 Abdelaziz Lakhfif, Mohammed T. Laskri, Eric Atwell

In this paper, we present an ongoing effort in lexical semantic analysis and annotation of Modern Standard Arabic (MSA) text, a semi automatic annotation tool concerned with the morphologic, syntactic, and semantic levels of description.

Compilation of an Arabic Children's Corpus

no code implementations LREC 2016 Latifa Al-Sulaiti, Noorhan Abbas, Claire Brierley, Eric Atwell, Ayman Alghamdi

Inspired by the Oxford Children{'}s Corpus, we have developed a prototype corpus of Arabic texts written and/or selected for children.

General Classification text-classification +1

An Empirical Study of Arabic Formulaic Sequence Extraction Methods

no code implementations LREC 2016 Ayman Alghamdi, Eric Atwell, Claire Brierley

This paper aims to implement what is referred to as the collocation of the Arabic keywords approach for extracting formulaic sequences (FSs) in the form of high frequency but semantically regular formulas that are not restricted to any syntactic construction or semantic domain.

Tools for Arabic Natural Language Processing: a case study in qalqalah prosody

no code implementations LREC 2014 Claire Brierley, Majdi Sawalha, Eric Atwell

In this paper, we focus on the prosodic effect of qalqalah or {``}vibration{''} applied to a subset of Arabic consonants under certain constraints during correct Qur{'}anic recitation or ta{\c{C}}{\S}w{\=\i}d, using our Boundary-Annotated QurÂ’an dataset of 77430 words (Brierley et al 2012; Sawalha et al 2014).

Keyword Extraction

A Comparative Study of Machine Learning Methods for Verbal Autopsy Text Classification

no code implementations18 Feb 2014 Samuel Danso, Eric Atwell, Owen Johnson

We report on a comparative study of the processes involved in Text Classification applied to classifying Cause of Death: feature value representation; machine learning classification algorithms; and feature reduction strategies in order to identify the suitable approaches applicable to the classification of Verbal Autopsy text.

BIG-bench Machine Learning General Classification +2

LAMP: A Multimodal Web Platform for Collaborative Linguistic Analysis

no code implementations LREC 2012 Kais Dukes, Eric Atwell

We provide a description of the underlying software system that has been used to develop the corpus annotations.

Part-Of-Speech Tagging Translation

Predicting Phrase Breaks in Classical and Modern Standard Arabic Text

no code implementations LREC 2012 Majdi Sawalha, Claire Brierley, Eric Atwell

We train and test two probabilistic taggers for Arabic phrase break prediction on a purpose-built, “gold standard”, boundary-annotated and PoS-tagged Qur'an corpus of 77430 words and 8230 sentences.

Chunking Human Parsing +3

QurSim: A corpus for evaluation of relatedness in short texts

no code implementations LREC 2012 Abdul-Baquee Sharaf, Eric Atwell

This paper presents a large corpus created from the original Quranic text, where semantically similar or related verses are linked together.

Information Retrieval Machine Translation +4

QurAna: Corpus of the Quran annotated with Pronominal Anaphora

no code implementations LREC 2012 Abdul-Baquee Sharaf, Eric Atwell

These antecedents are maintained as an ontological list of concepts, which have proved helpful for information retrieval tasks.

Coreference Resolution Information Retrieval +3

Open-Source Boundary-Annotated Corpus for Arabic Speech and Language Processing

no code implementations LREC 2012 Claire Brierley, Majdi Sawalha, Eric Atwell

We take a novel approach to phrase break prediction for Arabic, deriving our prosodic annotation scheme from Tajw{\=\i}d (recitation) mark-up in the Qur'an which we then interpret as additional text-based data for computational analysis.

Chunking Descriptive +2

Cannot find the paper you are looking for? You can Submit a new open access paper.