Search Results for author: Claudia Borg

Found 13 papers, 4 papers with code

Crowd-sourcing evaluation of automatically acquired, morphologically related word groupings

no code implementations LREC 2014 Claudia Borg, Albert Gatt

The automatic discovery and clustering of morphologically related words is an important problem with several practical applications.

Clustering

Morphological Analysis for the Maltese Language: The Challenges of a Hybrid System

no code implementations WS 2017 Claudia Borg, Albert Gatt

In particular, we analyse a dataset of morphologically related word clusters to evaluate the difference in results for concatenative and nonconcatenative clusters.

Clustering Morphological Analysis

Face2Text: Collecting an Annotated Image Description Corpus for the Generation of Rich Face Descriptions

1 code implementation LREC 2018 Albert Gatt, Marc Tanti, Adrian Muscat, Patrizia Paggio, Reuben A. Farrugia, Claudia Borg, Kenneth P. Camilleri, Mike Rosner, Lonneke van der Plas

To gain a better understanding of the variation we find in face description and the possible issues that this may raise, we also conducted an annotation study on a subset of the corpus.

CUNI--Malta system at SIGMORPHON 2019 Shared Task on Morphological Analysis and Lemmatization in context: Operation-based word formation

no code implementations WS 2019 Ronald Cardenas, Claudia Borg, Daniel Zeman

This paper presents the submission by the Charles University-University of Malta team to the SIGMORPHON 2019 Shared Task on Morphological Analysis and Lemmatization in context.

Lemmatization Morphological Analysis +1

On the Language-specificity of Multilingual BERT and the Impact of Fine-tuning

1 code implementation EMNLP (BlackboxNLP) 2021 Marc Tanti, Lonneke van der Plas, Claudia Borg, Albert Gatt

Recent work has shown evidence that the knowledge acquired by multilingual BERT (mBERT) has two components: a language-specific and a language-neutral one.

Language Identification Natural Language Inference +3

Analysis of Data Augmentation Methods for Low-Resource Maltese ASR

no code implementations15 Nov 2021 Andrea DeMarco, Carlos Mena, Albert Gatt, Claudia Borg, Aiden Williams, Lonneke van der Plas

Recent years have seen an increased interest in the computational speech processing of Maltese, but resources remain sparse.

Data Augmentation Language Modelling +2

Face2Text revisited: Improved data set and baseline results

no code implementations PVLAM (LREC) 2022 Marc Tanti, Shaun Abdilla, Adrian Muscat, Claudia Borg, Reuben A. Farrugia, Albert Gatt

To encourage the development of more human-focused descriptions, we developed a new data set of facial descriptions based on the CelebA image data set.

Transfer Learning

Cross-Lingual Transfer from Related Languages: Treating Low-Resource Maltese as Multilingual Code-Switching

1 code implementation30 Jan 2024 Kurt Micallef, Nizar Habash, Claudia Borg, Fadhl Eryani, Houda Bouamor

Although multilingual language models exhibit impressive cross-lingual transfer capabilities on unseen languages, the performance on downstream tasks is impacted when there is a script disparity with the languages used in the multilingual model's pre-training data.

Cross-Lingual Transfer Transliteration

Cannot find the paper you are looking for? You can Submit a new open access paper.