1 code implementation • TRITON 2021 • Maria Carmela Cariello, Alessandro Lenci, Ruslan Mitkov
The domain-specialised application of Named Entity Recognition (NER) is known as Biomedical NER (BioNER), which aims to identify and classify biomedical concepts that are of interest to researchers, such as genes, proteins, chemical compounds, drugs, mutations, diseases, and so on.
no code implementations • TRITON 2021 • Martha Maria Papadopoulou, Anna Zaretskaya, Ruslan Mitkov
This paper offers a comparative evaluation of four commercial ASR systems which are evaluated according to the post-editing effort required to reach “publishable” quality and according to the number of errors they produce.
no code implementations • TRITON 2021 • Marie Escribe, Ruslan Mitkov
Despite the increasingly good quality of Machine Translation (MT) systems, MT outputs require corrections.
no code implementations • TRITON 2021 • Lígia Venturott, Ruslan Mitkov
The exponential growth of the internet and social media in the past decade gave way to the increase in dissemination of false or misleading information.
no code implementations • TRITON 2021 • Ali Hatami, Ruslan Mitkov, Gloria Corpas Pastor
In this paper, we compare the performance of two alignment methods, Grow-diag-final-and and Intersect Symmetrisation heuristics, to exploit the annotation projection of English-Brazilian Portuguese bilingual corpus to detect named entities in Brazilian Portuguese.
no code implementations • EMNLP (LaTeCHCLfL, CLFL, LaTeCH) 2021 • Maria Kunilovskaya, Ekaterina Lapshinova-Koltunski, Ruslan Mitkov
We expect that literary translations from typologically distant languages should exhibit more translationese, and the fingerprints of individual source languages (and their families) are traceable in translations.
no code implementations • RANLP 2021 • Halyna Maslak, Ruslan Mitkov
Although the research on the automatic or semi-automatic generation of multiple-choice test items has been conducted since the beginning of this millennium, most approaches focus on generating questions from a single sentence.
1 code implementation • OSACT (LREC) 2022 • Damith Premasiri, Tharindu Ranasinghe, Wajdi Zaghouani, Ruslan Mitkov
The goal of the Qur’an QA 2022 shared task is to fill this gap by producing state-of-the-art question answering and reading comprehension research on Qur’an.
no code implementations • RANLP 2021 • Maria Kunilovskaya, Ekaterina Lapshinova-Koltunski, Ruslan Mitkov
The texts are represented with frequency-based features that capture structural and lexical properties of language.
no code implementations • RANLP 2021 • Nikola Spasovski, Ruslan Mitkov
Despite the enormous popularity of Translation Memory systems and the active research in the field, their language processing features still suffer from certain limitations.
no code implementations • 2 Apr 2025 • Fabio Yáñez-Romero, Andrés Montoyo, Armando Suárez, Yoan Gutiérrez, Ruslan Mitkov
Researchers have relegated natural language processing tasks to Transformer-type models, particularly generative models, because these models exhibit high versatility when performing generation and classification tasks.
no code implementations • 20 Dec 2024 • Hansi Hettiarachchi, Tharindu Ranasinghe, Paul Rayson, Ruslan Mitkov, Mohamed Gaber, Damith Premasiri, Fiona Anting Tan, Lasitha Uyangodage
The first Workshop on Language Models for Low-Resource Languages (LoResLM 2025) was held in conjunction with the 31st International Conference on Computational Linguistics (COLING 2025) in Abu Dhabi, United Arab Emirates.
no code implementations • 26 Mar 2024 • Anna Beatriz Dimas Furtado, Tharindu Ranasinghe, Frédéric Blain, Ruslan Mitkov
In this research, we fill this gap by introducing DORE; the first dataset for Definition MOdelling for PoRtuguEse containing more than 100, 000 definitions.
1 code implementation • 18 Jul 2023 • Damith Premasiri, Tharindu Ranasinghe, Ruslan Mitkov
Text classification is an area of research which has been studied over the years in Natural Language Processing (NLP).
no code implementations • 18 May 2023 • Amal Haddad Haddad, Damith Premasiri, Tharindu Ranasinghe, Ruslan Mitkov
The domain of Botany is rich with metaphorical terms.
no code implementations • 16 Sep 2022 • Damith Premasiri, Amal Haddad Haddad, Tharindu Ranasinghe, Ruslan Mitkov
In this paper, we explore state-of-the-art neural transformers in the task of detecting MWEs in flower and plant names.
1 code implementation • 12 May 2022 • Damith Premasiri, Tharindu Ranasinghe, Wajdi Zaghouani, Ruslan Mitkov
The goal of the Qur'an QA 2022 shared task is to fill this gap by producing state-of-the-art question answering and reading comprehension research on Qur'an.
no code implementations • 2 May 2022 • Alistair Plum, Tharindu Ranasinghe, Spencer Jones, Constantin Orasan, Ruslan Mitkov
The dataset, which is aimed towards digital humanities (DH) and historical research, is automatically compiled by aligning sentences from Wikipedia articles with matching structured data from sources including Pantheon and Wikidata.
1 code implementation • ACL 2021 • Tharindu Ranasinghe, Constantin Orasan, Ruslan Mitkov
Most studies on word-level Quality Estimation (QE) of machine translation focus on language-specific models.
no code implementations • SEMEVAL 2020 • Tharindu Ranasinghe, Alistair Plum, Constantin Orasan, Ruslan Mitkov
This paper presents the RGCL team submission to SemEval 2020 Task 6: DeftEval, subtasks 1 and 2.
1 code implementation • COLING 2020 • Tharindu Ranasinghe, Constantin Orasan, Ruslan Mitkov
Recent years have seen big advances in the field of sentence-level quality estimation (QE), largely as a result of using neural-based architectures.
no code implementations • 13 Oct 2020 • Tharindu Ranasinghe, Alistair Plum, Constantin Orasan, Ruslan Mitkov
This paper presents the RGCL team submission to SemEval 2020 Task 6: DeftEval, subtasks 1 and 2.
1 code implementation • WMT (EMNLP) 2020 • Tharindu Ranasinghe, Constantin Orasan, Ruslan Mitkov
This paper presents the team TransQuest's participation in Sentence-Level Direct Assessment shared task in WMT 2020.
1 code implementation • EMNLP 2018 • Victoria Yaneva, Le An Ha, Richard Evans, Ruslan Mitkov
When processing a text, humans and machines must disambiguate between different uses of the pronoun it, including non-referential, nominal anaphoric or clause anaphoric ones.
no code implementations • EAMT 2020 • Tharindu Ranasinghe, Constantin Orasan, Ruslan Mitkov
Matching and retrieving previously translated segments from a Translation Memory is the key functionality in Translation Memories systems.
no code implementations • RANLP 2019 • Sheila Castilho, Nat{\'a}lia Resende, Ruslan Mitkov
While a number of studies have shown evidence of translationese phenomena, that is, statistical differences between original texts and translated texts (Gellerstam, 1986), results of studies searching for translationese features in postedited texts (what has been called {''}posteditese{''} (Daems et al., 2017)) have presented mixed results.
no code implementations • RANLP 2019 • Tharindu Ranasinghe, Constantin Orasan, Ruslan Mitkov
Calculating the Semantic Textual Similarity (STS) is an important research area in natural language processing which plays a significant role in many applications such as question answering, document summarisation, information retrieval and information extraction.
no code implementations • RANLP 2019 • Tharindu Ranasinghe, Constantin Orasan, Ruslan Mitkov
Calculating Semantic Textual Similarity (STS) plays a significant role in many applications such as question answering, document summarisation, information retrieval and information extraction.
Contextualised Word Representations
Information Retrieval
+6
no code implementations • SEMEVAL 2019 • Alistair Plum, Tharindu Ranasinghe, Pablo Calleja, Constantin Or{\u{a}}san, Ruslan Mitkov
This article describes the system submitted by the RGCL-WLV team to the SemEval 2019 Task 12: Toponym resolution in scientific papers.
2 code implementations • NAACL 2019 • Omid Rohanian, Shiva Taslimipoor, Samaneh Kouchaki, Le An Ha, Ruslan Mitkov
We introduce a new method to tag Multiword Expressions (MWEs) using a linguistically interpretable language-independent deep learning architecture.
no code implementations • SEMEVAL 2018 • Omid Rohanian, Shiva Taslimipoor, Richard Evans, Ruslan Mitkov
This paper describes the systems submitted to SemEval 2018 Task 3 {``}Irony detection in English tweets{''} for both subtasks A and B.
no code implementations • SEMEVAL 2018 • Shiva Taslimipoor, Omid Rohanian, Le An Ha, Gloria Corpas Pastor, Ruslan Mitkov
This paper describes the system submitted to SemEval 2018 shared task 10 {`}Capturing Dicriminative Attributes{'}.
no code implementations • RANLP 2017 • Andrea Silvestre Baquero, Ruslan Mitkov
These tools operate on fuzzy (surface) matching mostly and cannot benefit from already translated texts which are synonymous to (or paraphrased versions of) the text to be translated.
no code implementations • WS 2017 • Sanja {\v{S}}tajner, Victoria Yaneva, Ruslan Mitkov, Simone Paolo Ponzetto
Eye tracking studies from the past few decades have shaped the way we think of word complexity and cognitive load: words that are long, rare and ambiguous are more difficult to read.
no code implementations • WS 2017 • Shiva Taslimipoor, Omid Rohanian, Ruslan Mitkov, Afsaneh Fazly
This study investigates the supervised token-based identification of Multiword Expressions (MWEs).
no code implementations • LREC 2016 • Victoria Yaneva, Irina Temnikova, Ruslan Mitkov
This division of the groups informs researchers on whether particular fixations were elicited from skillful or less-skillful readers and allows a fair between-group comparison for two levels of reading ability.
no code implementations • LREC 2016 • Victoria Yaneva, Irina Temnikova, Ruslan Mitkov
This paper presents an approach for automatic evaluation of the readability of text simplification output for readers with cognitive disabilities.
no code implementations • LREC 2012 • Sanja {\v{S}}tajner, Ruslan Mitkov
In British English, we compared the complexity of texts published in 1931, 1961 and 1991, while in American English we compared the complexity of texts published in 1961 and 1992.
no code implementations • LREC 2012 • Irina Temnikova, Constantin Orasan, Ruslan Mitkov
This article presents a new linguistic resource in the form of Controlled Language (CL) guidelines for manual text simplification in the CM domain which aims to address high TC in the CM domain and produce clear messages to be used in crisis situations.
no code implementations • LREC 2012 • Natalia Konstantinova, Sheila C.M. de Sousa, Noa P. Cruz, Manuel J. Ma{\~n}a, Maite Taboada, Ruslan Mitkov
This paper presents a freely available resource for research on handling negation and speculation in review texts.