no code implementations • LREC 2022 • Callum Booth, Robert Shoemaker, Robert Gaizauskas
We hypothesise and evaluate a language model-based approach for scoring the quality of OCR transcriptions in the British Library Newspapers (BLN) corpus parts 1 and 2, to identify the best quality OCR for use in further natural language processing tasks, with a wider view to link individual newspaper reports of crime in nineteenth-century London to the Digital Panopticon—a structured repository of criminal lives.
no code implementations • LREC 2022 • Emma Barker, Jon Barker, Robert Gaizauskas, Ning Ma, Monica Lestari Paramita
We present SNuC, the first published corpus of spoken alphanumeric identifiers of the sort typically used as serial and part numbers in the manufacturing sector.
no code implementations • 21 Oct 2024 • Jason Chan, Robert Gaizauskas, Zhixue Zhao
Formal logic has long been applied to natural language reasoning, but this approach can sometimes lead to conclusions that, while logically entailed, are factually inconsistent with the premises or are not typically inferred by humans.
no code implementations • 9 Jan 2018 • Yu-Xing Tang, Josiah Wang, Xiaofang Wang, Boyang Gao, Emmanuel Dellandrea, Robert Gaizauskas, Liming Chen
This is done by modeling the differences between the two on categories with both image-level and bounding box annotations, and transferring this information to convert classifiers to detectors for categories without bounding box annotations.
no code implementations • CVPR 2016 • Yu-Xing Tang, Josiah Wang, Boyang Gao, Emmanuel Dellandrea, Robert Gaizauskas, Liming Chen
This is done by modeling the differences between the two on categories with both image-level and bounding box annotations, and transferring this information to convert classifiers to detectors for categories without bounding box annotations.
no code implementations • LREC 2016 • Adam Funk, Robert Gaizauskas, Benoit Favre
We present a successfully implemented document repository REST service for flexible SCRUD (search, create, read, update, delete) storage of social media conversations, using a GATE/TIPSTER-like document object model and providing a query language for document features.
no code implementations • LREC 2016 • Josiah Wang, Robert Gaizauskas
The task of automatically generating sentential descriptions of image content has become increasingly popular in recent years, resulting in the development of large-scale image description datasets and the proposal of various metrics for evaluating image description generation systems.
no code implementations • LREC 2016 • Emma Barker, Monica Paramita, Adam Funk, Emina Kurtic, Ahmet Aker, Jonathan Foster, Mark Hepple, Robert Gaizauskas
Second, we define a task-based evaluation framework for reader comment summarization that allows summarization systems to be assessed in terms of how well they support users in a time-limited task of identifying issues and characterising opinion on issues in comments.
no code implementations • LREC 2014 • Ahmet Aker, Monica Paramita, Emma Barker, Robert Gaizauskas
Terminology extraction resources are needed for a wide range of human language technology applications, including knowledge management, information extraction, semantic search, cross-language information retrieval and automatic and assisted translation.
1 code implementation • LREC 2014 • Ahmet Aker, Monica Paramita, Mārcis Pinnis, Robert Gaizauskas
In this work we present three different methods for cleaning noise from automatically generated bilingual dictionaries: LLR-based, pivot-based and translation-based approaches.
1 code implementation • LREC 2012 • Hector Llorens, Leon Derczynski, Robert Gaizauskas, Estela Saquete
In this paper, we present TIMEN, a community-driven tool for temporal expression normalisation.
Ranked #1 on Timex normalization on TimeBank
no code implementations • LREC 2012 • Monica Lestari Paramita, Paul Clough, Ahmet Aker, Robert Gaizauskas
In this work, we investigate the correlation between similarity measures utilising language-independent and language-dependent features and respective human judgments.
no code implementations • LREC 2012 • Ahmet Aker, Evangelos Kanoulas, Robert Gaizauskas
In this work we aim to reduce the amount of time and resources spent on tasks 1 and 2.
no code implementations • LREC 2012 • Inguna Skadiņa, Ahmet Aker, Nikos Mastropavlos, Fangzhong Su, Dan Tufis, Mateja Verlic, Andrejs Vasiļjevs, Bogdan Babych, Paul Clough, Robert Gaizauskas, Nikos Glaros, Monica Lestari Paramita, Mārcis Pinnis
Lack of sufficient parallel data for many languages and domains is currently one of the major obstacles to further advancement of automated translation.
no code implementations • LREC 2012 • Emma Barker, Robert Gaizauskas
Comparable news texts are frequently proposed as a potential source of alignable subsentential fragments for use in statistical machine translation systems.