Search Results for author: Robert Gaizauskas

Found 31 papers, 2 papers with code

A Language Modelling Approach to Quality Assessment of OCR’ed Historical Text

no code implementations LREC 2022 Callum Booth, Robert Shoemaker, Robert Gaizauskas

We hypothesise and evaluate a language model-based approach for scoring the quality of OCR transcriptions in the British Library Newspapers (BLN) corpus parts 1 and 2, to identify the best quality OCR for use in further natural language processing tasks, with a wider view to link individual newspaper reports of crime in nineteenth-century London to the Digital Panopticon—a structured repository of criminal lives.

Language Modelling Optical Character Recognition (OCR)

SNuC: The Sheffield Numbers Spoken Language Corpus

no code implementations LREC 2022 Emma Barker, Jon Barker, Robert Gaizauskas, Ning Ma, Monica Lestari Paramita

We present SNuC, the first published corpus of spoken alphanumeric identifiers of the sort typically used as serial and part numbers in the manufacturing sector.

Rulebreakers Challenge: Revealing a Blind Spot in Large Language Models' Reasoning with Formal Logic

no code implementations21 Oct 2024 Jason Chan, Robert Gaizauskas, Zhixue Zhao

Formal logic has long been applied to natural language reasoning, but this approach can sometimes lead to conclusions that, while logically entailed, are factually inconsistent with the premises or are not typically inferred by humans.

Formal Logic World Knowledge

Visual and Semantic Knowledge Transfer for Large Scale Semi-supervised Object Detection

no code implementations9 Jan 2018 Yu-Xing Tang, Josiah Wang, Xiaofang Wang, Boyang Gao, Emmanuel Dellandrea, Robert Gaizauskas, Liming Chen

This is done by modeling the differences between the two on categories with both image-level and bounding box annotations, and transferring this information to convert classifiers to detectors for categories without bounding box annotations.

Object object-detection +3

Large Scale Semi-Supervised Object Detection Using Visual and Semantic Knowledge Transfer

no code implementations CVPR 2016 Yu-Xing Tang, Josiah Wang, Boyang Gao, Emmanuel Dellandrea, Robert Gaizauskas, Liming Chen

This is done by modeling the differences between the two on categories with both image-level and bounding box annotations, and transferring this information to convert classifiers to detectors for categories without bounding box annotations.

Object object-detection +3

A Document Repository for Social Media and Speech Conversations

no code implementations LREC 2016 Adam Funk, Robert Gaizauskas, Benoit Favre

We present a successfully implemented document repository REST service for flexible SCRUD (search, crate, read, update, delete) storage of social media conversations, using a GATE/TIPSTER-like document object model and providing a query language for document features.

Cross-validating Image Description Datasets and Evaluation Metrics

no code implementations LREC 2016 Josiah Wang, Robert Gaizauskas

The task of automatically generating sentential descriptions of image content has become increasingly popular in recent years, resulting in the development of large-scale image description datasets and the proposal of various metrics for evaluating image description generation systems.

Sentence

What's the Issue Here?: Task-based Evaluation of Reader Comment Summarization Systems

no code implementations LREC 2016 Emma Barker, Monica Paramita, Adam Funk, Emina Kurtic, Ahmet Aker, Jonathan Foster, Mark Hepple, Robert Gaizauskas

Second, we define a task-based evaluation framework for reader comment summarization that allows summarization systems to be assessed in terms of how well they support users in a time-limited task of identifying issues and characterising opinion on issues in comments.

Clustering

Bootstrapping Term Extractors for Multiple Languages

no code implementations LREC 2014 Ahmet Aker, Monica Paramita, Emma Barker, Robert Gaizauskas

Terminology extraction resources are needed for a wide range of human language technology applications, including knowledge management, information extraction, semantic search, cross-language information retrieval and automatic and assisted translation.

Information Retrieval Management +4

Bilingual dictionaries for all EU languages

1 code implementation LREC 2014 Ahmet Aker, Monica Paramita, M{\=a}rcis Pinnis, Robert Gaizauskas

In this work we present three different methods for cleaning noise from automatically generated bilingual dictionaries: LLR, pivot and translation based approach.

Translation Transliteration

Correlation between Similarity Measures for Inter-Language Linked Wikipedia Articles

no code implementations LREC 2012 Monica Lestari Paramita, Paul Clough, Ahmet Aker, Robert Gaizauskas

In this work, we investigate the correlation between similarity measures utilising language-independent and language-dependent features and respective human judgments.

Image Captioning Information Retrieval +3

Assessing the Comparability of News Texts

no code implementations LREC 2012 Emma Barker, Robert Gaizauskas

Comparable news texts are frequently proposed as a potential source of alignable subsentential fragments for use in statistical machine translation systems.

Machine Translation Natural Language Inference +1

Cannot find the paper you are looking for? You can Submit a new open access paper.