Search Results for author: Robert Gaizauskas

Found 30 papers, 2 papers with code

SNuC: The Sheffield Numbers Spoken Language Corpus

no code implementations LREC 2022 Emma Barker, Jon Barker, Robert Gaizauskas, Ning Ma, Monica Lestari Paramita

We present SNuC, the first published corpus of spoken alphanumeric identifiers of the sort typically used as serial and part numbers in the manufacturing sector.

A Language Modelling Approach to Quality Assessment of OCR’ed Historical Text

no code implementations LREC 2022 Callum Booth, Robert Shoemaker, Robert Gaizauskas

We hypothesise and evaluate a language model-based approach for scoring the quality of OCR transcriptions in the British Library Newspapers (BLN) corpus parts 1 and 2, to identify the best quality OCR for use in further natural language processing tasks, with a wider view to link individual newspaper reports of crime in nineteenth-century London to the Digital Panopticon—a structured repository of criminal lives.

Language Modelling Optical Character Recognition (OCR)

Visual and Semantic Knowledge Transfer for Large Scale Semi-supervised Object Detection

no code implementations 9 Jan 2018 Yu-Xing Tang, Josiah Wang, Xiaofang Wang, Boyang Gao, Emmanuel Dellandrea, Robert Gaizauskas, Liming Chen

This is done by modeling the differences between the two on categories with both image-level and bounding box annotations, and transferring this information to convert classifiers to detectors for categories without bounding box annotations.

Object object-detection +3

Large Scale Semi-Supervised Object Detection Using Visual and Semantic Knowledge Transfer

no code implementations CVPR 2016 Yu-Xing Tang, Josiah Wang, Boyang Gao, Emmanuel Dellandrea, Robert Gaizauskas, Liming Chen

This is done by modeling the differences between the two on categories with both image-level and bounding box annotations, and transferring this information to convert classifiers to detectors for categories without bounding box annotations.

Object object-detection +3

A Document Repository for Social Media and Speech Conversations

no code implementations LREC 2016 Adam Funk, Robert Gaizauskas, Benoit Favre

We present a successfully implemented document repository REST service for flexible SCRUD (search, create, read, update, delete) storage of social media conversations, using a GATE/TIPSTER-like document object model and providing a query language for document features.

What's the Issue Here?: Task-based Evaluation of Reader Comment Summarization Systems

no code implementations LREC 2016 Emma Barker, Monica Paramita, Adam Funk, Emina Kurtic, Ahmet Aker, Jonathan Foster, Mark Hepple, Robert Gaizauskas

Second, we define a task-based evaluation framework for reader comment summarization that allows summarization systems to be assessed in terms of how well they support users in a time-limited task of identifying issues and characterising opinion on issues in comments.

Clustering

Cross-validating Image Description Datasets and Evaluation Metrics

no code implementations LREC 2016 Josiah Wang, Robert Gaizauskas

The task of automatically generating sentential descriptions of image content has become increasingly popular in recent years, resulting in the development of large-scale image description datasets and the proposal of various metrics for evaluating image description generation systems.

Sentence

Bootstrapping Term Extractors for Multiple Languages

no code implementations LREC 2014 Ahmet Aker, Monica Paramita, Emma Barker, Robert Gaizauskas

Terminology extraction resources are needed for a wide range of human language technology applications, including knowledge management, information extraction, semantic search, cross-language information retrieval and automatic and assisted translation.

Information Retrieval Management +4

Bilingual dictionaries for all EU languages

1 code implementation LREC 2014 Ahmet Aker, Monica Paramita, Mārcis Pinnis, Robert Gaizauskas

In this work we present three different methods for cleaning noise from automatically generated bilingual dictionaries: LLR-based, pivot-based and translation-based approaches.

Translation Transliteration

Correlation between Similarity Measures for Inter-Language Linked Wikipedia Articles

no code implementations LREC 2012 Monica Lestari Paramita, Paul Clough, Ahmet Aker, Robert Gaizauskas

In this work, we investigate the correlation between similarity measures utilising language-independent and language-dependent features and respective human judgments.

Image Captioning Information Retrieval +3

Assessing the Comparability of News Texts

no code implementations LREC 2012 Emma Barker, Robert Gaizauskas

Comparable news texts are frequently proposed as a potential source of alignable subsentential fragments for use in statistical machine translation systems.

Machine Translation Natural Language Inference +1
