Search Results for author: Mark J. F. Gales

Found 26 papers, 10 papers with code

WaterJudge: Quality-Detection Trade-off when Watermarking Large Language Models

no code implementations28 Mar 2024 Piotr Molenda, Adian Liusie, Mark J. F. Gales

Watermarking generative-AI systems, such as LLMs, has gained considerable interest, driven by their enhanced capabilities across a wide range of tasks.

nlg evaluation

Teacher-Student Training for Debiasing: General Permutation Debiasing for Large Language Models

no code implementations20 Mar 2024 Adian Liusie, Yassir Fathullah, Mark J. F. Gales

Large Language Models (LLMs) have demonstrated impressive zero-shot capabilities and versatility in NLP tasks, however they sometimes fail to maintain crucial invariances for specific tasks.

Investigating the Emergent Audio Classification Ability of ASR Foundation Models

1 code implementation15 Nov 2023 Rao Ma, Adian Liusie, Mark J. F. Gales, Kate M. Knill

Text and vision foundation models can perform many tasks in a zero-shot setting, a desirable property that enables these systems to be applied in general and low-resource settings.

Audio Classification speech-recognition +3

Towards End-to-End Spoken Grammatical Error Correction

no code implementations9 Nov 2023 Stefano Bannò, Rao Ma, Mengjie Qian, Kate M. Knill, Mark J. F. Gales

This foundation model can be used to replace the whole framework or part of it, e. g., ASR and disfluency removal.

Grammatical Error Correction speech-recognition +1

Zero-shot Audio Topic Reranking using Large Language Models

no code implementations14 Sep 2023 Mengjie Qian, Rao Ma, Adian Liusie, Erfan Loweimi, Kate M. Knill, Mark J. F. Gales

A key element for this process is highly rapid, flexible, search to support large archives, which in MVSE is facilitated by representing video attributes by embeddings.

Information Retrieval Retrieval

Mitigating Word Bias in Zero-shot Prompt-based Classifiers

1 code implementation10 Sep 2023 Adian Liusie, Potsawee Manakul, Mark J. F. Gales

To address this problem, it is possible to optimise classification thresholds on a labelled data set, however, this mitigates some of the advantages of prompt-based classifiers.

Zero-Shot Learning

LLM Comparative Assessment: Zero-shot NLG Evaluation through Pairwise Comparisons using Large Language Models

1 code implementation15 Jul 2023 Adian Liusie, Potsawee Manakul, Mark J. F. Gales

Current developments in large language models (LLMs) have enabled impressive zero-shot capabilities across various natural language tasks.

nlg evaluation Response Generation +1

Adapting an ASR Foundation Model for Spoken Language Assessment

no code implementations13 Jul 2023 Rao Ma, Mengjie Qian, Mark J. F. Gales, Kate M. Knill

Additionally, these models have a tendency to skip disfluencies and hesitations in the output.

Adapting an Unadaptable ASR System

no code implementations1 Jun 2023 Rao Ma, Mengjie Qian, Mark J. F. Gales, Kate M. Knill

As speech recognition model sizes and training data requirements grow, it is increasingly common for systems to only be available via APIs from online service providers rather than having direct access to models themselves.

speech-recognition Speech Recognition

Multi-Head State Space Model for Speech Recognition

no code implementations21 May 2023 Yassir Fathullah, Chunyang Wu, Yuan Shangguan, Junteng Jia, Wenhan Xiong, Jay Mahadeokar, Chunxi Liu, Yangyang Shi, Ozlem Kalinli, Mike Seltzer, Mark J. F. Gales

State space models (SSMs) have recently shown promising results on small-scale sequence and language modelling tasks, rivalling and outperforming many attention-based approaches.

Language Modelling speech-recognition +1

Who Needs Decoders? Efficient Estimation of Sequence-level Attributes

no code implementations9 May 2023 Yassir Fathullah, Puria Radmard, Adian Liusie, Mark J. F. Gales

In these scenarios, where for example knowing the quality of a system's output to predict poor performance prevails over knowing the output itself, is it possible to bypass the autoregressive decoding?

Attribute Automatic Speech Recognition +4

SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models

2 code implementations15 Mar 2023 Potsawee Manakul, Adian Liusie, Mark J. F. Gales

In this work, we propose "SelfCheckGPT", a simple sampling-based approach that can be used to fact-check the responses of black-box models in a zero-resource fashion, i. e. without an external database.

Fact Checking Hallucination +1

N-best T5: Robust ASR Error Correction using Multiple Input Hypotheses and Constrained Decoding Space

no code implementations1 Mar 2023 Rao Ma, Mark J. F. Gales, Kate M. Knill, Mengjie Qian

Error correction models form an important part of Automatic Speech Recognition (ASR) post-processing to improve the readability and quality of transcriptions.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

MQAG: Multiple-choice Question Answering and Generation for Assessing Information Consistency in Summarization

2 code implementations28 Jan 2023 Potsawee Manakul, Adian Liusie, Mark J. F. Gales

In this work, we introduce an alternative scheme based on standard information-theoretic measures in which the information present in the source and summary is directly compared.

Hallucination Multiple-choice +1

L2 proficiency assessment using self-supervised speech representations

no code implementations16 Nov 2022 Stefano Bannò, Kate M. Knill, Marco Matassoni, Vyas Raina, Mark J. F. Gales

Though the wav2vec 2. 0 based system is found to be sensitive to the nature of the response, it can be configured to yield comparable performance to systems requiring a speech transcription, and yields gains when appropriately combined with standard approaches.

speech-recognition Speech Recognition

Self-Distribution Distillation: Efficient Uncertainty Estimation

no code implementations15 Mar 2022 Yassir Fathullah, Mark J. F. Gales

Furthermore it is possible to build ensembles of these models and apply hierarchical ensemble distillation approaches.

Out-of-Distribution Detection

Shifts: A Dataset of Real Distributional Shift Across Multiple Large-Scale Tasks

3 code implementations15 Jul 2021 Andrey Malinin, Neil Band, Ganshin, Alexander, German Chesnokov, Yarin Gal, Mark J. F. Gales, Alexey Noskov, Andrey Ploskonosov, Liudmila Prokhorenkova, Ivan Provilkov, Vatsal Raina, Vyas Raina, Roginskiy, Denis, Mariya Shmatova, Panos Tigas, Boris Yangel

However, many tasks of practical interest have different modalities, such as tabular data, audio, text, or sensor data, which offer significant challenges involving regression and discrete or continuous structured prediction.

Image Classification Machine Translation +5

An Initial Investigation of Non-Native Spoken Question-Answering

no code implementations9 Jul 2021 Vatsal Raina, Mark J. F. Gales

The SQA task considered in this paper is to extract the answer from a candidate$\text{'}$s spoken response to a question in a prompt-response style language assessment test.

Question Answering Reading Comprehension +1

Long-Span Summarization via Local Attention and Content Selection

1 code implementation ACL 2021 Potsawee Manakul, Mark J. F. Gales

Transformer-based models have achieved state-of-the-art results in a wide range of natural language processing (NLP) tasks including document summarization.

Abstractive Text Summarization Document Summarization

Attention Forcing for Machine Translation

1 code implementation2 Apr 2021 Qingyun Dou, Yiting Lu, Potsawee Manakul, Xixin Wu, Mark J. F. Gales

This approach guides the model with the generated output history and reference attention, and can reduce the training-inference mismatch without a schedule or a classifier.

Machine Translation NMT +1

Attention Forcing for Sequence-to-sequence Model Training

no code implementations26 Sep 2019 Qingyun Dou, Yiting Lu, Joshua Efiong, Mark J. F. Gales

This paper introduces attention forcing, which guides the model with generated output history and reference attention.

Machine Translation Speech Synthesis +2

Cannot find the paper you are looking for? You can Submit a new open access paper.