Search Results for author: Mark J. F. Gales

Found 26 papers, 10 papers with code

WaterJudge: Quality-Detection Trade-off when Watermarking Large Language Models

no code implementations • 28 Mar 2024 • Piotr Molenda, Adian Liusie, Mark J. F. Gales

Watermarking generative-AI systems, such as LLMs, has gained considerable interest, driven by their enhanced capabilities across a wide range of tasks.

nlg evaluation

Paper
Add Code

Teacher-Student Training for Debiasing: General Permutation Debiasing for Large Language Models

no code implementations • 20 Mar 2024 • Adian Liusie, Yassir Fathullah, Mark J. F. Gales

Large Language Models (LLMs) have demonstrated impressive zero-shot capabilities and versatility in NLP tasks, however they sometimes fail to maintain crucial invariances for specific tasks.

Paper
Add Code

Investigating the Emergent Audio Classification Ability of ASR Foundation Models

1 code implementation • 15 Nov 2023 • Rao Ma, Adian Liusie, Mark J. F. Gales, Kate M. Knill

Text and vision foundation models can perform many tasks in a zero-shot setting, a desirable property that enables these systems to be applied in general and low-resource settings.

Audio Classification speech-recognition +3

Paper
Code

Towards End-to-End Spoken Grammatical Error Correction

no code implementations • 9 Nov 2023 • Stefano Bannò, Rao Ma, Mengjie Qian, Kate M. Knill, Mark J. F. Gales

This foundation model can be used to replace the whole framework or part of it, e. g., ASR and disfluency removal.

Grammatical Error Correction speech-recognition +1

Paper
Add Code

Zero-shot Audio Topic Reranking using Large Language Models

no code implementations • 14 Sep 2023 • Mengjie Qian, Rao Ma, Adian Liusie, Erfan Loweimi, Kate M. Knill, Mark J. F. Gales

A key element for this process is highly rapid, flexible, search to support large archives, which in MVSE is facilitated by representing video attributes by embeddings.

Information Retrieval Retrieval

Paper
Add Code

Mitigating Word Bias in Zero-shot Prompt-based Classifiers

1 code implementation • 10 Sep 2023 • Adian Liusie, Potsawee Manakul, Mark J. F. Gales

To address this problem, it is possible to optimise classification thresholds on a labelled data set, however, this mitigates some of the advantages of prompt-based classifiers.

Zero-Shot Learning

Paper
Code

LLM Comparative Assessment: Zero-shot NLG Evaluation through Pairwise Comparisons using Large Language Models

1 code implementation • 15 Jul 2023 • Adian Liusie, Potsawee Manakul, Mark J. F. Gales

Current developments in large language models (LLMs) have enabled impressive zero-shot capabilities across various natural language tasks.

nlg evaluation Response Generation +1

Paper
Code

Adapting an ASR Foundation Model for Spoken Language Assessment

no code implementations • 13 Jul 2023 • Rao Ma, Mengjie Qian, Mark J. F. Gales, Kate M. Knill

Additionally, these models have a tendency to skip disfluencies and hesitations in the output.

Paper
Add Code

Analysis of the Cambridge Multiple-Choice Questions Reading Dataset with a Focus on Candidate Response Distribution

no code implementations • 22 Jun 2023 • Adian Liusie, Vatsal Raina, Andrew Mullooly, Kate Knill, Mark J. F. Gales

Multiple choice exams are widely used to assess candidates across a diverse range of domains and tasks.

Multiple-choice

Paper
Add Code

Adapting an Unadaptable ASR System

no code implementations • 1 Jun 2023 • Rao Ma, Mengjie Qian, Mark J. F. Gales, Kate M. Knill

As speech recognition model sizes and training data requirements grow, it is increasingly common for systems to only be available via APIs from online service providers rather than having direct access to models themselves.

speech-recognition Speech Recognition

Paper
Add Code

Multi-Head State Space Model for Speech Recognition

no code implementations • 21 May 2023 • Yassir Fathullah, Chunyang Wu, Yuan Shangguan, Junteng Jia, Wenhan Xiong, Jay Mahadeokar, Chunxi Liu, Yangyang Shi, Ozlem Kalinli, Mike Seltzer, Mark J. F. Gales

State space models (SSMs) have recently shown promising results on small-scale sequence and language modelling tasks, rivalling and outperforming many attention-based approaches.

Ranked #8 on Speech Recognition on LibriSpeech test-clean

Language Modelling speech-recognition +1

Paper
Add Code

Who Needs Decoders? Efficient Estimation of Sequence-level Attributes

no code implementations • 9 May 2023 • Yassir Fathullah, Puria Radmard, Adian Liusie, Mark J. F. Gales

In these scenarios, where for example knowing the quality of a system's output to predict poor performance prevails over knowing the output itself, is it possible to bypass the autoregressive decoding?

Attribute Automatic Speech Recognition +4

Paper
Add Code

SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models

2 code implementations • 15 Mar 2023 • Potsawee Manakul, Adian Liusie, Mark J. F. Gales

In this work, we propose "SelfCheckGPT", a simple sampling-based approach that can be used to fact-check the responses of black-box models in a zero-resource fashion, i. e. without an external database.

Fact Checking Hallucination +1

357

Paper
Code

N-best T5: Robust ASR Error Correction using Multiple Input Hypotheses and Constrained Decoding Space

no code implementations • 1 Mar 2023 • Rao Ma, Mark J. F. Gales, Kate M. Knill, Mengjie Qian

Error correction models form an important part of Automatic Speech Recognition (ASR) post-processing to improve the readability and quality of transcriptions.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Paper
Add Code

MQAG: Multiple-choice Question Answering and Generation for Assessing Information Consistency in Summarization

2 code implementations • 28 Jan 2023 • Potsawee Manakul, Adian Liusie, Mark J. F. Gales

In this work, we introduce an alternative scheme based on standard information-theoretic measures in which the information present in the source and summary is directly compared.

Hallucination Multiple-choice +1

357

Paper
Code

L2 proficiency assessment using self-supervised speech representations

no code implementations • 16 Nov 2022 • Stefano Bannò, Kate M. Knill, Marco Matassoni, Vyas Raina, Mark J. F. Gales

Though the wav2vec 2. 0 based system is found to be sensitive to the nature of the response, it can be configured to yield comparable performance to systems requiring a speech transcription, and yields gains when appropriately combined with standard approaches.

speech-recognition Speech Recognition

Paper
Add Code

Podcast Summary Assessment: A Resource for Evaluating Summary Assessment Methods

1 code implementation • 28 Aug 2022 • Potsawee Manakul, Mark J. F. Gales

The podcast summary assessment data is available.

News Summarization

Paper
Code

Shifts 2.0: Extending The Dataset of Real Distributional Shifts

2 code implementations • 30 Jun 2022 • Andrey Malinin, Andreas Athanasopoulos, Muhamed Barakovic, Meritxell Bach Cuadra, Mark J. F. Gales, Cristina Granziera, Mara Graziani, Nikolay Kartashev, Konstantinos Kyriakopoulos, Po-Jui Lu, Nataliia Molchanova, Antonis Nikitakis, Vatsal Raina, Francesco La Rosa, Eli Sivena, Vasileios Tsarsitalidis, Efi Tsompopoulou, Elena Volf

This creates a need to be able to assess how robustly ML models generalize as well as the quality of their uncertainty estimates.

Autonomous Driving Image Classification +3

224

Paper
Code

Self-Distribution Distillation: Efficient Uncertainty Estimation

no code implementations • 15 Mar 2022 • Yassir Fathullah, Mark J. F. Gales

Furthermore it is possible to build ensembles of these models and apply hierarchical ensemble distillation approaches.

Out-of-Distribution Detection

Paper
Add Code

Sparsity and Sentence Structure in Encoder-Decoder Attention of Summarization Systems

no code implementations • EMNLP 2021 • Potsawee Manakul, Mark J. F. Gales

Second, we propose a modified architecture that selects the subset of sentences to constrain the encoder-decoder attention.

Abstractive Text Summarization Document Summarization +1

Paper
Add Code

Shifts: A Dataset of Real Distributional Shift Across Multiple Large-Scale Tasks

3 code implementations • 15 Jul 2021 • Andrey Malinin, Neil Band, Ganshin, Alexander, German Chesnokov, Yarin Gal, Mark J. F. Gales, Alexey Noskov, Andrey Ploskonosov, Liudmila Prokhorenkova, Ivan Provilkov, Vatsal Raina, Vyas Raina, Roginskiy, Denis, Mariya Shmatova, Panos Tigas, Boris Yangel

However, many tasks of practical interest have different modalities, such as tabular data, audio, text, or sensor data, which offer significant challenges involving regression and discrete or continuous structured prediction.

Ranked #2 on Weather Forecasting on Shifts

Image Classification Machine Translation +5

224

Paper
Code

An Initial Investigation of Non-Native Spoken Question-Answering

no code implementations • 9 Jul 2021 • Vatsal Raina, Mark J. F. Gales

The SQA task considered in this paper is to extract the answer from a candidate$\text{'}$s spoken response to a question in a prompt-response style language assessment test.

Question Answering Reading Comprehension +1

Paper
Add Code

Long-Span Summarization via Local Attention and Content Selection

1 code implementation • ACL 2021 • Potsawee Manakul, Mark J. F. Gales

Transformer-based models have achieved state-of-the-art results in a wide range of natural language processing (NLP) tasks including document summarization.

Abstractive Text Summarization Document Summarization

Paper
Code

Attention Forcing for Machine Translation

1 code implementation • 2 Apr 2021 • Qingyun Dou, Yiting Lu, Potsawee Manakul, Xixin Wu, Mark J. F. Gales

This approach guides the model with the generated output history and reference attention, and can reduce the training-inference mismatch without a schedule or a classifier.

Machine Translation NMT +1

Paper
Code

Non-native Speaker Verification for Spoken Language Assessment

no code implementations • 30 Sep 2019 • Lin-Lin Wang, Yu Wang, Mark J. F. Gales

These systems are explored for non-native spoken English data in this paper.

Speaker Verification

Paper
Add Code

Attention Forcing for Sequence-to-sequence Model Training

no code implementations • 26 Sep 2019 • Qingyun Dou, Yiting Lu, Joshua Efiong, Mark J. F. Gales

This paper introduces attention forcing, which guides the model with generated output history and reference attention.

Machine Translation Speech Synthesis +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.