Search Results for author: Rao Ma

Found 12 papers, 2 papers with code

Investigating the Emergent Audio Classification Ability of ASR Foundation Models

1 code implementation • 15 Nov 2023 • Rao Ma, Adian Liusie, Mark J. F. Gales, Kate M. Knill

Text and vision foundation models can perform many tasks in a zero-shot setting, a desirable property that enables these systems to be applied in general and low-resource settings.

Audio Classification speech-recognition +3

Paper
Code

Towards End-to-End Spoken Grammatical Error Correction

no code implementations • 9 Nov 2023 • Stefano Bannò, Rao Ma, Mengjie Qian, Kate M. Knill, Mark J. F. Gales

This foundation model can be used to replace the whole framework or part of it, e. g., ASR and disfluency removal.

Grammatical Error Correction speech-recognition +1

Paper
Add Code

Zero-shot Audio Topic Reranking using Large Language Models

no code implementations • 14 Sep 2023 • Mengjie Qian, Rao Ma, Adian Liusie, Erfan Loweimi, Kate M. Knill, Mark J. F. Gales

A key element for this process is highly rapid, flexible, search to support large archives, which in MVSE is facilitated by representing video attributes by embeddings.

Information Retrieval Retrieval

Paper
Add Code

Adapting an ASR Foundation Model for Spoken Language Assessment

no code implementations • 13 Jul 2023 • Rao Ma, Mengjie Qian, Mark J. F. Gales, Kate M. Knill

Additionally, these models have a tendency to skip disfluencies and hesitations in the output.

Paper
Add Code

Can Generative Large Language Models Perform ASR Error Correction?

no code implementations • 9 Jul 2023 • Rao Ma, Mengjie Qian, Potsawee Manakul, Mark Gales, Kate Knill

In this paper we investigate using ChatGPT, a generative LLM, for ASR error correction.

speech-recognition Speech Recognition

Paper
Add Code

Adapting an Unadaptable ASR System

no code implementations • 1 Jun 2023 • Rao Ma, Mengjie Qian, Mark J. F. Gales, Kate M. Knill

As speech recognition model sizes and training data requirements grow, it is increasingly common for systems to only be available via APIs from online service providers rather than having direct access to models themselves.

speech-recognition Speech Recognition

Paper
Add Code

N-best T5: Robust ASR Error Correction using Multiple Input Hypotheses and Constrained Decoding Space

no code implementations • 1 Mar 2023 • Rao Ma, Mark J. F. Gales, Kate M. Knill, Mengjie Qian

Error correction models form an important part of Automatic Speech Recognition (ASR) post-processing to improve the readability and quality of transcriptions.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Paper
Add Code

Internal Language Model Estimation based Adaptive Language Model Fusion for Domain Adaptation

no code implementations • 2 Nov 2022 • Rao Ma, Xiaobo Wu, Jin Qiu, Yanan Qin, HaiHua Xu, Peihao Wu, Zejun Ma

The proposed method can achieve significantly better performance on the target test sets while it gets minimal performance degradation on the general test set, compared with both shallow and ILME-based LM fusion methods.

Domain Adaptation Language Modelling