Search Results for author: Rao Ma

Found 12 papers, 2 papers with code

Investigating the Emergent Audio Classification Ability of ASR Foundation Models

1 code implementation15 Nov 2023 Rao Ma, Adian Liusie, Mark J. F. Gales, Kate M. Knill

Text and vision foundation models can perform many tasks in a zero-shot setting, a desirable property that enables these systems to be applied in general and low-resource settings.

Audio Classification speech-recognition +3

Towards End-to-End Spoken Grammatical Error Correction

no code implementations9 Nov 2023 Stefano Bannò, Rao Ma, Mengjie Qian, Kate M. Knill, Mark J. F. Gales

This foundation model can be used to replace the whole framework or part of it, e. g., ASR and disfluency removal.

Grammatical Error Correction speech-recognition +1

Zero-shot Audio Topic Reranking using Large Language Models

no code implementations14 Sep 2023 Mengjie Qian, Rao Ma, Adian Liusie, Erfan Loweimi, Kate M. Knill, Mark J. F. Gales

A key element for this process is highly rapid, flexible, search to support large archives, which in MVSE is facilitated by representing video attributes by embeddings.

Information Retrieval Retrieval

Adapting an ASR Foundation Model for Spoken Language Assessment

no code implementations13 Jul 2023 Rao Ma, Mengjie Qian, Mark J. F. Gales, Kate M. Knill

Additionally, these models have a tendency to skip disfluencies and hesitations in the output.

Adapting an Unadaptable ASR System

no code implementations1 Jun 2023 Rao Ma, Mengjie Qian, Mark J. F. Gales, Kate M. Knill

As speech recognition model sizes and training data requirements grow, it is increasingly common for systems to only be available via APIs from online service providers rather than having direct access to models themselves.

speech-recognition Speech Recognition

N-best T5: Robust ASR Error Correction using Multiple Input Hypotheses and Constrained Decoding Space

no code implementations1 Mar 2023 Rao Ma, Mark J. F. Gales, Kate M. Knill, Mengjie Qian

Error correction models form an important part of Automatic Speech Recognition (ASR) post-processing to improve the readability and quality of transcriptions.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Internal Language Model Estimation based Adaptive Language Model Fusion for Domain Adaptation

no code implementations2 Nov 2022 Rao Ma, Xiaobo Wu, Jin Qiu, Yanan Qin, HaiHua Xu, Peihao Wu, Zejun Ma

The proposed method can achieve significantly better performance on the target test sets while it gets minimal performance degradation on the general test set, compared with both shallow and ILME-based LM fusion methods.

Domain Adaptation Language Modelling

An Investigation on Different Underlying Quantization Schemes for Pre-trained Language Models

no code implementations14 Oct 2020 Zihan Zhao, Yuncong Liu, Lu Chen, Qi Liu, Rao Ma, Kai Yu

Recently, pre-trained language models like BERT have shown promising performance on multiple natural language processing tasks.

Clustering Quantization

Cannot find the paper you are looking for? You can Submit a new open access paper.