no code implementations • 16 Dec 2024 • Mengjie Qian, Kate Knill, Stefano Banno, Siyuan Tang, Penny Karanasou, Mark J. F. Gales, Diane Nicholls
Linked with the challenge, the Speak & Improve (S&I) Corpus 2025 is being pre-released, a dataset of L2 learner English data with holistic scores and language error annotation, collected from open (spontaneous) speaking tests on the Speak & Improve learning platform.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+2
no code implementations • 16 Dec 2024 • Kate Knill, Diane Nicholls, Mark J. F. Gales, Mengjie Qian, Pawel Stroinski
This enables a range of language-learning tasks to be examined, such as assessing speaking proficiency or providing feedback on grammatical errors in a learner's speech.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+2
no code implementations • 18 Aug 2024 • Stefano Bannò, Kate Knill, Mark J. F. Gales
Most research in computer-assisted language learning has focused on feedback through grammatical error correction (GEC) systems, rather than examining more holistic feedback that may be more useful for learners.
no code implementations • 9 Jul 2024 • Mengjie Qian, Siyuan Tang, Rao Ma, Kate M. Knill, Mark J. F. Gales
If only adaptation parameters are used, the language capabilities are maintained but at the cost of performance in the new language.
no code implementations • 1 May 2024 • Yassir Fathullah, Mark J. F. Gales
Encoder-decoder foundation models have displayed state-of-the-art performance on a range of autoregressive sequence tasks.
no code implementations • 29 Apr 2024 • Stefano Bannò, Hari Krishna Vydana, Kate M. Knill, Mark J. F. Gales
Automated essay scoring (AES) to evaluate second language (L2) proficiency has been a firmly established technology used in educational contexts for decades.
no code implementations • 28 Mar 2024 • Piotr Molenda, Adian Liusie, Mark J. F. Gales
Watermarking generative-AI systems, such as LLMs, has gained considerable interest, driven by their enhanced capabilities across a wide range of tasks.
no code implementations • 20 Mar 2024 • Adian Liusie, Yassir Fathullah, Mark J. F. Gales
Large Language Models (LLMs) have demonstrated impressive zero-shot capabilities and versatility in NLP tasks, however they sometimes fail to maintain crucial invariances for specific tasks.
1 code implementation • 15 Nov 2023 • Rao Ma, Adian Liusie, Mark J. F. Gales, Kate M. Knill
Text and vision foundation models can perform many tasks in a zero-shot setting, a desirable property that enables these systems to be applied in general and low-resource settings.
no code implementations • 9 Nov 2023 • Stefano Bannò, Rao Ma, Mengjie Qian, Kate M. Knill, Mark J. F. Gales
This foundation model can be used to replace the whole framework or part of it, e. g., ASR and disfluency removal.
no code implementations • 14 Sep 2023 • Mengjie Qian, Rao Ma, Adian Liusie, Erfan Loweimi, Kate M. Knill, Mark J. F. Gales
To gain a deeper understanding and further insights into the performance differences and limitations of these text sources, we employ a fact-checking approach to analyse the information consistency among them.
1 code implementation • 10 Sep 2023 • Adian Liusie, Potsawee Manakul, Mark J. F. Gales
To address this problem, it is possible to optimise classification thresholds on a labelled data set, however, this mitigates some of the advantages of prompt-based classifiers.
1 code implementation • 15 Jul 2023 • Adian Liusie, Potsawee Manakul, Mark J. F. Gales
Current developments in large language models (LLMs) have enabled impressive zero-shot capabilities across various natural language tasks.
no code implementations • 13 Jul 2023 • Rao Ma, Mengjie Qian, Mark J. F. Gales, Kate M. Knill
Additionally, these models have a tendency to skip disfluencies and hesitations in the output.
no code implementations • 22 Jun 2023 • Adian Liusie, Vatsal Raina, Andrew Mullooly, Kate Knill, Mark J. F. Gales
Multiple choice exams are widely used to assess candidates across a diverse range of domains and tasks.
no code implementations • 1 Jun 2023 • Rao Ma, Mengjie Qian, Mark J. F. Gales, Kate M. Knill
As speech recognition model sizes and training data requirements grow, it is increasingly common for systems to only be available via APIs from online service providers rather than having direct access to models themselves.
no code implementations • 21 May 2023 • Yassir Fathullah, Chunyang Wu, Yuan Shangguan, Junteng Jia, Wenhan Xiong, Jay Mahadeokar, Chunxi Liu, Yangyang Shi, Ozlem Kalinli, Mike Seltzer, Mark J. F. Gales
State space models (SSMs) have recently shown promising results on small-scale sequence and language modelling tasks, rivalling and outperforming many attention-based approaches.
Ranked #10 on
Speech Recognition
on LibriSpeech test-clean
no code implementations • 9 May 2023 • Yassir Fathullah, Puria Radmard, Adian Liusie, Mark J. F. Gales
In these scenarios, where for example knowing the quality of a system's output to predict poor performance prevails over knowing the output itself, is it possible to bypass the autoregressive decoding?
1 code implementation • 15 Mar 2023 • Potsawee Manakul, Adian Liusie, Mark J. F. Gales
In this work, we propose "SelfCheckGPT", a simple sampling-based approach that can be used to fact-check the responses of black-box models in a zero-resource fashion, i. e. without an external database.
no code implementations • 1 Mar 2023 • Rao Ma, Mark J. F. Gales, Kate M. Knill, Mengjie Qian
Error correction models form an important part of Automatic Speech Recognition (ASR) post-processing to improve the readability and quality of transcriptions.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+4
2 code implementations • 28 Jan 2023 • Potsawee Manakul, Adian Liusie, Mark J. F. Gales
In this work, we introduce an alternative scheme based on standard information-theoretic measures in which the information present in the source and summary is directly compared.
no code implementations • 16 Nov 2022 • Stefano Bannò, Kate M. Knill, Marco Matassoni, Vyas Raina, Mark J. F. Gales
Though the wav2vec 2. 0 based system is found to be sensitive to the nature of the response, it can be configured to yield comparable performance to systems requiring a speech transcription, and yields gains when appropriately combined with standard approaches.
1 code implementation • 28 Aug 2022 • Potsawee Manakul, Mark J. F. Gales
The podcast summary assessment data is available.
2 code implementations • 30 Jun 2022 • Andrey Malinin, Andreas Athanasopoulos, Muhamed Barakovic, Meritxell Bach Cuadra, Mark J. F. Gales, Cristina Granziera, Mara Graziani, Nikolay Kartashev, Konstantinos Kyriakopoulos, Po-Jui Lu, Nataliia Molchanova, Antonis Nikitakis, Vatsal Raina, Francesco La Rosa, Eli Sivena, Vasileios Tsarsitalidis, Efi Tsompopoulou, Elena Volf
This creates a need to be able to assess how robustly ML models generalize as well as the quality of their uncertainty estimates.
no code implementations • 15 Mar 2022 • Yassir Fathullah, Mark J. F. Gales
Furthermore it is possible to build ensembles of these models and apply hierarchical ensemble distillation approaches.
no code implementations • EMNLP 2021 • Potsawee Manakul, Mark J. F. Gales
Second, we propose a modified architecture that selects the subset of sentences to constrain the encoder-decoder attention.
3 code implementations • 15 Jul 2021 • Andrey Malinin, Neil Band, Ganshin, Alexander, German Chesnokov, Yarin Gal, Mark J. F. Gales, Alexey Noskov, Andrey Ploskonosov, Liudmila Prokhorenkova, Ivan Provilkov, Vatsal Raina, Vyas Raina, Roginskiy, Denis, Mariya Shmatova, Panos Tigas, Boris Yangel
However, many tasks of practical interest have different modalities, such as tabular data, audio, text, or sensor data, which offer significant challenges involving regression and discrete or continuous structured prediction.
Ranked #2 on
Weather Forecasting
on Shifts
no code implementations • 9 Jul 2021 • Vatsal Raina, Mark J. F. Gales
The SQA task considered in this paper is to extract the answer from a candidate$\text{'}$s spoken response to a question in a prompt-response style language assessment test.
1 code implementation • ACL 2021 • Potsawee Manakul, Mark J. F. Gales
Transformer-based models have achieved state-of-the-art results in a wide range of natural language processing (NLP) tasks including document summarization.
1 code implementation • 2 Apr 2021 • Qingyun Dou, Yiting Lu, Potsawee Manakul, Xixin Wu, Mark J. F. Gales
This approach guides the model with the generated output history and reference attention, and can reduce the training-inference mismatch without a schedule or a classifier.
no code implementations • 30 Sep 2019 • Lin-Lin Wang, Yu Wang, Mark J. F. Gales
These systems are explored for non-native spoken English data in this paper.
no code implementations • 26 Sep 2019 • Qingyun Dou, Yiting Lu, Joshua Efiong, Mark J. F. Gales
This paper introduces attention forcing, which guides the model with generated output history and reference attention.