no code implementations • 23 Jan 2024 • Md Asif Jalal, Pablo Peso Parada, George Pavlidis, Vasileios Moschopoulos, Karthikeyan Saravanan, Chrysovalantis-Giorgos Kontoulis, Jisi Zhang, Anastasios Drosou, Gil Ho Lee, Jungin Lee, Seokyeong Jung
During training, a list of biasing phrases are selected from a large pool of phrases following a sampling strategy.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
no code implementations • 22 Jan 2024 • Jisi Zhang, Vandana Rajan, Haaris Mehmood, David Tuckey, Pablo Peso Parada, Md Asif Jalal, Karthikeyan Saravanan, Gil Ho Lee, Jungin Lee, Seokyeong Jung
On-device Automatic Speech Recognition (ASR) models trained on speech data of a large population might underperform for individuals unseen during training.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
no code implementations • 25 Jul 2023 • Md Asif Jalal, Pablo Peso Parada, Jisi Zhang, Karthikeyan Saravanan, Mete Ozay, Myoungji Han, Jung In Lee, Seokyeong Jung
Our paper proposes a privacy-enhancing framework that targets speaker identity anonymization while preserving speech recognition accuracy for our downstream task~-~Automatic Speech Recognition (ASR).
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 30 Jun 2023 • Anna Ollerenshaw, Md Asif Jalal, Rosanna Milner, Thomas Hain
The benefits of using a distributed approach to speech emotion understanding are supported by the results of cross-corpora analysis experiments.
no code implementations • 1 Mar 2023 • Rehan Ahmad, Md Asif Jalal, Muhammad Umar Farooq, Anna Ollerenshaw, Thomas Hain
Knowledge distillation has widely been used for model compression and domain adaptation for speech applications.
no code implementations • 3 Nov 2022 • Anna Ollerenshaw, Md Asif Jalal, Thomas Hain
Instead, this paper proposes an approach to increase the model resolution capability using attention-based dynamic kernels in a convolutional neural network to adapt the model parameters to be feature-conditioned.
no code implementations • 3 Nov 2022 • Anna Ollerenshaw, Md Asif Jalal, Thomas Hain
End-to-End automatic speech recognition (ASR) models aim to learn a generalised speech representation to perform recognition.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 5 Jul 2022 • Rosanna Milner, Md Asif Jalal, Raymond W. M. Ng, Thomas Hain
This shows positive information transfer from acted datasets to those with more natural emotions and the benefits from training on different corpora.
no code implementations • 19 May 2022 • Anna Ollerenshaw, Md Asif Jalal, Thomas Hain
This paper analyses and explores the internal dynamics between layers during training with CNN, LSTM and Transformer based approaches using Canonical correlation analysis (CCA) and centered kernel alignment (CKA) for the experiments.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1