no code implementations • 30 Jun 2023 • Anna Ollerenshaw, Md Asif Jalal, Rosanna Milner, Thomas Hain
The benefits of using a distributed approach to speech emotion understanding are supported by the results of cross-corpora analysis experiments.
no code implementations • 1 Mar 2023 • Rehan Ahmad, Md Asif Jalal, Muhammad Umar Farooq, Anna Ollerenshaw, Thomas Hain
Knowledge distillation has been widely used for model compression and domain adaptation in speech applications.
no code implementations • 3 Nov 2022 • Anna Ollerenshaw, Md Asif Jalal, Thomas Hain
This paper proposes an approach to increase the model resolution capability by using attention-based dynamic kernels in a convolutional neural network, so that the model parameters become feature-conditioned.
no code implementations • 3 Nov 2022 • Anna Ollerenshaw, Md Asif Jalal, Thomas Hain
End-to-end automatic speech recognition (ASR) models aim to learn a generalised speech representation to perform recognition.
Automatic Speech Recognition (ASR)
no code implementations • 19 May 2022 • Anna Ollerenshaw, Md Asif Jalal, Thomas Hain
This paper analyses the internal dynamics between layers during training of CNN-, LSTM- and Transformer-based models, using canonical correlation analysis (CCA) and centered kernel alignment (CKA).
Automatic Speech Recognition (ASR)
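
For readers unfamiliar with centered kernel alignment (CKA), named in the 19 May 2022 entry above, the following is a minimal sketch of the linear variant of CKA for comparing two sets of layer activations. It is an illustration only: the listing does not state which CKA formulation the paper uses, and the function name `linear_cka` and the random activation matrices are hypothetical.

```python
import numpy as np


def linear_cka(x: np.ndarray, y: np.ndarray) -> float:
    """Linear CKA between two activation matrices.

    x has shape (n_examples, n_features_x) and y has shape
    (n_examples, n_features_y); rows must correspond to the same inputs.
    NOTE: hypothetical helper for illustration, not the paper's code.
    """
    # Centre each feature (column) so the comparison ignores mean offsets.
    x = x - x.mean(axis=0, keepdims=True)
    y = y - y.mean(axis=0, keepdims=True)

    # Squared Frobenius norm of the cross-covariance (unnormalised HSIC).
    cross = np.linalg.norm(y.T @ x, ord="fro") ** 2

    # Normalisation terms from each representation's own covariance.
    norm_x = np.linalg.norm(x.T @ x, ord="fro")
    norm_y = np.linalg.norm(y.T @ y, ord="fro")

    return float(cross / (norm_x * norm_y))


# Usage with made-up activations: 50 examples, two layers of different width.
rng = np.random.default_rng(0)
layer_a = rng.standard_normal((50, 8))
layer_b = rng.standard_normal((50, 16))
similarity = linear_cka(layer_a, layer_b)
```

The score lies between 0 and 1, and it is unchanged by orthogonal transformations and isotropic scaling of either representation, which is why it can compare layers of different widths.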