no code implementations • 10 Jun 2023 • Mu Yang, Ram C. M. C. Shekar, Okim Kang, John H. L. Hansen
This study aims to understand and quantify how fine-tuning on an accent identification (AID) task changes the phoneme and prosody information encoded in a Self-Supervised Learning (SSL) model.
no code implementations • 19 Nov 2022 • Iván López-Espejo, Ram C. M. C. Shekar, Zheng-Hua Tan, Jesper Jensen, John H. L. Hansen
In the context of keyword spotting (KWS), the replacement of handcrafted speech features by learnable features has not yielded superior KWS performance.