no code implementations • 14 Mar 2023 • Amirhossein Hajavi, Ali Etemad
With the ubiquity of smart devices that use speaker recognition (SR) systems as a means of authenticating individuals and personalizing their services, fairness of SR systems has becomes an important point of focus.
no code implementations • 6 Feb 2023 • Amirhossein Hajavi, Ali Etemad
In this work, we propose a novel approach for deep audio representation learning using audio-visual data when the video modality is absent at inference.
no code implementations • 28 Sep 2020 • Amirhossein Hajavi, Ali Etemad
Our model uses thin-ResNet for extracting speaker embeddings from utterances and a Siamese capsule network and dynamic routing as the Back-end to calculate a similarity score between the embeddings.
no code implementations • 23 Sep 2020 • Tedd Kourkounakis, Amirhossein Hajavi, Ali Etemad
We also evaluate FluentNet on this dataset, showing the strong performance of our model versus a number of benchmark techniques.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
no code implementations • 3 Sep 2020 • Amirhossein Hajavi, Ali Etemad
We evaluate the proposed model on three tasks of speaker recognition, speech emotion recognition, and spoken digit recognition.
no code implementations • IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2019 • Tedd Kourkounakis, Amirhossein Hajavi, Ali Etemad
Stuttering is a speech impediment affecting tens of millions of people on an everyday basis.
no code implementations • 22 Jul 2019 • Amirhossein Hajavi, Ali Etemad
Todays interactive devices such as smart-phone assistants and smart speakers often deal with short-duration speech segments.