Search Results for author: Amirhossein Hajavi

Found 7 papers, 0 papers with code

A Study on Bias and Fairness In Deep Speaker Recognition

no code implementations • 14 Mar 2023 • Amirhossein Hajavi, Ali Etemad

With the ubiquity of smart devices that use speaker recognition (SR) systems as a means of authenticating individuals and personalizing their services, fairness of SR systems has becomes an important point of focus.

Fairness Speaker Recognition

Paper
Add Code

Audio Representation Learning by Distilling Video as Privileged Information

no code implementations • 6 Feb 2023 • Amirhossein Hajavi, Ali Etemad

In this work, we propose a novel approach for deep audio representation learning using audio-visual data when the video modality is absent at inference.

Knowledge Distillation Representation Learning +2

Paper
Add Code

Siamese Capsule Network for End-to-End Speaker Recognition In The Wild

no code implementations • 28 Sep 2020 • Amirhossein Hajavi, Ali Etemad

Our model uses thin-ResNet for extracting speaker embeddings from utterances and a Siamese capsule network and dynamic routing as the Back-end to calculate a similarity score between the embeddings.

Speaker Recognition Speaker Verification

Paper
Add Code

FluentNet: End-to-End Detection of Speech Disfluency with Deep Learning

no code implementations • 23 Sep 2020 • Tedd Kourkounakis, Amirhossein Hajavi, Ali Etemad

We also evaluate FluentNet on this dataset, showing the strong performance of our model versus a number of benchmark techniques.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

Fine-grained Early Frequency Attention for Deep Speaker Representation Learning

no code implementations • 3 Sep 2020 • Amirhossein Hajavi, Ali Etemad

We evaluate the proposed model on three tasks of speaker recognition, speech emotion recognition, and spoken digit recognition.

Representation Learning Speaker Recognition +3

Paper
Add Code

Detecting Multiple Speech Disfluencies using a Deep Residual Network with Bidirectional Long Short-Term Memory

no code implementations • IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2019 • Tedd Kourkounakis, Amirhossein Hajavi, Ali Etemad

Stuttering is a speech impediment affecting tens of millions of people on an everyday basis.

General Classification speech-recognition +1

Paper
Add Code

A Deep Neural Network for Short-Segment Speaker Recognition

no code implementations • 22 Jul 2019 • Amirhossein Hajavi, Ali Etemad

Todays interactive devices such as smart-phone assistants and smart speakers often deal with short-duration speech segments.

Speaker Recognition

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.