Search Results for author: Hassan Taherian

Found 6 papers, 0 papers with code

Multi-channel Conversational Speaker Separation via Neural Diarization

no code implementations · 15 Nov 2023 · Hassan Taherian, DeLiang Wang

To enhance ASR performance in conversational or meeting environments, continuous speaker separation (CSS) is commonly employed.

Automatic Speech Recognition (ASR)

Breaking the trade-off in personalized speech enhancement with cross-task knowledge distillation

no code implementations · 5 Nov 2022 · Hassan Taherian, Sefik Emre Eskimez, Takuya Yoshioka

This prevents the PSE model from being too aggressive while still allowing the model to learn to suppress the input speech when it is likely to be spoken by interfering speakers.

Knowledge Distillation, Speech Enhancement

One model to enhance them all: array geometry agnostic multi-channel personalized speech enhancement

no code implementations · 20 Oct 2021 · Hassan Taherian, Sefik Emre Eskimez, Takuya Yoshioka, Huaming Wang, Zhuo Chen, Xuedong Huang

Experimental results show that the proposed geometry agnostic model outperforms the model trained on a specific microphone array geometry in both speech quality and automatic speech recognition accuracy.

Automatic Speech Recognition (ASR)

Location-based training for multi-channel talker-independent speaker separation

no code implementations · 8 Oct 2021 · Hassan Taherian, Ke Tan, DeLiang Wang

We further demonstrate the effectiveness of LBT for the separation of four and five concurrent speakers.

Speaker Separation

End-to-end attention-based distant speech recognition with Highway LSTM

no code implementations · 17 Oct 2016 · Hassan Taherian

End-to-end attention-based models have been shown to be competitive alternatives to conventional DNN-HMM models in speech recognition systems.

Distant Speech Recognition, Speech Recognition
