Search Results for author: Hassan Taherian

Found 6 papers, 0 papers with code

Multi-channel Conversational Speaker Separation via Neural Diarization

no code implementations • 15 Nov 2023 • Hassan Taherian, DeLiang Wang

To enhance ASR performance in conversational or meeting environments, continuous speaker separation (CSS) is commonly employed.

Automatic Speech Recognition (ASR) +2

Multi-resolution location-based training for multi-channel continuous speech separation

no code implementations • 16 Jan 2023 • Hassan Taherian, DeLiang Wang

The performance of automatic speech recognition (ASR) systems severely degrades when multi-talker speech overlap occurs.

Automatic Speech Recognition (ASR) +3

Breaking the trade-off in personalized speech enhancement with cross-task knowledge distillation

no code implementations • 5 Nov 2022 • Hassan Taherian, Sefik Emre Eskimez, Takuya Yoshioka

This prevents the personalized speech enhancement (PSE) model from being too aggressive while still allowing it to learn to suppress input speech that is likely to come from interfering speakers.

Knowledge Distillation, Speech Enhancement

One model to enhance them all: array geometry agnostic multi-channel personalized speech enhancement

no code implementations • 20 Oct 2021 • Hassan Taherian, Sefik Emre Eskimez, Takuya Yoshioka, Huaming Wang, Zhuo Chen, Xuedong Huang

Experimental results show that the proposed geometry-agnostic model outperforms a model trained on a specific microphone array geometry in both speech quality and automatic speech recognition accuracy.

Automatic Speech Recognition (ASR) +2

Location-based training for multi-channel talker-independent speaker separation

no code implementations • 8 Oct 2021 • Hassan Taherian, Ke Tan, DeLiang Wang

We further demonstrate the effectiveness of location-based training (LBT) for the separation of four and five concurrent speakers.

Speaker Separation

End-to-end attention-based distant speech recognition with Highway LSTM

no code implementations • 17 Oct 2016 • Hassan Taherian

End-to-end attention-based models have been shown to be competitive alternatives to conventional DNN-HMM models in speech recognition systems.

Distant Speech Recognition, Speech Recognition
