no code implementations • 15 Nov 2023 • Hassan Taherian, DeLiang Wang
To enhance ASR performance in conversational or meeting environments, continuous speaker separation (CSS) is commonly employed.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 16 Jan 2023 • Hassan Taherian, DeLiang Wang
The performance of automatic speech recognition (ASR) systems severely degrades when multi-talker speech overlap occurs.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +3
no code implementations • 5 Nov 2022 • Hassan Taherian, Sefik Emre Eskimez, Takuya Yoshioka
This prevents the PSE model from being too aggressive while still allowing the model to learn to suppress the input speech when it is likely to be spoken by interfering speakers.
no code implementations • 20 Oct 2021 • Hassan Taherian, Sefik Emre Eskimez, Takuya Yoshioka, Huaming Wang, Zhuo Chen, Xuedong Huang
Experimental results show that the proposed geometry agnostic model outperforms the model trained on a specific microphone array geometry in both speech quality and automatic speech recognition accuracy.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 8 Oct 2021 • Hassan Taherian, Ke Tan, DeLiang Wang
We further demonstrate the effectiveness of LBT for the separation of four and five concurrent speakers.
no code implementations • 17 Oct 2016 • Hassan Taherian
End-to-end attention-based models have been shown to be competitive alternatives to conventional DNN-HMM models in the Speech Recognition Systems.