no code implementations • 17 Jan 2022 • Achintya kr. Sarkar, Zheng-Hua Tan
Furthermore, we study a range of loss functions when speaker identity is used as the training target.
no code implementations • 25 Nov 2020 • Achintya kr. Sarkar, Zheng-Hua Tan
In this letter, we propose a vocal tract length (VTL) perturbation method for text-dependent speaker verification (TD-SV), in which a set of TD-SV systems are trained, one for each VTL factor, and score-level fusion is applied to make a final decision.
3 code implementations • 9 Jun 2019 • Zheng-Hua Tan, Achintya kr. Sarkar, Najim Dehak
In the end, a posteriori SNR weighted energy difference is applied to the extended pitch segments of the denoised speech signal for detecting voice activity.
no code implementations • 11 May 2019 • Achintya kr. Sarkar, Zheng-Hua Tan, Hao Tang, Suwon Shon, James Glass
There are a number of studies about extraction of bottleneck (BN) features from deep neural networks (DNNs)trained to discriminate speakers, pass-phrases and triphone states for improving the performance of text-dependent speaker verification (TD-SV).
Automatic Speech Recognition Automatic Speech Recognition (ASR) +4
no code implementations • 6 Apr 2017 • Achintya Kr. Sarkar, Zheng-Hua Tan
It is well-known that speech signals exhibit quasi-stationary behavior in and only in a short interval, and the TCL method aims to exploit this temporal structure.