Search Results for author: Achintya kr. Sarkar

Found 5 papers, 1 paper with code

On Training Targets and Activation Functions for Deep Representation Learning in Text-Dependent Speaker Verification

no code implementations • 17 Jan 2022 • Achintya kr. Sarkar, Zheng-Hua Tan

Furthermore, we study a range of loss functions when speaker identity is used as the training target.

Contrastive Learning • Representation Learning +1

Vocal Tract Length Perturbation for Text-Dependent Speaker Verification with Autoregressive Prediction Coding

no code implementations • 25 Nov 2020 • Achintya kr. Sarkar, Zheng-Hua Tan

In this letter, we propose a vocal tract length (VTL) perturbation method for text-dependent speaker verification (TD-SV), in which a set of TD-SV systems are trained, one for each VTL factor, and score-level fusion is applied to make a final decision.

Text-Dependent Speaker Verification

rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method

3 code implementations • 9 Jun 2019 • Zheng-Hua Tan, Achintya kr. Sarkar, Najim Dehak

In the end, a posteriori SNR weighted energy difference is applied to the extended pitch segments of the denoised speech signal for detecting voice activity.

Action Detection • Activity Detection +3

Time-Contrastive Learning Based Deep Bottleneck Features for Text-Dependent Speaker Verification

no code implementations • 11 May 2019 • Achintya kr. Sarkar, Zheng-Hua Tan, Hao Tang, Suwon Shon, James Glass

There are a number of studies about extraction of bottleneck (BN) features from deep neural networks (DNNs) trained to discriminate speakers, pass-phrases and triphone states for improving the performance of text-dependent speaker verification (TD-SV).

Automatic Speech Recognition (ASR) +4

Time-Contrastive Learning Based DNN Bottleneck Features for Text-Dependent Speaker Verification

no code implementations • 6 Apr 2017 • Achintya Kr. Sarkar, Zheng-Hua Tan

It is well-known that speech signals exhibit quasi-stationary behavior in and only in a short interval, and the TCL method aims to exploit this temporal structure.

Contrastive Learning • Text-Dependent Speaker Verification
