no code implementations • 1 Apr 2024 • Ruijie Tao, Zhan Shi, Yidi Jiang, Tianchi Liu, Haizhou Li
Our experimental results on three created datasets demonstrated that VCA-NN effectively mitigates these dataset problems, which provides a new direction for handling the speaker recognition problems from the data aspect.
1 code implementation • 6 Dec 2023 • Tianchi Liu, Kong Aik Lee, Qiongqiong Wang, Haizhou Li
We represent the stride space on a trellis diagram, and conduct a systematic study on the impact of temporal and frequency resolutions on the performance and further identify two optimal points, namely Golden Gemini, which serves as a guiding principle for designing 2D ResNet-based speaker verification models.
no code implementations • 23 Feb 2023 • Qiongqiong Wang, Kong Aik Lee, Tianchi Liu
We propose a log-likelihood ratio function for the PLDA scoring with the uncertainty propagation.
no code implementations • 8 Apr 2022 • Qiongqiong Wang, Kong Aik Lee, Tianchi Liu
The emergence of large-margin softmax cross-entropy losses in training deep speaker embedding neural networks has triggered a gradual shift from parametric back-ends to a simpler cosine similarity measure for speaker verification.
no code implementations • 3 Feb 2022 • Tianchi Liu, Rohan Kumar Das, Kong Aik Lee, Haizhou Li
The time delay neural network (TDNN) represents one of the state-of-the-art of neural solutions to text-independent speaker verification.
no code implementations • 20 Aug 2020 • Tianchi Liu, Rohan Kumar Das, Maulik Madhavi, ShengMei Shen, Haizhou Li
The proposed SUDA features an attention mask mechanism to learn the interaction between the speaker and utterance information streams.