Search Results for author: Yunqi Cai

Found 7 papers, 5 papers with code

Deep Speaker Vector Normalization with Maximum Gaussianality Training

1 code implementation30 Oct 2020 Yunqi Cai, Lantian Li, Dong Wang, Andrew Abel

In this paper, we argue that this problem is largely attributed to the maximum-likelihood (ML) training criterion of the DNF model, which aims to maximize the likelihood of the observations but not necessarily improve the Gaussianality of the latent codes.

Speaker Recognition

Deep generative LDA

1 code implementation30 Oct 2020 Yunqi Cai, Dong Wang

Limited by its linear form and the underlying Gaussian assumption, however, LDA is not applicable in situations where the data distribution is complex.

Dimensionality Reduction Speaker Recognition

Deep generative factorization for speech signal

no code implementations27 Oct 2020 Haoran Sun, Lantian Li, Yunqi Cai, Yang Zhang, Thomas Fang Zheng, Dong Wang

Various information factors are blended in speech signals, which forms the primary difficulty for most speech information processing tasks.

Domain-Invariant Speaker Vector Projection by Model-Agnostic Meta-Learning

1 code implementation25 May 2020 Jiawen Kang, Ruiqi Liu, Lantian Li, Yunqi Cai, Dong Wang, Thomas Fang Zheng

Domain generalization remains a critical problem for speaker recognition, even with the state-of-the-art architectures based on deep neural nets.

Audio and Speech Processing

Deep Normalization for Speaker Vectors

1 code implementation7 Apr 2020 Yunqi Cai, Lantian Li, Dong Wang, Andrew Abel

Deep speaker embedding has demonstrated state-of-the-art performance in speaker recognition tasks.

Speaker Recognition

CN-CELEB: a challenging Chinese speaker recognition dataset

2 code implementations31 Oct 2019 Yue Fan, Jiawen Kang, Lantian Li, Kaicheng Li, Haolin Chen, Sitong Cheng, Pengyuan Zhang, Ziya Zhou, Yunqi Cai, Dong Wang

These datasets tend to deliver over optimistic performance and do not meet the request of research on speaker recognition in unconstrained conditions.

Speaker Recognition

On Investigation of Unsupervised Speech Factorization Based on Normalization Flow

no code implementations29 Oct 2019 Haoran Sun, Yunqi Cai, Lantian Li, Dong Wang

Speech signals are complex composites of various information, including phonetic content, speaker traits, channel effect, etc.

Cannot find the paper you are looking for? You can Submit a new open access paper.