Search Results for author: Qiongqiong Wang

Found 10 papers, 1 paper with code

Cosine Scoring with Uncertainty for Neural Speaker Embedding

no code implementations • 11 Mar 2024 • Qiongqiong Wang, Kong Aik Lee

Uncertainty modeling in speaker representation aims to learn the variability present in speech utterances.

Golden Gemini is All You Need: Finding the Sweet Spots for Speaker Verification

1 code implementation • 6 Dec 2023 • Tianchi Liu, Kong Aik Lee, Qiongqiong Wang, Haizhou Li

We represent the stride space on a trellis diagram, and conduct a systematic study on the impact of temporal and frequency resolutions on the performance and further identify two optimal points, namely Golden Gemini, which serves as a guiding principle for designing 2D ResNet-based speaker verification models.

Speaker Verification

Generalized domain adaptation framework for parametric back-end in speaker recognition

no code implementations • 24 May 2023 • Qiongqiong Wang, Koji Okabe, Kong Aik Lee, Takafumi Koshinaka

The efficacy of the proposed techniques has been experimentally validated on NIST 2016, 2018, and 2019 Speaker Recognition Evaluation (SRE'16, SRE'18, and SRE'19) datasets.

Speaker Recognition • Unsupervised Domain Adaptation

Incorporating Uncertainty from Speaker Embedding Estimation to Speaker Verification

no code implementations • 23 Feb 2023 • Qiongqiong Wang, Kong Aik Lee, Tianchi Liu

We propose a log-likelihood ratio function for the PLDA scoring with the uncertainty propagation.

Speaker Verification

I4U System Description for NIST SRE'20 CTS Challenge

no code implementations • 2 Nov 2022 • Kong Aik Lee, Tomi Kinnunen, Daniele Colibro, Claudio Vair, Andreas Nautsch, Hanwu Sun, Liang He, Tianyu Liang, Qiongqiong Wang, Mickael Rouvier, Pierre-Michel Bousquet, Rohan Kumar Das, Ignacio Viñals Bailo, Meng Liu, Héctor Delgado, Xuechen Liu, Md Sahidullah, Sandro Cumani, Boning Zhang, Koji Okabe, Hitoshi Yamamoto, Ruijie Tao, Haizhou Li, Alfonso Ortega Giménez, Longbiao Wang, Luis Buera

This manuscript describes the I4U submission to the 2020 NIST Speaker Recognition Evaluation (SRE'20) Conversational Telephone Speech (CTS) Challenge.

Speaker Recognition

Scoring of Large-Margin Embeddings for Speaker Verification: Cosine or PLDA?

no code implementations • 8 Apr 2022 • Qiongqiong Wang, Kong Aik Lee, Tianchi Liu

The emergence of large-margin softmax cross-entropy losses in training deep speaker embedding neural networks has triggered a gradual shift from parametric back-ends to a simpler cosine similarity measure for speaker verification.

Speaker Verification

Task-aware Warping Factors in Mask-based Speech Enhancement

no code implementations • 27 Aug 2021 • Qiongqiong Wang, Kong Aik Lee, Takafumi Koshinaka, Koji Okabe, Hitoshi Yamamoto

It is easy to apply the proposed dual-warping factors approach to any mask-based SE method, and it allows a single SE system to handle multiple tasks without task-dependent training.

Automatic Speech Recognition (ASR) +3

Xi-Vector Embedding for Speaker Recognition

no code implementations • 12 Aug 2021 • Kong Aik Lee, Qiongqiong Wang, Takafumi Koshinaka

We present a Bayesian formulation for deep speaker embedding, wherein the xi-vector is the Bayesian counterpart of the x-vector, taking into account the uncertainty estimate.

Speaker Recognition

I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences

no code implementations • 16 Apr 2019 • Kong Aik Lee, Ville Hautamaki, Tomi Kinnunen, Hitoshi Yamamoto, Koji Okabe, Ville Vestman, Jing Huang, Guohong Ding, Hanwu Sun, Anthony Larcher, Rohan Kumar Das, Haizhou Li, Mickael Rouvier, Pierre-Michel Bousquet, Wei Rao, Qing Wang, Chunlei Zhang, Fahimeh Bahmaninezhad, Hector Delgado, Jose Patino, Qiongqiong Wang, Ling Guo, Takafumi Koshinaka, Jiacen Zhang, Koichi Shinoda, Trung Ngo Trong, Md Sahidullah, Fan Lu, Yun Tang, Ming Tu, Kah Kuan Teh, Huy Dat Tran, Kuruvachan K. George, Ivan Kukanov, Florent Desnous, Jichen Yang, Emre Yilmaz, Longting Xu, Jean-Francois Bonastre, Cheng-Lin Xu, Zhi Hao Lim, Eng Siong Chng, Shivesh Ranjan, John H. L. Hansen, Massimiliano Todisco, Nicholas Evans

The I4U consortium was established to facilitate a joint entry to NIST speaker recognition evaluations (SRE).

Domain Adaptation • Speaker Recognition

The CORAL+ Algorithm for Unsupervised Domain Adaptation of PLDA

no code implementations • 26 Dec 2018 • Kong Aik Lee, Qiongqiong Wang, Takafumi Koshinaka

We refer to the model-based adaptation technique proposed in this paper as CORAL+.

Speaker Recognition • Unsupervised Domain Adaptation
