no code implementations • 22 Apr 2024 • Zuheng Kang, Yayun He, Botao Zhao, Xiaoyang Qu, Junqing Peng, Jing Xiao, Jianzong Wang
With recent advances in speech synthesis including text-to-speech (TTS) and voice conversion (VC) systems enabling the generation of ultra-realistic audio deepfakes, there is growing concern about their potential misuse.
no code implementations • 7 Oct 2023 • Yayun He, Zuheng Kang, Jianzong Wang, Junqing Peng, Jing Xiao
Speaker verification (SV) performance deteriorates as utterances become shorter.
no code implementations • 31 May 2023 • Zuheng Kang, Jianzong Wang, Junqing Peng, Jing Xiao
To address this, we propose a speaker verification-based voice activity detection (SVVAD) framework that can adapt the speech features according to which are most informative for SV.
no code implementations • 14 Mar 2023 • Zuheng Kang, Yayun He, Jianzong Wang, Junqing Peng, Xiaoyang Qu, Jing Xiao
Data-Free Knowledge Distillation (DFKD) has recently attracted growing attention in the academic community, especially with major breakthroughs in computer vision.
no code implementations • 18 Oct 2022 • Zuheng Kang, Jianzong Wang, Junqing Peng, Jing Xiao
Estimating age from a single speech is a classic and challenging topic.
no code implementations • 27 Jun 2022 • Zuheng Kang, Junqing Peng, Jianzong Wang, Jing Xiao
Speech emotion recognition (SER) has many challenges, but one of the main challenges is that each framework does not have a unified standard.
no code implementations • 23 Feb 2022 • Shijing Si, Jianzong Wang, Junqing Peng, Jing Xiao
To address this, we utilize the ambiguous information among the age labels, convert each age label into a discrete label distribution and leverage the label distribution learning (LDL) method to fit the data.
no code implementations • 4 Mar 2020 • Chen Feng, Jianzong Wang, Tongxu Li, Junqing Peng, Jing Xiao
Recently, the speaker clustering model based on aggregation hierarchy cluster (AHC) is a common method to solve two main problems: no preset category number clustering and fix category number clustering.