Search Results for author: Junqing Peng

Found 8 papers, 0 papers with code

Retrieval-Augmented Audio Deepfake Detection

no code implementations • 22 Apr 2024 • Zuheng Kang, Yayun He, Botao Zhao, Xiaoyang Qu, Junqing Peng, Jing Xiao, Jianzong Wang

With recent advances in speech synthesis including text-to-speech (TTS) and voice conversion (VC) systems enabling the generation of ultra-realistic audio deepfakes, there is growing concern about their potential misuse.

VoiceExtender: Short-utterance Text-independent Speaker Verification with Guided Diffusion Model

no code implementations • 7 Oct 2023 • Yayun He, Zuheng Kang, Jianzong Wang, Junqing Peng, Jing Xiao

Speaker verification (SV) performance deteriorates as utterances become shorter.

Text-Independent Speaker Verification

SVVAD: Personal Voice Activity Detection for Speaker Verification

no code implementations • 31 May 2023 • Zuheng Kang, Jianzong Wang, Junqing Peng, Jing Xiao

To address this, we propose a speaker verification-based voice activity detection (SVVAD) framework that can adapt the speech features according to which are most informative for SV.

Action Detection, Activity Detection +1

Feature-Rich Audio Model Inversion for Data-Free Knowledge Distillation Towards General Sound Classification

no code implementations • 14 Mar 2023 • Zuheng Kang, Yayun He, Jianzong Wang, Junqing Peng, Xiaoyang Qu, Jing Xiao

Data-Free Knowledge Distillation (DFKD) has recently attracted growing attention in the academic community, especially with major breakthroughs in computer vision.

Data-free Knowledge Distillation, Sound Classification

SVLDL: Improved Speaker Age Estimation Using Selective Variance Label Distribution Learning

no code implementations • 18 Oct 2022 • Zuheng Kang, Jianzong Wang, Junqing Peng, Jing Xiao

Estimating age from a single speech is a classic and challenging topic.

Age Estimation

SpeechEQ: Speech Emotion Recognition based on Multi-scale Unified Datasets and Multitask Learning

no code implementations • 27 Jun 2022 • Zuheng Kang, Junqing Peng, Jianzong Wang, Jing Xiao

Speech emotion recognition (SER) has many challenges, but one of the main challenges is that each framework does not have a unified standard.

Speech Emotion Recognition

Towards Speaker Age Estimation with Label Distribution Learning

no code implementations • 23 Feb 2022 • Shijing Si, Jianzong Wang, Junqing Peng, Jing Xiao

To address this, we utilize the ambiguous information among the age labels, convert each age label into a discrete label distribution and leverage the label distribution learning (LDL) method to fit the data.

Age Classification, Age Estimation +2

A Robust Speaker Clustering Method Based on Discrete Tied Variational Autoencoder

no code implementations • 4 Mar 2020 • Chen Feng, Jianzong Wang, Tongxu Li, Junqing Peng, Jing Xiao

Recently, the speaker clustering model based on aggregation hierarchy cluster (AHC) is a common method to solve two main problems: no preset category number clustering and fix category number clustering.

Clustering
