Search Results for author: Xiaoyi Qin

Found 16 papers, 5 papers with code

Multi-objective Progressive Clustering for Semi-supervised Domain Adaptation in Speaker Verification

no code implementations7 Oct 2023 Ze Li, Yuke Lin, Ning Jiang, Xiaoyi Qin, Guoqing Zhao, Haiying Wu, Ming Li

Utilizing the pseudo-labeling algorithm with large-scale unlabeled data becomes crucial for semi-supervised domain adaptation in speaker verification tasks.

Clustering Denoising +3

Haha-Pod: An Attempt for Laughter-based Non-Verbal Speaker Verification

1 code implementation25 Sep 2023 Yuke Lin, Xiaoyi Qin, Ning Jiang, Guoqing Zhao, Ming Li

It is widely acknowledged that discriminative representation for speaker verification can be extracted from verbal speech.

Speaker Verification

VoxBlink: A Large Scale Speaker Verification Dataset on Camera

no code implementations14 Aug 2023 Yuke Lin, Xiaoyi Qin, Guoqing Zhao, Ming Cheng, Ning Jiang, Haiyang Wu, Ming Li

In this paper, we introduce a large-scale and high-quality audio-visual speaker verification dataset, named VoxBlink.

Speaker Recognition Speaker Verification

Target-Speaker Voice Activity Detection via Sequence-to-Sequence Prediction

no code implementations28 Oct 2022 Ming Cheng, Weiqing Wang, Yucong Zhang, Xiaoyi Qin, Ming Li

Target-speaker voice activity detection is currently a promising approach for speaker diarization in complex acoustic environments.

Action Detection Activity Detection +2

Laugh Betrays You? Learning Robust Speaker Representation From Speech Containing Non-Verbal Fragments

no code implementations28 Oct 2022 Yuke Lin, Xiaoyi Qin, Huahua Cui, Zhenyi Zhu, Ming Li

We collect a set of clips with laughter components by conducting a laughter detection script on VoxCeleb and part of the CN-Celeb dataset.

Speaker Verification

The DKU-DukeECE Diarization System for the VoxCeleb Speaker Recognition Challenge 2022

no code implementations4 Oct 2022 Weiqing Wang, Xiaoyi Qin, Ming Cheng, Yucong Zhang, Kangyue Wang, Ming Li

This paper discribes the DKU-DukeECE submission to the 4th track of the VoxCeleb Speaker Recognition Challenge 2022 (VoxSRC-22).

Action Detection Activity Detection +2

The DKU-OPPO System for the 2022 Spoofing-Aware Speaker Verification Challenge

no code implementations15 Jul 2022 Xingming Wang, Xiaoyi Qin, Yikang Wang, Yunfei Xu, Ming Li

For CM systems, we propose two methods on top of the challenge baseline to further improve the performance, namely Embedding Random Sampling Augmentation (ERSA) and One-Class Confusion Loss(OCCL).

Speaker Verification

Cross-Age Speaker Verification: Learning Age-Invariant Speaker Embeddings

1 code implementation13 Jul 2022 Xiaoyi Qin, Na Li, Chao Weng, Dan Su, Ming Li

In this paper, we mine cross-age test sets based on the VoxCeleb dataset and propose our age-invariant speaker representation(AISR) learning method.

Age Estimation Speaker Verification

Building Bilingual and Code-Switched Voice Conversion with Limited Training Data Using Embedding Consistency Loss

1 code implementation22 Apr 2021 Yaogen Yang, Haozhe Zhang, Xiaoyi Qin, Shanshan Liang, Huahua Cui, Mingyang Xu, Ming Li

We achieve cross-lingual VC between Mandarin speech with multiple speakers and English speech with multiple speakers by applying bilingual bottleneck features.

Voice Cloning Voice Conversion

Binary Neural Network for Speaker Verification

no code implementations6 Apr 2021 Tinglong Zhu, Xiaoyi Qin, Ming Li

Although deep neural networks are successful for many tasks in the speech domain, the high computational and memory costs of deep neural networks make it difficult to directly deploy highperformance Neural Network systems on low-resource embedded devices.

Binarization Quantization +1

Cannot find the paper you are looking for? You can Submit a new open access paper.