Search Results for author: Zhengyang Chen

Found 7 papers, 4 papers with code

Target Speech Diarization with Multimodal Prompts

no code implementations • 11 Jun 2024 • Yidi Jiang, Ruijie Tao, Zhengyang Chen, Yanmin Qian, Haizhou Li

Extending to target speech diarization, we detect "when target event occurs" according to the semantic characteristics of speech.

Speaker Diarization

Prompt-driven Target Speech Diarization

no code implementations • 23 Oct 2023 • Yidi Jiang, Zhengyang Chen, Ruijie Tao, Liqun Deng, Yanmin Qian, Haizhou Li

We introduce a novel task named "target speech diarization", which seeks to determine "when target event occurred" within an audio signal.

Action Detection Activity Detection

Leveraging In-the-Wild Data for Effective Self-Supervised Pretraining in Speaker Recognition

1 code implementation • 21 Sep 2023 • Shuai Wang, Qibing Bai, Qi Liu, Jianwei Yu, Zhengyang Chen, Bing Han, Yanmin Qian, Haizhou Li

Current speaker recognition systems primarily rely on supervised approaches, constrained by the scale of labeled datasets.

Speaker Recognition

Exploring Binary Classification Loss For Speaker Verification

1 code implementation • 17 Jul 2023 • Bing Han, Zhengyang Chen, Yanmin Qian

The mismatch between close-set training and open-set testing usually leads to significant performance degradation for the speaker verification task.

Binary Classification Classification +2

Wespeaker baselines for VoxSRC2023

no code implementations • 27 Jun 2023 • Shuai Wang, Chengdong Liang, Xu Xiang, Bing Han, Zhengyang Chen, Hongji Wang, Wen Ding

This report showcases the results achieved using the wespeaker toolkit for the VoxSRC2023 Challenge.
