Search Results for author: Luyao Cheng

Found 7 papers, 5 papers with code

3D-Speaker-Toolkit: An Open Source Toolkit for Multi-modal Speaker Verification and Diarization

1 code implementation • 29 Mar 2024 • Yafeng Chen, Siqi Zheng, Hui Wang, Luyao Cheng, Tinglong Zhu, Changhe Song, Rongjie Huang, Ziyang Ma, Qian Chen, Shiliang Zhang, Xihao Li

This paper introduces 3D-Speaker-Toolkit, an open source toolkit for multi-modal speaker verification and diarization.

Self-Supervised Learning speaker-diarization +3

684

Paper
Code

Improving Speaker Diarization using Semantic Information: Joint Pairwise Constraints Propagation

no code implementations • 19 Sep 2023 • Luyao Cheng, Siqi Zheng, Qinglin Zhang, Hui Wang, Yafeng Chen, Qian Chen, Shiliang Zhang

Speaker diarization has gained considerable attention within speech processing research community.

speaker-diarization Speaker Diarization +1

Paper
Add Code

Self-Distillation Network with Ensemble Prototypes: Learning Robust Speaker Representations without Supervision

1 code implementation • 5 Aug 2023 • Yafeng Chen, Siqi Zheng, Hui Wang, Luyao Cheng, Qian Chen, Shiliang Zhang

It assigns representation of augmented views of utterances to the same prototypes as the representation of the original view, thereby enabling effective knowledge transfer between the views.

Representation Learning Speaker Verification +1

684

Paper
Code

3D-Speaker: A Large-Scale Multi-Device, Multi-Distance, and Multi-Dialect Corpus for Speech Representation Disentanglement

1 code implementation • 27 Jun 2023 • Siqi Zheng, Luyao Cheng, Yafeng Chen, Hui Wang, Qian Chen

Disentangling uncorrelated information in speech utterances is a crucial research topic within speech community.

Disentanglement Self-Supervised Learning

684

Paper
Code

Exploring Speaker-Related Information in Spoken Language Understanding for Better Speaker Diarization

no code implementations • 22 May 2023 • Luyao Cheng, Siqi Zheng, Zhang Qinglin, Hui Wang, Yafeng Chen, Qian Chen

In this paper, we propose methods to extract speaker-related information from semantic content in multi-party meetings, which, as we will show, can further benefit speaker diarization.

speaker-diarization Speaker Diarization +1

Paper
Add Code

An Enhanced Res2Net with Local and Global Feature Fusion for Speaker Verification

2 code implementations • 22 May 2023 • Yafeng Chen, Siqi Zheng, Hui Wang, Luyao Cheng, Qian Chen, Jiajun Qi

This paper proposes a novel architecture called Enhanced Res2Net (ERes2Net), which incorporates both local and global feature fusion techniques to improve the performance.

Speaker Verification

684

Paper
Code

Pushing the limits of self-supervised speaker verification using regularized distillation framework

1 code implementation • 8 Nov 2022 • Yafeng Chen, Siqi Zheng, Hui Wang, Luyao Cheng, Qian Chen

A range of experiments conducted on the VoxCeleb datasets demonstrate the superiority of the regularized DINO framework in speaker verification.

Data Augmentation Self-Supervised Learning +1

684

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.