Search Results for author: Yahuan Cong

Found 2 papers, 0 papers with code

GenerTTS: Pronunciation Disentanglement for Timbre and Style Generalization in Cross-Lingual Text-to-Speech

no code implementations • 27 Jun 2023 • Yahuan Cong, Haoyu Zhang, Haopeng Lin, Shichao Liu, Chunfeng Wang, Yi Ren, Xiang Yin, Zejun Ma

Cross-lingual timbre and style generalizable text-to-speech (TTS) aims to synthesize speech with a specific reference timbre or style that is never trained in the target language.

Disentanglement Style Generalization

Paper
Add Code

Self-supervised learning for audio-visual speaker diarization

no code implementations • 13 Feb 2020 • Yifan Ding, Yong Xu, Shi-Xiong Zhang, Yahuan Cong, Liqiang Wang

Speaker diarization, which is to find the speech segments of specific speakers, has been widely used in human-centered applications such as video conferences or human-computer interaction systems.

Self-Supervised Learning speaker-diarization +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.