Search Results for author: Jinzuomu Zhong

Found 2 papers, 1 paper with code

Prior-agnostic Multi-scale Contrastive Text-Audio Pre-training for Parallelized TTS Frontend Modeling

no code implementations • 14 Apr 2024 • Quanxiu Wang, Hui Huang, Mingjie Wang, Yong Dai, Jinzuomu Zhong, Benlai Tang

Furthermore, in the second stage, a parallelized TTS frontend model is devised to perform the TN, PD, and PBP prediction tasks.

Polyphone disambiguation

Multi-Modal Automatic Prosody Annotation with Contrastive Pretraining of SSWP

1 code implementation • 11 Sep 2023 • Jinzuomu Zhong, Yang Li, Hui Huang, Jie Liu, Zhiba Su, Jing Guo, Benlai Tang, Fengjie Zhu

While human prosody annotation contributes substantially to performance, it is a labor-intensive and time-consuming process that often yields inconsistent results.
