Search Results for author: Heyang Xue

Found 2 papers, 0 papers with code

Multi-GradSpeech: Towards Diffusion-based Multi-Speaker Text-to-speech Using Consistent Diffusion Models

no code implementations • 21 Aug 2023 • Heyang Xue, Shuai Guo, Pengcheng Zhu, Mengxiao Bi

Despite imperfect score-matching causing drift in training and sampling distributions of diffusion models, recent advances in diffusion-based acoustic models have revolutionized data-sufficient single-speaker Text-to-Speech (TTS) approaches, with Grad-TTS being a prime example.

Paper
Add Code

VISinger: Variational Inference with Adversarial Learning for End-to-End Singing Voice Synthesis

no code implementations • 17 Oct 2021 • Yongmao Zhang, Jian Cong, Heyang Xue, Lei Xie, Pengcheng Zhu, Mengxiao Bi

In this paper, we propose VISinger, a complete end-to-end high-quality singing voice synthesis (SVS) system that directly generates audio waveform from lyrics and musical score.

Singing Voice Synthesis Variational Inference

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.