Waveform generation for text-to-speech synthesis using pitch-synchronous multi-scale generative adversarial networks

30 Oct 2018Lauri JuvelaBajibabu BollepalliJunichi YamagishiPaavo Alku

The state-of-the-art in text-to-speech synthesis has recently improved considerably due to novel neural waveform generation methods, such as WaveNet. However, these methods suffer from their slow sequential inference process, while their parallel versions are difficult to train and even more expensive computationally... (read more)

PDF Abstract

Results from the Paper


  Submit results from this paper to get state-of-the-art GitHub badges and help the community compare results to other papers.