Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks with Guided Attention

24 Oct 2017Hideyuki TachibanaKatsuya UenoyamaShunsuke Aihara

This paper describes a novel text-to-speech (TTS) technique based on deep convolutional neural networks (CNN), without any recurrent units. Recurrent neural network (RNN) has been a standard technique to model sequential data recently, and this technique has been used in some cutting-edge neural TTS techniques... (read more)

PDF Abstract

Evaluation results from the paper


  Submit results from this paper to get state-of-the-art GitHub badges and help community compare results to other papers.