Fine-grained robust prosody transfer for single-speaker neural text-to-speech

4 Jul 2019 · Viacheslav Klimkov, Srikanth Ronanki, Jonas Rohnke, Thomas Drugman

We present a neural text-to-speech system for fine-grained prosody transfer from one speaker to another. Conventional approaches for end-to-end prosody transfer typically use either fixed-dimensional or variable-length prosody embedding via a secondary attention to encode the reference signal...




