Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis

NeurIPS 2018 • Ye Jia, Yu Zhang, Ron J. Weiss, Quan Wang, Jonathan Shen, Fei Ren, Zhifeng Chen, Patrick Nguyen, Ruoming Pang, Ignacio Lopez Moreno, Yonghui Wu

Clone a voice in 5 seconds to generate arbitrary speech in real time.


Evaluation Results from the Paper


 SOTA for Text-To-Speech Synthesis on LJSpeech (using extra training data)

TASK                      DATASET   MODEL     METRIC NAME  METRIC VALUE  GLOBAL RANK  USES EXTRA TRAINING DATA
Text-To-Speech Synthesis  LJSpeech  tacotron  Accuracy     12            # 1          Yes