no code implementations • 26 Sep 2022 • Yusuke Nakai, Yuki Saito, Kenta Udagawa, Hiroshi Saruwatari
A conventional generative adversarial network (GAN)-based training algorithm significantly improves the quality of synthetic speech by reducing the statistical difference between natural and synthetic speech.
no code implementations • 21 Jun 2022 • Kenta Udagawa, Yuki Saito, Hiroshi Saruwatari
With a conventional speaker-adaptation method, a target speaker's embedding vector is extracted from their reference speech using a speaker encoder trained on a speaker-discriminative task.