no code implementations • 29 Jun 2021 • Jinhyeok Yang, Jae-Sung Bae, Taejun Bak, Youngik Kim, Hoon-Young Cho
Recent advances in neural multi-speaker text-to-speech (TTS) models have enabled the generation of reasonably good speech quality with a single model and made it possible to synthesize the speech of a speaker with limited training data.
2 code implementations • 30 Jul 2020 • Jinhyeok Yang, Jun-Mo Lee, Youngik Kim, Hoon-Young Cho, Injung Kim
Additionally, compared with Parallel WaveGAN, another recently developed high-fidelity vocoder, VocGAN is 6. 98x faster on a CPU and exhibits higher MOS.