no code implementations • 6 Nov 2022 • JIhwan Lee, Jae-Sung Bae, Seongkyu Mun, Heejin Choi, Joun Yeop Lee, Hoon-Young Cho, Chanwoo Kim
With the recent developments in cross-lingual Text-to-Speech (TTS) systems, L2 (second-language, or foreign) accent problems arise.
no code implementations • 29 Jun 2021 • Jae-Sung Bae, Tae-Jun Bak, Young-Sun Joo, Hoon-Young Cho
Therefore, to improve the modeling performance of the TNA-TTS model we propose a hierarchical Transformer structure-based text encoder and audio decoder that are designed to accommodate the characteristics of each module.
no code implementations • 29 Jun 2021 • Gyeong-Hoon Lee, Tae-Woo Kim, Hanbin Bae, Min-Ji Lee, Young-Ik Kim, Hoon-Young Cho
N-Singer consists of a Transformer-based mel-generator, a convolutional network-based postnet, and voicing-aware discriminators.
1 code implementation • 29 Jun 2021 • Taejun Bak, Jae-Sung Bae, Hanbin Bae, Young-Ik Kim, Hoon-Young Cho
Methods for modeling and controlling prosody with acoustic features have been proposed for neural text-to-speech (TTS) models.
no code implementations • 29 Jun 2021 • Jinhyeok Yang, Jae-Sung Bae, Taejun Bak, Youngik Kim, Hoon-Young Cho
Recent advances in neural multi-speaker text-to-speech (TTS) models have enabled the generation of reasonably good speech quality with a single model and made it possible to synthesize the speech of a speaker with limited training data.
no code implementations • 4 Mar 2021 • Hanbin Bae, Jae-Sung Bae, Young-Sun Joo, Young-Ik Kim, Hoon-Young Cho
Second, the GST-TTS model with an auxiliary quality classifier is trained with the filtered speech and a small amount of clean speech.
2 code implementations • 30 Jul 2020 • Jinhyeok Yang, Jun-Mo Lee, Youngik Kim, Hoon-Young Cho, Injung Kim
Additionally, compared with Parallel WaveGAN, another recently developed high-fidelity vocoder, VocGAN is 6. 98x faster on a CPU and exhibits higher MOS.
no code implementations • 20 Mar 2020 • Yoonjae Jeong, Hoon-Young Cho
The purpose of this study is to detect the mismatch between text script and voice-over.