1 code implementation • 29 Jun 2021 • Taejun Bak, Jae-Sung Bae, Hanbin Bae, Young-Ik Kim, Hoon-Young Cho
Methods for modeling and controlling prosody with acoustic features have been proposed for neural text-to-speech (TTS) models.
no code implementations • 29 Jun 2021 • Gyeong-Hoon Lee, Tae-Woo Kim, Hanbin Bae, Min-Ji Lee, Young-Ik Kim, Hoon-Young Cho
N-Singer consists of a Transformer-based mel-generator, a convolutional network-based postnet, and voicing-aware discriminators.
no code implementations • 4 Mar 2021 • Hanbin Bae, Jae-Sung Bae, Young-Sun Joo, Young-Ik Kim, Hoon-Young Cho
Second, the GST-TTS model with an auxiliary quality classifier is trained with the filtered speech and a small amount of clean speech.