Search Results for author: Young-Ik Kim

FastPitchFormant: Source-filter based Decomposed Modeling for Speech Synthesis

Methods for modeling and controlling prosody with acoustic features have been proposed for neural text-to-speech (TTS) models.

Paper
Code

N-Singer consists of a Transformer-based mel-generator, a convolutional network-based postnet, and voicing-aware discriminators.

Paper
Add Code

Second, the GST-TTS model with an auxiliary quality classifier is trained with the filtered speech and a small amount of clean speech.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.