no code implementations • 6 Sep 2023 • Hyungseob Lim, Kyungguen Byun, Sunkuk Moon, Erik Visser
Finally, content information extracted from the source speech and content-dependent target style embeddings are fed into a diffusion-based decoder to generate the converted speech mel-spectrogram.
no code implementations • 9 Nov 2018 • Eunwoo Song, Kyungguen Byun, Hong-Goo Kang
Conventional WaveNet-based neural vocoding systems significantly improve the perceptual quality of synthesized speech by statistically generating a time sequence of speech waveforms through an auto-regressive framework.
no code implementations • 8 Nov 2018 • Eunwoo Song, Jin-Seob Kim, Kyungguen Byun, Hong-Goo Kang
To generate more natural speech signals with the constraint of limited training data, we propose a speaker adaptation task with an effective variation of neural vocoding models.