no code implementations • 7 Dec 2021 • Sungjae Cho, Soo-Young Lee
We present a methodology to train our multi-speaker emotional text-to-speech synthesizer that can express speech for 10 speakers' 7 different emotions.
no code implementations • 26 Nov 2020 • Jihyeon Roh, Sang-Hoon Oh, Soo-Young Lee
Although Perplexity is a widely used performance metric for language models, the values are highly dependent upon the number of words in the corpus and is useful to compare performance of the same corpus only.
no code implementations • 18 Sep 2020 • Jihyeon Roh, Huiseong Gim, Soo-Young Lee
First, we propose a hierarchical GPT which consists of three blocks, i. e., a sentence encoding block, a sentence generating block, and a sentence decoding block.
1 code implementation • 14 Mar 2020 • Bo-Kyeong Kim, Sungjin Park, Geonmin Kim, Soo-Young Lee
We aim to separate the generative factors of data into two latent vectors in a variational autoencoder.
1 code implementation • 11 Nov 2019 • Tae-Ho Kim, Sungjae Cho, Shinkook Choi, Sejik Park, Soo-Young Lee
The embedding space of seq2seq-based TTS has abundant information on the text.
1 code implementation • 6 Nov 2018 • Geonmin Kim, Hwaran Lee, Bo-Kyeong Kim, Sang-Hoon Oh, Soo-Young Lee
Many speech enhancement methods try to learn the relationship between noisy and clean speech, obtained using an acoustic room simulator.
1 code implementation • 12 Oct 2018 • Azam Rabiee, Soo-Young Lee
This paper introduces a deep neural network model for subband-based speech synthesizer.
1 code implementation • 4 Sep 2018 • Myungsu Chae, Tae-Ho Kim, Young Hoon Shin, June-Woo Kim, Soo-Young Lee
In our experiments, emotion and gender recognition with the proposed method yielded a lower joint loss, which is computed as the negative log-likelihood, than using static weights for joint loss.
no code implementations • journal 2018 • Young-Gun Lee, Taesu Kim, Soo-Young Lee
We propose a neural text-to-speech (TTS) model that can imitate a new speaker's voice using only a small amount of speech sample.
1 code implementation • 15 Nov 2017 • Young-Gun Lee, Azam Rabiee, Soo-Young Lee
In this paper, we introduce an emotional speech synthesizer based on the recent end-to-end neural model, named Tacotron.
no code implementations • 10 Jun 2016 • Hwaran Lee, Geonmin Kim, Ho-Gyeong Kim, Sang-Hoon Oh, Soo-Young Lee
Convolutional neural networks (CNNs) with convolutional and pooling operations along the frequency axis have been proposed to attain invariance to frequency shifts of features.
no code implementations • 2 May 2016 • Geonmin Kim, Hwaran Lee, Jisu Choi, Soo-Young Lee
In the HCRN, word representations are built from characters, thus resolving the data-sparsity problem, and inter-sentence dependency is embedded into sentence representation at the level of sentence composition.