no code implementations • 6 Nov 2022 • Jihwan Lee, Jae-Sung Bae, Seongkyu Mun, Heejin Choi, Joun Yeop Lee, Hoon-Young Cho, Chanwoo Kim
With the recent developments in cross-lingual Text-to-Speech (TTS) systems, L2 (second-language, or foreign) accent problems arise.
no code implementations • 4 Apr 2022 • Jihwan Lee, Joun Yeop Lee, Heejin Choi, Seongkyu Mun, Sangjun Park, Jae-Sung Bae, Chanwoo Kim
Two proposed modules are added to the end-to-end TTS framework: an intonation predictor and an intonation encoder.
no code implementations • NAACL 2019 • Jihwan Lee, Ruhi Sarikaya, Young-Bum Kim
In this paper, we introduce an approach for leveraging available data across multiple locales sharing the same language to 1) improve domain classification model accuracy in Spoken Language Understanding and user experience even if new locales do not have sufficient data and 2) reduce the cost of scaling the domain classifier to a large number of locales.
no code implementations • NAACL 2019 • Han Li, Jihwan Lee, Sidharth Mudgal, Ruhi Sarikaya, Young-Bum Kim
This is a major component in mainstream IPDAs in industry.
no code implementations • 13 Dec 2018 • Jihwan Lee, Dongchan Kim, Ruhi Sarikaya, Young-Bum Kim
Our proposed model learns the vector representation of intents based on the slots tied to these intents by aggregating the representations of the slots.
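As a rough illustration of building an intent vector by aggregating the vectors of its slots, the sketch below averages toy slot embeddings. The summary above does not say which aggregation the model uses, so mean pooling is an assumption here, and the names `slot_embeddings`, `intent_slots`, `mean_pool`, and `intent_embedding` are hypothetical. The numbers are hand-picked placeholders, not learned values.

```python
from typing import Dict, List

Vector = List[float]

# Toy slot embeddings (hypothetical values; a real model would learn these).
slot_embeddings: Dict[str, Vector] = {
    "city": [1.0, 0.0],
    "date": [0.0, 1.0],
    "artist": [1.0, 1.0],
}

# Which slots are tied to which intent (hypothetical schema).
intent_slots: Dict[str, List[str]] = {
    "GetWeather": ["city", "date"],
    "PlayMusic": ["artist"],
}


def mean_pool(vectors: List[Vector]) -> Vector:
    """Element-wise mean of equally sized vectors (assumed aggregation)."""
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]


def intent_embedding(intent: str) -> Vector:
    """Represent an intent by aggregating the embeddings of its slots."""
    return mean_pool([slot_embeddings[s] for s in intent_slots[intent]])


if __name__ == "__main__":
    for name in intent_slots:
        print(name, intent_embedding(name))
```

Because the intent vector is derived only from slot vectors, a new intent that reuses known slots gets a representation without any intent-specific training data in this sketch.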