no code implementations • 6 Nov 2022 • JIhwan Lee, Jae-Sung Bae, Seongkyu Mun, Heejin Choi, Joun Yeop Lee, Hoon-Young Cho, Chanwoo Kim
With the recent developments in cross-lingual Text-to-Speech (TTS) systems, L2 (second-language, or foreign) accent problems arise.
no code implementations • 4 Apr 2022 • JIhwan Lee, Joun Yeop Lee, Heejin Choi, Seongkyu Mun, Sangjun Park, Jae-Sung Bae, Chanwoo Kim
Two proposed modules are added to the end-to-end TTS framework: an intonation predictor and an intonation encoder.
no code implementations • 4 May 2021 • Chanwoo Kim, Abhinav Garg, Dhananjaya Gowda, Seongkyu Mun, Changwoo Han
In this paper, we present a streaming end-to-end speech recognition model based on Monotonic Chunkwise Attention (MoCha) jointly trained with enhancement layers.
1 code implementation • 24 Oct 2019 • Joon Son Chung, Jaesung Huh, Seongkyu Mun
Research in speaker recognition has recently seen significant progress due to the application of neural network models and the availability of new large-scale datasets.
no code implementations • 21 Sep 2016 • Suwon Shon, Seongkyu Mun, John H. L. Hansen, Hanseok Ko
The experimental results show that the use of duration and score fusion improves language recognition performance by 5% relative in LRiMLC15 cost.