Search Results for author: Seongkyu Mun

Found 6 papers, 1 papers with code

Latent Filling: Latent Space Data Augmentation for Zero-shot Speech Synthesis

no code implementations5 Oct 2023 Jae-Sung Bae, Joun Yeop Lee, Ji-Hyun Lee, Seongkyu Mun, Taehwa Kang, Hoon-Young Cho, Chanwoo Kim

Previous works in zero-shot text-to-speech (ZS-TTS) have attempted to enhance its systems by enlarging the training data through crowd-sourcing or augmenting existing speech data.

Data Augmentation Speech Synthesis

An Empirical Study on L2 Accents of Cross-lingual Text-to-Speech Systems via Vowel Space

no code implementations6 Nov 2022 JIhwan Lee, Jae-Sung Bae, Seongkyu Mun, Heejin Choi, Joun Yeop Lee, Hoon-Young Cho, Chanwoo Kim

With the recent developments in cross-lingual Text-to-Speech (TTS) systems, L2 (second-language, or foreign) accent problems arise.

Into-TTS : Intonation Template Based Prosody Control System

no code implementations4 Apr 2022 JIhwan Lee, Joun Yeop Lee, Heejin Choi, Seongkyu Mun, Sangjun Park, Jae-Sung Bae, Chanwoo Kim

Two proposed modules are added to the end-to-end TTS framework: an intonation predictor and an intonation encoder.

Language Modelling

Streaming end-to-end speech recognition with jointly trained neural feature enhancement

no code implementations4 May 2021 Chanwoo Kim, Abhinav Garg, Dhananjaya Gowda, Seongkyu Mun, Changwoo Han

In this paper, we present a streaming end-to-end speech recognition model based on Monotonic Chunkwise Attention (MoCha) jointly trained with enhancement layers.

speech-recognition Speech Recognition

Delving into VoxCeleb: environment invariant speaker recognition

1 code implementation24 Oct 2019 Joon Son Chung, Jaesung Huh, Seongkyu Mun

Research in speaker recognition has recently seen significant progress due to the application of neural network models and the availability of new large-scale datasets.

Speaker Identification Speaker Recognition

KU-ISPL Language Recognition System for NIST 2015 i-Vector Machine Learning Challenge

no code implementations21 Sep 2016 Suwon Shon, Seongkyu Mun, John H. L. Hansen, Hanseok Ko

The experimental results show that the use of duration and score fusion improves language recognition performance by 5% relative in LRiMLC15 cost.

BIG-bench Machine Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.