Search Results for author: Seongkyu Mun

Found 6 papers, 1 papers with code

Latent Filling: Latent Space Data Augmentation for Zero-shot Speech Synthesis

no code implementations • 5 Oct 2023 • Jae-Sung Bae, Joun Yeop Lee, Ji-Hyun Lee, Seongkyu Mun, Taehwa Kang, Hoon-Young Cho, Chanwoo Kim

Previous works in zero-shot text-to-speech (ZS-TTS) have attempted to enhance its systems by enlarging the training data through crowd-sourcing or augmenting existing speech data.

Data Augmentation Speech Synthesis

Paper
Add Code

An Empirical Study on L2 Accents of Cross-lingual Text-to-Speech Systems via Vowel Space

no code implementations • 6 Nov 2022 • JIhwan Lee, Jae-Sung Bae, Seongkyu Mun, Heejin Choi, Joun Yeop Lee, Hoon-Young Cho, Chanwoo Kim

With the recent developments in cross-lingual Text-to-Speech (TTS) systems, L2 (second-language, or foreign) accent problems arise.

Paper
Add Code

Into-TTS : Intonation Template Based Prosody Control System

no code implementations • 4 Apr 2022 • JIhwan Lee, Joun Yeop Lee, Heejin Choi, Seongkyu Mun, Sangjun Park, Jae-Sung Bae, Chanwoo Kim

Two proposed modules are added to the end-to-end TTS framework: an intonation predictor and an intonation encoder.

Language Modelling

Paper
Add Code

Streaming end-to-end speech recognition with jointly trained neural feature enhancement

no code implementations • 4 May 2021 • Chanwoo Kim, Abhinav Garg, Dhananjaya Gowda, Seongkyu Mun, Changwoo Han

In this paper, we present a streaming end-to-end speech recognition model based on Monotonic Chunkwise Attention (MoCha) jointly trained with enhancement layers.

speech-recognition Speech Recognition

Paper
Add Code

Delving into VoxCeleb: environment invariant speaker recognition

1 code implementation • 24 Oct 2019 • Joon Son Chung, Jaesung Huh, Seongkyu Mun

Research in speaker recognition has recently seen significant progress due to the application of neural network models and the availability of new large-scale datasets.

Speaker Identification Speaker Recognition

Paper
Code

KU-ISPL Language Recognition System for NIST 2015 i-Vector Machine Learning Challenge

no code implementations • 21 Sep 2016 • Suwon Shon, Seongkyu Mun, John H. L. Hansen, Hanseok Ko

The experimental results show that the use of duration and score fusion improves language recognition performance by 5% relative in LRiMLC15 cost.

BIG-bench Machine Learning

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.