Search Results for author: Keon Lee

Found 5 papers, 3 papers with code

CLaM-TTS: Improving Neural Codec Language Model for Zero-Shot Text-to-Speech

no code implementations3 Apr 2024 Jaehyeon Kim, Keon Lee, Seungjun Chung, Jaewoong Cho

With the emergence of neural audio codecs, which encode multiple streams of discrete tokens from audio, large language models have recently gained attention as a promising approach for zero-shot Text-to-Speech (TTS) synthesis.

Language Modelling Quantization

RedPen: Region- and Reason-Annotated Dataset of Unnatural Speech

no code implementations26 Oct 2022 Kyumin Park, Keon Lee, Daeyoung Kim, Dongyeop Kang

We present a novel speech dataset, RedPen, with human annotations on unnatural speech regions and their corresponding reasons.

Speech Synthesis

DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech

1 code implementation3 Jul 2022 Keon Lee, Kyumin Park, Daeyoung Kim

The majority of current Text-to-Speech (TTS) datasets, which are collections of individual utterances, contain few conversational aspects.

STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllable Neural Text to Speech

1 code implementation17 Mar 2021 Keon Lee, Kyumin Park, Daeyoung Kim

Previous works on neural text-to-speech (TTS) have been addressed on limited speed in training and inference time, robustness for difficult synthesis conditions, expressiveness, and controllability.

Speech Synthesis Style Transfer

Cannot find the paper you are looking for? You can Submit a new open access paper.