no code implementations • 3 Apr 2024 • Jaehyeon Kim, Keon Lee, Seungjun Chung, Jaewoong Cho
With the emergence of neural audio codecs, which encode multiple streams of discrete tokens from audio, large language models have recently gained attention as a promising approach for zero-shot Text-to-Speech (TTS) synthesis.
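Multi-stream discrete tokenization of the kind described above is commonly achieved with residual vector quantization (RVQ): each codebook quantizes the residual left by the previous one, yielding one token stream per codebook. The sketch below is illustrative only; the codebook count, size, and dimensions are made up and are not the codec used in the paper.

```python
import numpy as np

# Illustrative residual vector quantization (RVQ) sketch: one way a
# neural audio codec can turn a sequence of embedding frames into
# multiple parallel streams of discrete tokens. All sizes are arbitrary.
rng = np.random.default_rng(0)
num_streams, codebook_size, dim = 4, 256, 16
codebooks = rng.normal(size=(num_streams, codebook_size, dim))

def rvq_encode(frames, codebooks):
    """Quantize each frame into one token per stream; each later
    stream quantizes the residual left by the earlier streams."""
    residual = frames.copy()
    tokens = []
    for cb in codebooks:
        # Nearest codeword in this stream's codebook for every frame.
        dists = ((residual[:, None, :] - cb[None, :, :]) ** 2).sum(-1)
        idx = dists.argmin(axis=1)
        tokens.append(idx)
        residual = residual - cb[idx]  # hand the residual to the next stream
    return np.stack(tokens)  # shape: (num_streams, num_frames)

frames = rng.normal(size=(10, dim))  # 10 embedding frames
tokens = rvq_encode(frames, codebooks)
print(tokens.shape)  # one token per frame per stream: (4, 10)
```

A language model can then predict these token streams autoregressively and a codec decoder can reconstruct audio from them, which is the premise of the zero-shot TTS approach mentioned above.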
1 code implementation • 12 Jul 2023 • Jaewoong Cho, Kartik Sreenivasan, Keon Lee, Kyunghoo Mun, Soheun Yi, Jeong-Gwan Lee, Anna Lee, Jy-yong Sohn, Dimitris Papailiopoulos, Kangwook Lee
Contrastive learning has gained significant attention as a method for self-supervised learning.
no code implementations • 26 Oct 2022 • Kyumin Park, Keon Lee, Daeyoung Kim, Dongyeop Kang
We present a novel speech dataset, RedPen, with human annotations on unnatural speech regions and their corresponding reasons.
1 code implementation • 3 Jul 2022 • Keon Lee, Kyumin Park, Daeyoung Kim
The majority of current Text-to-Speech (TTS) datasets, which are collections of individual utterances, contain few conversational aspects.
1 code implementation • 17 Mar 2021 • Keon Lee, Kyumin Park, Daeyoung Kim
Previous work on neural text-to-speech (TTS) has addressed limited training and inference speed, robustness under difficult synthesis conditions, expressiveness, and controllability.