Search Results for author: Jaehyeon Kim

Found 4 papers, 3 papers with code

Glow-TTS: A Generative Flow for Text-to-Speech via Monotonic Alignment Search

5 code implementations NeurIPS 2020 Jaehyeon Kim, Sungwon Kim, Jungil Kong, Sungroh Yoon

By leveraging the properties of flows, MAS searches for the most probable monotonic alignment between text and the latent representation of speech.

Ranked #4 on Text-To-Speech Synthesis on LJSpeech (using extra training data)

Text-To-Speech Synthesis

Progressive Deblurring of Diffusion Models for Coarse-to-Fine Image Synthesis

1 code implementation16 Jul 2022 Sangyun Lee, Hyungjin Chung, Jaehyeon Kim, Jong Chul Ye

We further propose a blur diffusion as a special case, where each frequency component of an image is diffused at different speeds.

Deblurring Image Generation +1

CLaM-TTS: Improving Neural Codec Language Model for Zero-Shot Text-to-Speech

no code implementations3 Apr 2024 Jaehyeon Kim, Keon Lee, Seungjun Chung, Jaewoong Cho

With the emergence of neural audio codecs, which encode multiple streams of discrete tokens from audio, large language models have recently gained attention as a promising approach for zero-shot Text-to-Speech (TTS) synthesis.

Language Modelling Quantization

Cannot find the paper you are looking for? You can Submit a new open access paper.