Search Results for author: Jaehyeon Kim

Found 4 papers, 3 papers with code

Glow-TTS: A Generative Flow for Text-to-Speech via Monotonic Alignment Search

5 code implementations • NeurIPS 2020 • Jaehyeon Kim, Sungwon Kim, Jungil Kong, Sungroh Yoon

By leveraging the properties of flows, MAS searches for the most probable monotonic alignment between text and the latent representation of speech.
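As a rough illustration of what searching for the most probable monotonic alignment can look like, here is a minimal dynamic-programming sketch. It is not the authors' implementation. The function name `monotonic_alignment_search` and the input layout are assumptions: `log_p[i][j]` is taken to be the log-likelihood of latent frame `j` under text token `i`. It also assumes every token covers at least one frame and no token is skipped.

```python
import math


def monotonic_alignment_search(log_p):
    """Return, for each latent frame, the index of the text token it aligns to.

    log_p[i][j] is assumed to hold the log-likelihood of frame j under token i
    (a hypothetical layout chosen for this sketch).
    """
    n_text, n_frames = len(log_p), len(log_p[0])
    if n_frames < n_text:
        raise ValueError("need at least one frame per text token")

    # q[i][j]: best total log-likelihood of any monotonic path ending at (i, j).
    q = [[-math.inf] * n_frames for _ in range(n_text)]
    q[0][0] = log_p[0][0]
    for j in range(1, n_frames):
        for i in range(min(j + 1, n_text)):
            stay = q[i][j - 1]  # same token as the previous frame
            advance = q[i - 1][j - 1] if i > 0 else -math.inf  # next token
            q[i][j] = max(stay, advance) + log_p[i][j]

    # Backtrack from the last token at the last frame to recover the path.
    alignment = [0] * n_frames
    i = n_text - 1
    for j in range(n_frames - 1, -1, -1):
        alignment[j] = i
        if j > 0 and i > 0 and q[i - 1][j - 1] >= q[i][j - 1]:
            i -= 1
    return alignment


# Two tokens, four frames: the first two frames fit token 0, the rest token 1.
example = [
    [-0.1, -0.2, -3.0, -3.0],
    [-3.0, -3.0, -0.1, -0.2],
]
alignment = monotonic_alignment_search(example)
```

The table is filled one frame at a time, so the cost is proportional to the number of tokens times the number of frames.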

Ranked #4 on Text-To-Speech Synthesis on LJSpeech (using extra training data)

Text-To-Speech Synthesis

29,084


HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

10 code implementations • NeurIPS 2020 • Jungil Kong, Jaehyeon Kim, Jaekyoung Bae

Several recent works on speech synthesis have employed generative adversarial networks (GANs) to produce raw waveforms.

Ranked #10 on Speech Synthesis on LibriTTS

Speech Synthesis

12,594


Progressive Deblurring of Diffusion Models for Coarse-to-Fine Image Synthesis

1 code implementation • 16 Jul 2022 • Sangyun Lee, Hyungjin Chung, Jaehyeon Kim, Jong Chul Ye

We further propose a blur diffusion as a special case, where each frequency component of an image is diffused at different speeds.

Deblurring, Image Generation, +1

142


CLaM-TTS: Improving Neural Codec Language Model for Zero-Shot Text-to-Speech

no code implementations • 3 Apr 2024 • Jaehyeon Kim, Keon Lee, Seungjun Chung, Jaewoong Cho

With the emergence of neural audio codecs, which encode multiple streams of discrete tokens from audio, large language models have recently gained attention as a promising approach for zero-shot Text-to-Speech (TTS) synthesis.

Language Modelling, Quantization

