Search Results for author: Hyeongju Kim

Found 7 papers, 5 papers with code

EdiTTS: Score-based Editing for Controllable Text-to-Speech

1 code implementation • 6 Oct 2021 • Jaesung Tae, Hyeongju Kim, Taesu Kim

We present EdiTTS, an off-the-shelf speech editing methodology based on score-based generative modeling for text-to-speech synthesis.

Speech Synthesis Text-To-Speech Synthesis

114

Paper
Code

Scheduling Optimization Techniques for Neural Network Training

no code implementations • 3 Oct 2021 • Hyungjun Oh, Hyeongju Kim, Jiwon Seo

In data-parallel training, we reorder the gradient computations to maximize the overlapping of computation and parameter communication; in pipeline-parallel training, we prioritize critical gradient computations to reduce the pipeline stalls. We evaluate our optimizations with twelve neural networks including a light-weight computer vision model (MobileNet) and largeNLP models (BERT and GPT-3) with up to forty eight V100 GPUs. Our scheduling algorithms effectively improve the performance of single-GPU training as well as data- and pipeline-parallel training. Compared to the respective state of the art training systems, the throughput is substantially improved for single-GPU, data-parallel, and pipeline-parallel training.

Scheduling

Paper
Add Code

MLP Singer: Towards Rapid Parallel Singing Voice Synthesis

2 code implementations • arXiv 2021 • Jaesung Tae, Hyeongju Kim, Younggun Lee

Recent developments in deep learning have significantly improved the quality of synthesized singing voice audio.

Singing Voice Synthesis

114

Paper
Code

Diff-TTS: A Denoising Diffusion Model for Text-to-Speech

1 code implementation • 3 Apr 2021 • Myeonghun Jeong, Hyeongju Kim, Sung Jun Cheon, Byoung Jin Choi, Nam Soo Kim

Although neural text-to-speech (TTS) models have attracted a lot of attention and succeeded in generating human-like speech, there is still room for improvements to its naturalness and architectural efficiency.

Denoising Speech Synthesis

224

Paper
Code

Continuous Monitoring of Blood Pressure with Evidential Regression

no code implementations • 6 Feb 2021 • Hyeongju Kim, Woo Hyun Kang, Hyeonseung Lee, Nam Soo Kim

Photoplethysmogram (PPG) signal-based blood pressure (BP) estimation is a promising candidate for modern BP measurements, as PPG signals can be easily obtained from wearable devices in a non-invasive manner, allowing quick BP measurement.

regression

Paper
Add Code

WaveNODE: A Continuous Normalizing Flow for Speech Synthesis

1 code implementation • 8 Jun 2020 • Hyeongju Kim, Hyeonseung Lee, Woo Hyun Kang, Sung Jun Cheon, Byoung Jin Choi, Nam Soo Kim

In recent years, various flow-based generative models have been proposed to generate high-fidelity waveforms in real-time.

Speech Synthesis

Paper
Code

SoftFlow: Probabilistic Framework for Normalizing Flow on Manifolds

1 code implementation • NeurIPS 2020 • Hyeongju Kim, Hyeonseung Lee, Woo Hyun Kang, Joun Yeop Lee, Nam Soo Kim

Flow-based generative models are composed of invertible transformations between two random variables of the same dimension.

Ranked #3 on Point Cloud Generation on ShapeNet Airplane

Point Cloud Generation

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.