1 code implementation • 30 May 2023 • Sungwon Kim, Junseok Lee, Namkyeong Lee, Wonjoong Kim, Seungyoon Choi, Chanyoung Park
To solve this problem, it is important for GNNs to be able to classify nodes with a limited number of labeled nodes, known as few-shot node classification.
1 code implementation • 29 Apr 2023 • Namkyeong Lee, Dongmin Hyun, Gyoung S. Na, Sungwon Kim, Junseok Lee, Chanyoung Park
Molecular relational learning, whose goal is to learn the interaction behavior between molecular pairs, got a surge of interest in molecular sciences due to its wide range of applications.
1 code implementation • 13 Mar 2023 • Namkyeong Lee, Heewoong Noh, Sungwon Kim, Dongmin Hyun, Gyoung S. Na, Chanyoung Park
The density of states (DOS) is a spectral property of materials, which provides fundamental insights on various characteristics of materials.
no code implementations • 30 May 2022 • Sungwon Kim, Heeseung Kim, Sungroh Yoon
We train the speaker-conditional diffusion model on large-scale untranscribed datasets for a classifier-free guidance method and further fine-tune the diffusion model on the reference speech of the target speaker for adaptation, which only takes 40 seconds.
4 code implementations • CVPR 2022 • Jooyoung Choi, Jungbeom Lee, Chaehun Shin, Sungwon Kim, Hyunwoo Kim, Sungroh Yoon
Diffusion models learn to restore noisy data, which is corrupted with different levels of noise, by optimizing the weighted sum of the corresponding loss terms, i. e., denoising score matching loss.
no code implementations • 23 Nov 2021 • Heeseung Kim, Sungwon Kim, Sungroh Yoon
For TTS synthesis, we guide the generative process of the diffusion model with a phoneme classifier trained on a large-scale speech recognition dataset.
no code implementations • 2 Oct 2021 • Yonghyun Jeong, Jooyoung Choi, Sungwon Kim, Youngmin Ro, Tae-Hyun Oh, Doyeon Kim, Heonseok Ha, Sungroh Yoon
In this work, we present Facial Identity Controllable GAN (FICGAN) for not only generating high-quality de-identified face images with ensured privacy protection, but also detailed controllability on attribute preservation for enhanced data utility.
no code implementations • 29 Sep 2021 • Heeseung Kim, Sungwon Kim, Sungroh Yoon
By modeling the unconditional distribution for speech, our model can utilize the untranscribed data for training.
no code implementations • EMNLP 2021 • Jongyoon Song, Sungwon Kim, Sungroh Yoon
Non-autoregressive neural machine translation (NART) models suffer from the multi-modality problem which causes translation inconsistency such as token repetition.
1 code implementation • ICCV 2021 • Jooyoung Choi, Sungwon Kim, Yonghyun Jeong, Youngjune Gwon, Sungroh Yoon
In this work, we propose Iterative Latent Variable Refinement (ILVR), a method to guide the generative process in DDPM to generate high-quality images based on a given reference image.
1 code implementation • NeurIPS 2020 • Sang-gil Lee, Sungwon Kim, Sungroh Yoon
Normalizing flows (NFs) have become a prominent method for deep generative models that allow for an analytic probability density estimation and efficient synthesis.
5 code implementations • NeurIPS 2020 • Jaehyeon Kim, Sungwon Kim, Jungil Kong, Sungroh Yoon
By leveraging the properties of flows, MAS searches for the most probable monotonic alignment between text and the latent representation of speech.
Ranked #4 on
Text-To-Speech Synthesis
on LJSpeech
(using extra training data)
2 code implementations • 6 Nov 2018 • Sungwon Kim, Sang-gil Lee, Jongyoon Song, Sungroh Yoon
Most of modern text-to-speech architectures use a WaveNet vocoder for synthesizing a high-fidelity waveform audio, but there has been a limitation for practical applications due to its slow autoregressive sampling scheme.
Sound Audio and Speech Processing