Search Results for author: Sang-gil Lee

Found 10 papers, 7 papers with code

Edit-A-Video: Single Video Editing with Object-Aware Consistency

no code implementations • 14 Mar 2023 • Chaehun Shin, Heeseung Kim, Che Hyun Lee, Sang-gil Lee, Sungroh Yoon

Although text-to-video (TTV) models have recently achieved remarkable success, few approaches extend TTV to video editing.

Video Editing

BigVGAN: A Universal Neural Vocoder with Large-Scale Training

3 code implementations • 9 Jun 2022 • Sang-gil Lee, Wei Ping, Boris Ginsburg, Bryan Catanzaro, Sungroh Yoon

Despite recent progress in generative adversarial network (GAN)-based vocoders, where the model generates raw waveform conditioned on acoustic features, it is challenging to synthesize high-fidelity audio for numerous speakers across various recording environments.

Ranked #5 on Speech Synthesis on LibriTTS

Audio Generation Audio Synthesis +4

1,070

Robust End-to-End Focal Liver Lesion Detection using Unregistered Multiphase Computed Tomography Images

2 code implementations • 2 Dec 2021 • Sang-gil Lee, Eunji Kim, Jae Seok Bae, Jung Hoon Kim, Sungroh Yoon

The computer-aided diagnosis of focal liver lesions (FLLs) can help improve workflow and enable correct diagnoses; FLL detection is the first step in such a computer-aided diagnosis.

Automatic Liver And Tumor Segmentation Computed Tomography (CT) +4

PriorGrad: Improving Conditional Denoising Diffusion Models with Data-Dependent Adaptive Prior

1 code implementation • ICLR 2022 • Sang-gil Lee, Heeseung Kim, Chaehun Shin, Xu Tan, Chang Liu, Qi Meng, Tao Qin, Wei Chen, Sungroh Yoon, Tie-Yan Liu

Denoising diffusion probabilistic models have been recently proposed to generate high-quality samples by estimating the gradient of the data density.

Audio Generation Denoising +2

1,286

NanoFlow: Scalable Normalizing Flows with Sublinear Parameter Complexity

1 code implementation • NeurIPS 2020 • Sang-gil Lee, Sungwon Kim, Sungroh Yoon

Normalizing flows (NFs) have become a prominent method for deep generative models that allow for an analytic probability density estimation and efficient synthesis.

Density Estimation Normalising Flows +1

One-Shot Learning for Text-to-SQL Generation

no code implementations • 26 Apr 2019 • Dongjun Lee, Jaesik Yoon, Jongyun Song, Sang-gil Lee, Sungroh Yoon

We show that our model outperforms state-of-the-art approaches for various text-to-SQL datasets in two aspects: 1) the SQL generation accuracy for the trained templates, and 2) the adaptability to the unseen SQL templates based on a single example without any additional training.

One-Shot Learning Text-To-SQL

FloWaveNet : A Generative Flow for Raw Audio

2 code implementations • 6 Nov 2018 • Sungwon Kim, Sang-gil Lee, Jongyoon Song, Sungroh Yoon

Most modern text-to-speech architectures use a WaveNet vocoder to synthesize high-fidelity waveform audio, but its slow autoregressive sampling scheme limits practical applications.

Sound Audio and Speech Processing

493

Liver Lesion Detection from Weakly-labeled Multi-phase CT Volumes with a Grouped Single Shot MultiBox Detector

1 code implementation • 2 Jul 2018 • Sang-gil Lee, Jae Seok Bae, Hyunjae Kim, Jung Hoon Kim, Sungroh Yoon

We present a focal liver lesion detection model leveraged by custom-designed multi-phase computed tomography (CT) volumes, which reflects real-world clinical lesion detection practice using a Single Shot MultiBox Detector (SSD).

Computed Tomography (CT) Lesion Detection +2

Polyphonic Music Generation with Sequence Generative Adversarial Networks

1 code implementation • 31 Oct 2017 • Sang-gil Lee, Uiwon Hwang, Seonwoo Min, Sungroh Yoon

We propose an application of sequence generative adversarial networks (SeqGAN), which are generative adversarial networks for discrete sequence generation, for creating polyphonic musical sequences.

Sound Audio and Speech Processing

An Efficient Approach to Boosting Performance of Deep Spiking Network Training

no code implementations • 8 Nov 2016 • Seongsik Park, Sang-gil Lee, Hyunha Nam, Sungroh Yoon

To eliminate this workaround, a new class of SNN named deep spiking networks (DSNs) has recently been proposed; DSNs can be trained directly (without a mapping from conventional deep networks) by error backpropagation with stochastic gradient descent.
