Search Results for author: Sang-gil Lee

Found 10 papers, 7 papers with code

Edit-A-Video: Single Video Editing with Object-Aware Consistency

no code implementations 14 Mar 2023 Chaehun Shin, Heeseung Kim, Che Hyun Lee, Sang-gil Lee, Sungroh Yoon

Although text-to-video (TTV) models have recently achieved remarkable success, few approaches have extended TTV to video editing.

Video Editing

BigVGAN: A Universal Neural Vocoder with Large-Scale Training

3 code implementations 9 Jun 2022 Sang-gil Lee, Wei Ping, Boris Ginsburg, Bryan Catanzaro, Sungroh Yoon

Despite recent progress in generative adversarial network (GAN)-based vocoders, where the model generates raw waveform conditioned on acoustic features, it is challenging to synthesize high-fidelity audio for numerous speakers across various recording environments.

Audio Generation, Audio Synthesis, +4 more

Robust End-to-End Focal Liver Lesion Detection using Unregistered Multiphase Computed Tomography Images

2 code implementations 2 Dec 2021 Sang-gil Lee, Eunji Kim, Jae Seok Bae, Jung Hoon Kim, Sungroh Yoon

The computer-aided diagnosis of focal liver lesions (FLLs) can help improve workflow and enable correct diagnoses; FLL detection is the first step in such a computer-aided diagnosis.

Automatic Liver And Tumor Segmentation, Computed Tomography (CT), +4 more

NanoFlow: Scalable Normalizing Flows with Sublinear Parameter Complexity

1 code implementation NeurIPS 2020 Sang-gil Lee, Sungwon Kim, Sungroh Yoon

Normalizing flows (NFs) have become a prominent method for deep generative models that allow for an analytic probability density estimation and efficient synthesis.

Density Estimation, Normalising Flows, +1 more

One-Shot Learning for Text-to-SQL Generation

no code implementations 26 Apr 2019 Dongjun Lee, Jaesik Yoon, Jongyun Song, Sang-gil Lee, Sungroh Yoon

We show that our model outperforms state-of-the-art approaches for various text-to-SQL datasets in two aspects: 1) the SQL generation accuracy for the trained templates, and 2) the adaptability to the unseen SQL templates based on a single example without any additional training.

One-Shot Learning, Text-To-SQL

FloWaveNet : A Generative Flow for Raw Audio

2 code implementations 6 Nov 2018 Sungwon Kim, Sang-gil Lee, Jongyoon Song, Sungroh Yoon

Most modern text-to-speech architectures use a WaveNet vocoder to synthesize high-fidelity waveform audio, but its slow autoregressive sampling scheme limits practical applications.

Sound, Audio and Speech Processing

Liver Lesion Detection from Weakly-labeled Multi-phase CT Volumes with a Grouped Single Shot MultiBox Detector

1 code implementation 2 Jul 2018 Sang-gil Lee, Jae Seok Bae, Hyunjae Kim, Jung Hoon Kim, Sungroh Yoon

We present a focal liver lesion detection model leveraged by custom-designed multi-phase computed tomography (CT) volumes, which reflects real-world clinical lesion detection practice using a Single Shot MultiBox Detector (SSD).

Computed Tomography (CT), Lesion Detection, +2 more

Polyphonic Music Generation with Sequence Generative Adversarial Networks

1 code implementation 31 Oct 2017 Sang-gil Lee, Uiwon Hwang, Seonwoo Min, Sungroh Yoon

We propose an application of sequence generative adversarial networks (SeqGAN), which are generative adversarial networks for discrete sequence generation, for creating polyphonic musical sequences.

Sound, Audio and Speech Processing

An Efficient Approach to Boosting Performance of Deep Spiking Network Training

no code implementations 8 Nov 2016 Seongsik Park, Sang-gil Lee, Hyunha Nam, Sungroh Yoon

To eliminate this workaround, a new class of spiking neural networks (SNNs) named deep spiking networks (DSNs) was recently proposed; DSNs can be trained directly (without a mapping from conventional deep networks) by error backpropagation with stochastic gradient descent.
