Search Results for author: Hanseok Ko

Found 31 papers, 9 papers with code

Towards Multi-domain Face Landmark Detection with Synthetic Data from Diffusion model

no code implementations • 24 Jan 2024 • Yuanming Li, Gwantae Kim, Jeong-gi Kwak, Bon-hwa Ku, Hanseok Ko

Finally, we fine-tuned a pre-trained face landmark detection model on the synthetic dataset to achieve multi-domain face landmark detection.

Caricature Face Generation +1

Paper
Add Code

ViVid-1-to-3: Novel View Synthesis with Video Diffusion Models

no code implementations • 3 Dec 2023 • Jeong-gi Kwak, Erqun Dong, Yuhe Jin, Hanseok Ko, Shweta Mahajan, Kwang Moo Yi

Thus, to perform novel-view synthesis, we create a smooth camera trajectory to the target view that we wish to render, and denoise using both a view-conditioned diffusion model and a video diffusion model.

Novel View Synthesis Object

Paper
Add Code

MPE4G: Multimodal Pretrained Encoder for Co-Speech Gesture Generation

no code implementations • 25 May 2023 • Gwantae Kim, Seonghyeok Noh, Insung Ham, Hanseok Ko

Through the series of experiments and human evaluation, the proposed method renders realistic co-speech gestures not only when all input modalities are given but also when the input modalities are missing or noisy.

Gesture Generation Self-Supervised Learning

Paper
Add Code

Spatial-temporal Transformer-guided Diffusion based Data Augmentation for Efficient Skeleton-based Action Recognition

no code implementations • 26 Feb 2023 • Yifan Jiang, Han Chen, Hanseok Ko

In this paper, we introduce a novel data augmentation method for skeleton-based action recognition tasks, which can effectively generate high-quality and diverse sequential actions.

Action Recognition Data Augmentation +2

Paper
Add Code

DIFAI: Diverse Facial Inpainting using StyleGAN Inversion

no code implementations • 20 Jan 2023 • Dongsik Yoon, Jeong-gi Kwak, Yuanming Li, David Han, Hanseok Ko

Image inpainting is an old problem in computer vision that restores occluded regions and completes damaged images.

Facial Inpainting

Paper
Add Code

Reference Guided Image Inpainting using Facial Attributes

1 code implementation • 19 Jan 2023 • Dongsik Yoon, Jeonggi Kwak, Yuanming Li, David Han, Youngsaeng Jin, Hanseok Ko

Image inpainting is a technique of completing missing pixels such as occluded region restoration, distracting objects removal, and facial completion.

Attribute Facial Inpainting +2

Paper
Code

Single Cell Training on Architecture Search for Image Denoising

no code implementations • 13 Dec 2022 • Bokyeung Lee, Kyungdeuk Ko, Jonghwan Hong, Hanseok Ko

With these modules, then we employ reinforcement learning in search of an optimal image denoising network at a module level.

Computational Efficiency Image Denoising +4

Paper
Add Code

3d human motion generation from the text via gesture action classification and the autoregressive model

no code implementations • 18 Nov 2022 • Gwantae Kim, Youngsuk Ryu, Junyeop Lee, David K. Han, Jeongmin Bae, Hanseok Ko

To achieve the goal, the proposed method predicts expression from the sentences using a text classification model based on a pretrained language model and generates gestures using the gate recurrent unit-based autoregressive model.

Action Classification Action Recognition +4

Paper
Add Code

Discriminatory and orthogonal feature learning for noise robust keyword spotting

no code implementations • 20 Oct 2022 • Donghyeon Kim, Kyungdeuk Ko, David K. Han, Hanseok Ko

In order to train the network for more robust performance in noisy environments, we introduce the LOw Variant Orthogonal (LOVO) loss.

Keyword Spotting

Paper
Add Code

Pose-Guided Graph Convolutional Networks for Skeleton-Based Action Recognition

no code implementations • 10 Oct 2022 • Han Chen, Yifan Jiang, Hanseok Ko

Graph convolutional networks (GCNs), which can model the human body skeletons as spatial and temporal graphs, have shown remarkable potential in skeleton-based action recognition.

Action Recognition Skeleton Based Action Recognition +1

Paper
Add Code

Controllable Face Manipulation and UV Map Generation by Self-supervised Learning

no code implementations • 24 Sep 2022 • Yuanming Li, Jeong-gi Kwak, David Han, Hanseok Ko

Our model relies on pretrained StyleGAN, and the proposed model is trained in a self-supervised manner without any manual annotations or datasets.

Attribute Self-Supervised Learning

Paper
Add Code

Injecting 3D Perception of Controllable NeRF-GAN into StyleGAN for Editable Portrait Image Synthesis

1 code implementation • 21 Jul 2022 • Jeong-gi Kwak, Yuanming Li, Dongsik Yoon, Donghyeon Kim, David Han, Hanseok Ko

To alleviate the issue, many 3D-aware GANs have been proposed and shown notable results, but 3D GANs struggle with editing semantic attributes.

Image Generation

121

Paper
Code

Generate and Edit Your Own Character in a Canonical View

no code implementations • 6 May 2022 • Jeong-gi Kwak, Yuanming Li, Dongsik Yoon, David Han, Hanseok Ko

Although the progress of generative models enables the stylization of a portrait, obtaining the stylized image in canonical view is still a challenging task.

Paper
Add Code

Efficient dynamic filter for robust and low computational feature extraction

no code implementations • 3 May 2022 • Donghyeon Kim, Gwantae Kim, Bokyeung Lee, Jeong-gi Kwak, David K. Han, Hanseok Ko

However, the performance of the dynamic filter might be degraded since simple feature pooling is used to reduce the computational resource in the IDF part.

Keyword Spotting Speaker Verification

Paper
Add Code

Adverse Weather Image Translation with Asymmetric and Uncertainty-aware GAN

1 code implementation • 8 Dec 2021 • Jeong-gi Kwak, Youngsaeng Jin, Yuanming Li, Dongsik Yoon, Donghyeon Kim, Hanseok Ko

To address this issue, we propose a novel GAN model, i. e., AU-GAN, which has an asymmetric architecture for adverse domain translation.

Disentanglement Translation

Paper
Code

Action Recognition with Domain Invariant Features of Skeleton Image

no code implementations • 19 Nov 2021 • Han Chen, Yifan Jiang, Hanseok Ko

Due to the fast processing-speed and robustness it can achieve, skeleton-based action recognition has recently received the attention of the computer vision community.

Action Recognition Skeleton Based Action Recognition

Paper
Add Code

A Teacher-Student Framework with Fourier Augmentation for COVID-19 Infection Segmentation in CT Images

no code implementations • 13 Oct 2021 • Han Chen, Yifan Jiang, Hanseok Ko, Murray Loew

Automatic segmentation of infected regions in computed tomography (CT) images is necessary for the initial diagnosis of COVID-19.

Computed Tomography (CT) Segmentation

Paper
Add Code

A Lightweight dynamic filter for keyword spotting

no code implementations • 23 Sep 2021 • Donghyeon Kim, Kyungdeuk Ko, Jeonggi Kwak, David K. Han, Hanseok Ko

Keyword Spotting (KWS) from speech signals is widely applied to perform fully hands-free speech recognition.

Keyword Spotting speech-recognition +1

Paper
Add Code

CaraNet: Context Axial Reverse Attention Network for Segmentation of Small Medical Objects

1 code implementation • 16 Aug 2021 • Ange Lou, Shuyue Guan, Hanseok Ko, Murray Loew

Segmenting medical images accurately and reliably is important for disease diagnosis and treatment.

Ranked #9 on Medical Image Segmentation on ETIS-LARIBPOLYPDB

Medical Image Segmentation Segmentation

453

Paper
Code

Memory-based Semantic Segmentation for Off-road Unstructured Natural Environments

1 code implementation • 12 Aug 2021 • Youngsaeng Jin, David K. Han, Hanseok Ko

In this paper, a built-in memory module for semantic segmentation is proposed to overcome these problems.

Autonomous Driving Segmentation +1

Paper
Code

CPNet: Cross-Parallel Network for Efficient Anomaly Detection

1 code implementation • 10 Aug 2021 • Youngsaeng Jin, Jonghwan Hong, David Han, Hanseok Ko

Anomaly detection in video streams is a challenging problem because of the scarcity of abnormal events and the difficulty of accurately annotating them.

Anomaly Detection

Paper
Code

Few-shot Learning for CT Scan based COVID-19 Diagnosis

no code implementations • 1 Feb 2021 • Yifan Jiang, Han Chen, David K. Han, Hanseok Ko

To compensate for the sparseness of labeled data, the proposed method utilizes a large amount of synthetic COVID-19 CT images and adjusts the networks from the source domain (synthetic data) to the target domain (real data) with a cross-domain training mechanism.

Computed Tomography (CT) COVID-19 Diagnosis +2

Paper
Add Code

CAFE-GAN: Arbitrary Face Attribute Editing with Complementary Attention Feature

1 code implementation • ECCV 2020 • Jeong-gi Kwak, David K. Han, Hanseok Ko

The goal of face attribute editing is altering a facial image according to given target attributes such as hair color, mustache, gender, etc.

Attribute Generative Adversarial Network

Paper
Code

Unsupervised domain adaptation based COVID-19 CT infection segmentation network

no code implementations • 23 Nov 2020 • Han Chen, Yifan Jiang, Murray Loew, Hanseok Ko

In this paper, we propose an unsupervised domain adaptation based segmentation network to improve the segmentation performance of the infection areas in COVID-19 CT images.

Computed Tomography (CT) Segmentation +1

Paper
Add Code

COVID-19 CT Image Synthesis with a Conditional Generative Adversarial Network

no code implementations • 29 Jul 2020 • Yifan Jiang, Han Chen, Murray Loew, Hanseok Ko

However, training a deep-learning model requires large volumes of data, and medical staff faces a high risk when collecting COVID-19 CT data due to the high infectivity of the disease.

Computed Tomography (CT) COVID-19 Diagnosis +3

Paper
Add Code

Data Separability for Neural Network Classifiers and the Development of a Separability Index

1 code implementation • 27 May 2020 • Shuyue Guan, Murray Loew, Hanseok Ko

In machine learning, the performance of a classifier depends on both the classifier model and the dataset.

BIG-bench Machine Learning

Paper
Code

NTIRE 2020 Challenge on Real-World Image Super-Resolution: Methods and Results

5 code implementations • 5 May 2020 • Andreas Lugmayr, Martin Danelljan, Radu Timofte, Namhyuk Ahn, Dongwoon Bai, Jie Cai, Yun Cao, Junyang Chen, Kaihua Cheng, SeYoung Chun, Wei Deng, Mostafa El-Khamy, Chiu Man Ho, Xiaozhong Ji, Amin Kheradmand, Gwantae Kim, Hanseok Ko, Kanghyu Lee, Jungwon Lee, Hao Li, Ziluan Liu, Zhi-Song Liu, Shuai Liu, Yunhua Lu, Zibo Meng, Pablo Navarrete Michelini, Christian Micheloni, Kalpesh Prajapati, Haoyu Ren, Yong Hyeok Seo, Wan-Chi Siu, Kyung-Ah Sohn, Ying Tai, Rao Muhammad Umer, Shuangquan Wang, Huibing Wang, Timothy Haoning Wu, Hao-Ning Wu, Biao Yang, Fuzhi Yang, Jaejun Yoo, Tongtong Zhao, Yuanbo Zhou, Haijie Zhuo, Ziyao Zong, Xueyi Zou

This paper reviews the NTIRE 2020 challenge on real world super-resolution.

Benchmarking Image Super-Resolution

1,048

Paper
Code

Correlation Distance Skip Connection Denoising Autoencoder (CDSK-DAE) for Speech Feature Enhancement

no code implementations • 26 Jul 2019 • Alzahra Badi, Sangwook Park, David K. Han, Hanseok Ko

Performance of learning based Automatic Speech Recognition (ASR) is susceptible to noise, especially when it is introduced in the testing data while not presented in the training data.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

Sinusoidal wave generating network based on adversarial learning and its application: synthesizing frog sounds for data augmentation

no code implementations • 7 Jan 2019 • Sangwook Park, David K. Han, Hanseok Ko

Audio waveform generation can then be performed using the proposed network.

Data Augmentation Sound Classification

Paper
Add Code

KU-ISPL Speaker Recognition Systems under Language mismatch condition for NIST 2016 Speaker Recognition Evaluation

no code implementations • 3 Feb 2017 • Suwon Shon, Hanseok Ko

As development dataset which is spoken in Cebuano and Mandarin, we could prepare the evaluation trials through preliminary experiments to compensate the language mismatched condition.

Clustering Speaker Recognition

Paper
Add Code

KU-ISPL Language Recognition System for NIST 2015 i-Vector Machine Learning Challenge

no code implementations • 21 Sep 2016 • Suwon Shon, Seongkyu Mun, John H. L. Hansen, Hanseok Ko

The experimental results show that the use of duration and score fusion improves language recognition performance by 5% relative in LRiMLC15 cost.

BIG-bench Machine Learning

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.