Search Results for author: Hanseok Ko

Found 31 papers, 9 papers with code

Towards Multi-domain Face Landmark Detection with Synthetic Data from Diffusion model

no code implementations24 Jan 2024 Yuanming Li, Gwantae Kim, Jeong-gi Kwak, Bon-hwa Ku, Hanseok Ko

Finally, we fine-tuned a pre-trained face landmark detection model on the synthetic dataset to achieve multi-domain face landmark detection.

Caricature Face Generation +1

ViVid-1-to-3: Novel View Synthesis with Video Diffusion Models

no code implementations3 Dec 2023 Jeong-gi Kwak, Erqun Dong, Yuhe Jin, Hanseok Ko, Shweta Mahajan, Kwang Moo Yi

Thus, to perform novel-view synthesis, we create a smooth camera trajectory to the target view that we wish to render, and denoise using both a view-conditioned diffusion model and a video diffusion model.

Novel View Synthesis Object

MPE4G: Multimodal Pretrained Encoder for Co-Speech Gesture Generation

no code implementations25 May 2023 Gwantae Kim, Seonghyeok Noh, Insung Ham, Hanseok Ko

Through the series of experiments and human evaluation, the proposed method renders realistic co-speech gestures not only when all input modalities are given but also when the input modalities are missing or noisy.

Gesture Generation Self-Supervised Learning

Spatial-temporal Transformer-guided Diffusion based Data Augmentation for Efficient Skeleton-based Action Recognition

no code implementations26 Feb 2023 Yifan Jiang, Han Chen, Hanseok Ko

In this paper, we introduce a novel data augmentation method for skeleton-based action recognition tasks, which can effectively generate high-quality and diverse sequential actions.

Action Recognition Data Augmentation +2

DIFAI: Diverse Facial Inpainting using StyleGAN Inversion

no code implementations20 Jan 2023 Dongsik Yoon, Jeong-gi Kwak, Yuanming Li, David Han, Hanseok Ko

Image inpainting is an old problem in computer vision that restores occluded regions and completes damaged images.

Facial Inpainting

Reference Guided Image Inpainting using Facial Attributes

1 code implementation19 Jan 2023 Dongsik Yoon, Jeonggi Kwak, Yuanming Li, David Han, Youngsaeng Jin, Hanseok Ko

Image inpainting is a technique of completing missing pixels such as occluded region restoration, distracting objects removal, and facial completion.

Attribute Facial Inpainting +2

Single Cell Training on Architecture Search for Image Denoising

no code implementations13 Dec 2022 Bokyeung Lee, Kyungdeuk Ko, Jonghwan Hong, Hanseok Ko

With these modules, then we employ reinforcement learning in search of an optimal image denoising network at a module level.

Computational Efficiency Image Denoising +4

3d human motion generation from the text via gesture action classification and the autoregressive model

no code implementations18 Nov 2022 Gwantae Kim, Youngsuk Ryu, Junyeop Lee, David K. Han, Jeongmin Bae, Hanseok Ko

To achieve the goal, the proposed method predicts expression from the sentences using a text classification model based on a pretrained language model and generates gestures using the gate recurrent unit-based autoregressive model.

Action Classification Action Recognition +4

Discriminatory and orthogonal feature learning for noise robust keyword spotting

no code implementations20 Oct 2022 Donghyeon Kim, Kyungdeuk Ko, David K. Han, Hanseok Ko

In order to train the network for more robust performance in noisy environments, we introduce the LOw Variant Orthogonal (LOVO) loss.

Keyword Spotting

Pose-Guided Graph Convolutional Networks for Skeleton-Based Action Recognition

no code implementations10 Oct 2022 Han Chen, Yifan Jiang, Hanseok Ko

Graph convolutional networks (GCNs), which can model the human body skeletons as spatial and temporal graphs, have shown remarkable potential in skeleton-based action recognition.

Action Recognition Skeleton Based Action Recognition +1

Controllable Face Manipulation and UV Map Generation by Self-supervised Learning

no code implementations24 Sep 2022 Yuanming Li, Jeong-gi Kwak, David Han, Hanseok Ko

Our model relies on pretrained StyleGAN, and the proposed model is trained in a self-supervised manner without any manual annotations or datasets.

Attribute Self-Supervised Learning

Injecting 3D Perception of Controllable NeRF-GAN into StyleGAN for Editable Portrait Image Synthesis

1 code implementation21 Jul 2022 Jeong-gi Kwak, Yuanming Li, Dongsik Yoon, Donghyeon Kim, David Han, Hanseok Ko

To alleviate the issue, many 3D-aware GANs have been proposed and shown notable results, but 3D GANs struggle with editing semantic attributes.

Image Generation

Generate and Edit Your Own Character in a Canonical View

no code implementations6 May 2022 Jeong-gi Kwak, Yuanming Li, Dongsik Yoon, David Han, Hanseok Ko

Although the progress of generative models enables the stylization of a portrait, obtaining the stylized image in canonical view is still a challenging task.

Efficient dynamic filter for robust and low computational feature extraction

no code implementations3 May 2022 Donghyeon Kim, Gwantae Kim, Bokyeung Lee, Jeong-gi Kwak, David K. Han, Hanseok Ko

However, the performance of the dynamic filter might be degraded since simple feature pooling is used to reduce the computational resource in the IDF part.

Keyword Spotting Speaker Verification

Adverse Weather Image Translation with Asymmetric and Uncertainty-aware GAN

1 code implementation8 Dec 2021 Jeong-gi Kwak, Youngsaeng Jin, Yuanming Li, Dongsik Yoon, Donghyeon Kim, Hanseok Ko

To address this issue, we propose a novel GAN model, i. e., AU-GAN, which has an asymmetric architecture for adverse domain translation.

Disentanglement Translation

Action Recognition with Domain Invariant Features of Skeleton Image

no code implementations19 Nov 2021 Han Chen, Yifan Jiang, Hanseok Ko

Due to the fast processing-speed and robustness it can achieve, skeleton-based action recognition has recently received the attention of the computer vision community.

Action Recognition Skeleton Based Action Recognition

A Teacher-Student Framework with Fourier Augmentation for COVID-19 Infection Segmentation in CT Images

no code implementations13 Oct 2021 Han Chen, Yifan Jiang, Hanseok Ko, Murray Loew

Automatic segmentation of infected regions in computed tomography (CT) images is necessary for the initial diagnosis of COVID-19.

Computed Tomography (CT) Segmentation

A Lightweight dynamic filter for keyword spotting

no code implementations23 Sep 2021 Donghyeon Kim, Kyungdeuk Ko, Jeonggi Kwak, David K. Han, Hanseok Ko

Keyword Spotting (KWS) from speech signals is widely applied to perform fully hands-free speech recognition.

Keyword Spotting speech-recognition +1

Memory-based Semantic Segmentation for Off-road Unstructured Natural Environments

1 code implementation12 Aug 2021 Youngsaeng Jin, David K. Han, Hanseok Ko

In this paper, a built-in memory module for semantic segmentation is proposed to overcome these problems.

Autonomous Driving Segmentation +1

CPNet: Cross-Parallel Network for Efficient Anomaly Detection

1 code implementation10 Aug 2021 Youngsaeng Jin, Jonghwan Hong, David Han, Hanseok Ko

Anomaly detection in video streams is a challenging problem because of the scarcity of abnormal events and the difficulty of accurately annotating them.

Anomaly Detection

Few-shot Learning for CT Scan based COVID-19 Diagnosis

no code implementations1 Feb 2021 Yifan Jiang, Han Chen, David K. Han, Hanseok Ko

To compensate for the sparseness of labeled data, the proposed method utilizes a large amount of synthetic COVID-19 CT images and adjusts the networks from the source domain (synthetic data) to the target domain (real data) with a cross-domain training mechanism.

Computed Tomography (CT) COVID-19 Diagnosis +2

CAFE-GAN: Arbitrary Face Attribute Editing with Complementary Attention Feature

1 code implementation ECCV 2020 Jeong-gi Kwak, David K. Han, Hanseok Ko

The goal of face attribute editing is altering a facial image according to given target attributes such as hair color, mustache, gender, etc.

Attribute Generative Adversarial Network

Unsupervised domain adaptation based COVID-19 CT infection segmentation network

no code implementations23 Nov 2020 Han Chen, Yifan Jiang, Murray Loew, Hanseok Ko

In this paper, we propose an unsupervised domain adaptation based segmentation network to improve the segmentation performance of the infection areas in COVID-19 CT images.

Computed Tomography (CT) Segmentation +1

COVID-19 CT Image Synthesis with a Conditional Generative Adversarial Network

no code implementations29 Jul 2020 Yifan Jiang, Han Chen, Murray Loew, Hanseok Ko

However, training a deep-learning model requires large volumes of data, and medical staff faces a high risk when collecting COVID-19 CT data due to the high infectivity of the disease.

Computed Tomography (CT) COVID-19 Diagnosis +3

Data Separability for Neural Network Classifiers and the Development of a Separability Index

1 code implementation27 May 2020 Shuyue Guan, Murray Loew, Hanseok Ko

In machine learning, the performance of a classifier depends on both the classifier model and the dataset.

BIG-bench Machine Learning

Correlation Distance Skip Connection Denoising Autoencoder (CDSK-DAE) for Speech Feature Enhancement

no code implementations26 Jul 2019 Alzahra Badi, Sangwook Park, David K. Han, Hanseok Ko

Performance of learning based Automatic Speech Recognition (ASR) is susceptible to noise, especially when it is introduced in the testing data while not presented in the training data.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

KU-ISPL Speaker Recognition Systems under Language mismatch condition for NIST 2016 Speaker Recognition Evaluation

no code implementations3 Feb 2017 Suwon Shon, Hanseok Ko

As development dataset which is spoken in Cebuano and Mandarin, we could prepare the evaluation trials through preliminary experiments to compensate the language mismatched condition.

Clustering Speaker Recognition

KU-ISPL Language Recognition System for NIST 2015 i-Vector Machine Learning Challenge

no code implementations21 Sep 2016 Suwon Shon, Seongkyu Mun, John H. L. Hansen, Hanseok Ko

The experimental results show that the use of duration and score fusion improves language recognition performance by 5% relative in LRiMLC15 cost.

BIG-bench Machine Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.