no code implementations • 24 Apr 2024 • Gwantae Kim, Bokyeung Lee, Donghyeon Kim, Hanseok Ko
To tackle this problem, we propose code-switching speech recognition models that incorporate fine-tuned monolingual and multilingual speech recognition models.
no code implementations • 24 Jan 2024 • Yuanming Li, Gwantae Kim, Jeong-gi Kwak, Bon-hwa Ku, Hanseok Ko
Finally, we fine-tuned a pre-trained face landmark detection model on the synthetic dataset to achieve multi-domain face landmark detection.
no code implementations • CVPR 2024 • Jeong-gi Kwak, Erqun Dong, Yuhe Jin, Hanseok Ko, Shweta Mahajan, Kwang Moo Yi
Thus, to perform novel-view synthesis, we create a smooth camera trajectory to the target view that we wish to render, and denoise using both a view-conditioned diffusion model and a video diffusion model.
no code implementations • 25 May 2023 • Gwantae Kim, Seonghyeok Noh, Insung Ham, Hanseok Ko
Through a series of experiments and human evaluation, we show that the proposed method renders realistic co-speech gestures not only when all input modalities are given but also when input modalities are missing or noisy.
no code implementations • 26 Feb 2023 • Yifan Jiang, Han Chen, Hanseok Ko
In this paper, we introduce a novel data augmentation method for skeleton-based action recognition tasks, which can effectively generate high-quality and diverse sequential actions.
no code implementations • 20 Jan 2023 • Dongsik Yoon, Jeong-gi Kwak, Yuanming Li, David Han, Hanseok Ko
Image inpainting is an old problem in computer vision that restores occluded regions and completes damaged images.
1 code implementation • 19 Jan 2023 • Dongsik Yoon, Jeonggi Kwak, Yuanming Li, David Han, Youngsaeng Jin, Hanseok Ko
Image inpainting is a technique for completing missing pixels, with applications such as occluded region restoration, distracting object removal, and facial completion.
no code implementations • 13 Dec 2022 • Bokyeung Lee, Kyungdeuk Ko, Jonghwan Hong, Hanseok Ko
With these modules, we then employ reinforcement learning to search for an optimal image denoising network at the module level.
no code implementations • 18 Nov 2022 • Gwantae Kim, Youngsuk Ryu, Junyeop Lee, David K. Han, Jeongmin Bae, Hanseok Ko
To achieve this goal, the proposed method predicts the expression from the sentences using a text classification model based on a pretrained language model and generates gestures using a gated recurrent unit (GRU)-based autoregressive model.
no code implementations • 20 Oct 2022 • Donghyeon Kim, Kyungdeuk Ko, David K. Han, Hanseok Ko
In order to train the network for more robust performance in noisy environments, we introduce the LOw Variant Orthogonal (LOVO) loss.
no code implementations • 10 Oct 2022 • Han Chen, Yifan Jiang, Hanseok Ko
Graph convolutional networks (GCNs), which can model the human body skeletons as spatial and temporal graphs, have shown remarkable potential in skeleton-based action recognition.
no code implementations • 24 Sep 2022 • Yuanming Li, Jeong-gi Kwak, David Han, Hanseok Ko
Our model relies on a pretrained StyleGAN and is trained in a self-supervised manner without any manual annotations or datasets.
1 code implementation • 21 Jul 2022 • Jeong-gi Kwak, Yuanming Li, Dongsik Yoon, Donghyeon Kim, David Han, Hanseok Ko
To alleviate this issue, many 3D-aware GANs have been proposed and have shown notable results, but 3D GANs struggle with editing semantic attributes.
no code implementations • 6 May 2022 • Jeong-gi Kwak, Yuanming Li, Dongsik Yoon, David Han, Hanseok Ko
Although progress in generative models enables the stylization of a portrait, obtaining the stylized image in a canonical view is still a challenging task.
no code implementations • 3 May 2022 • Donghyeon Kim, Gwantae Kim, Bokyeung Lee, Jeong-gi Kwak, David K. Han, Hanseok Ko
However, the performance of the dynamic filter might be degraded since simple feature pooling is used to reduce the computational resource in the IDF part.
1 code implementation • 8 Dec 2021 • Jeong-gi Kwak, Youngsaeng Jin, Yuanming Li, Dongsik Yoon, Donghyeon Kim, Hanseok Ko
To address this issue, we propose a novel GAN model, i.e., AU-GAN, which has an asymmetric architecture for adverse domain translation.
no code implementations • 19 Nov 2021 • Han Chen, Yifan Jiang, Hanseok Ko
Due to its fast processing speed and robustness, skeleton-based action recognition has recently received attention from the computer vision community.
no code implementations • 13 Oct 2021 • Han Chen, Yifan Jiang, Hanseok Ko, Murray Loew
Automatic segmentation of infected regions in computed tomography (CT) images is necessary for the initial diagnosis of COVID-19.
no code implementations • 23 Sep 2021 • Donghyeon Kim, Kyungdeuk Ko, Jeonggi Kwak, David K. Han, Hanseok Ko
Keyword Spotting (KWS) from speech signals is widely applied to perform fully hands-free speech recognition.
1 code implementation • 16 Aug 2021 • Ange Lou, Shuyue Guan, Hanseok Ko, Murray Loew
Segmenting medical images accurately and reliably is important for disease diagnosis and treatment.
Ranked #14 on Medical Image Segmentation on ETIS-LARIBPOLYPDB
1 code implementation • 12 Aug 2021 • Youngsaeng Jin, David K. Han, Hanseok Ko
In this paper, a built-in memory module for semantic segmentation is proposed to overcome these problems.
1 code implementation • 10 Aug 2021 • Youngsaeng Jin, Jonghwan Hong, David Han, Hanseok Ko
Anomaly detection in video streams is a challenging problem because of the scarcity of abnormal events and the difficulty of accurately annotating them.
no code implementations • 1 Feb 2021 • Yifan Jiang, Han Chen, David K. Han, Hanseok Ko
To compensate for the sparseness of labeled data, the proposed method utilizes a large amount of synthetic COVID-19 CT images and adjusts the networks from the source domain (synthetic data) to the target domain (real data) with a cross-domain training mechanism.
1 code implementation • ECCV 2020 • Jeong-gi Kwak, David K. Han, Hanseok Ko
The goal of face attribute editing is altering a facial image according to given target attributes such as hair color, mustache, gender, etc.
no code implementations • 23 Nov 2020 • Han Chen, Yifan Jiang, Murray Loew, Hanseok Ko
In this paper, we propose a segmentation network based on unsupervised domain adaptation to improve the segmentation performance of the infection areas in COVID-19 CT images.
no code implementations • 29 Jul 2020 • Yifan Jiang, Han Chen, Murray Loew, Hanseok Ko
However, training a deep-learning model requires large volumes of data, and medical staff face a high risk when collecting COVID-19 CT data due to the high infectivity of the disease.
1 code implementation • 27 May 2020 • Shuyue Guan, Murray Loew, Hanseok Ko
In machine learning, the performance of a classifier depends on both the classifier model and the dataset.
5 code implementations • 5 May 2020 • Andreas Lugmayr, Martin Danelljan, Radu Timofte, Namhyuk Ahn, Dongwoon Bai, Jie Cai, Yun Cao, Junyang Chen, Kaihua Cheng, SeYoung Chun, Wei Deng, Mostafa El-Khamy, Chiu Man Ho, Xiaozhong Ji, Amin Kheradmand, Gwantae Kim, Hanseok Ko, Kanghyu Lee, Jungwon Lee, Hao Li, Ziluan Liu, Zhi-Song Liu, Shuai Liu, Yunhua Lu, Zibo Meng, Pablo Navarrete Michelini, Christian Micheloni, Kalpesh Prajapati, Haoyu Ren, Yong Hyeok Seo, Wan-Chi Siu, Kyung-Ah Sohn, Ying Tai, Rao Muhammad Umer, Shuangquan Wang, Huibing Wang, Timothy Haoning Wu, Hao-Ning Wu, Biao Yang, Fuzhi Yang, Jaejun Yoo, Tongtong Zhao, Yuanbo Zhou, Haijie Zhuo, Ziyao Zong, Xueyi Zou
This paper reviews the NTIRE 2020 challenge on real world super-resolution.
no code implementations • 26 Jul 2019 • Alzahra Badi, Sangwook Park, David K. Han, Hanseok Ko
The performance of learning-based Automatic Speech Recognition (ASR) is susceptible to noise, especially when the noise appears in the testing data but is not present in the training data.
no code implementations • 7 Jan 2019 • Sangwook Park, David K. Han, Hanseok Ko
Audio waveform generation can then be performed using the proposed network.
no code implementations • 3 Feb 2017 • Suwon Shon, Hanseok Ko
Using the development dataset, which is spoken in Cebuano and Mandarin, we were able to prepare the evaluation trials through preliminary experiments to compensate for the language-mismatched condition.
no code implementations • 21 Sep 2016 • Suwon Shon, Seongkyu Mun, John H. L. Hansen, Hanseok Ko
The experimental results show that the use of duration and score fusion improves language recognition performance by 5% relative in terms of the LRiMLC15 cost.