no code implementations • 18 Nov 2022 • Gwantae Kim, Youngsuk Ryu, Junyeop Lee, David K. Han, Jeongmin Bae, Hanseok Ko
To achieve the goal, the proposed method predicts expression from the sentences using a text classification model based on a pretrained language model and generates gestures using a gated recurrent unit (GRU)-based autoregressive model.
no code implementations • 20 Oct 2022 • Donghyeon Kim, Kyungdeuk Ko, David K. Han, Hanseok Ko
In order to train the network for more robust performance in noisy environments, we introduce the LOw Variant Orthogonal (LOVO) loss.
no code implementations • 3 May 2022 • Donghyeon Kim, Gwantae Kim, Bokyeung Lee, Jeong-gi Kwak, David K. Han, Hanseok Ko
However, the performance of the dynamic filter might be degraded, since simple feature pooling is used to reduce the computational cost of the IDF part.
no code implementations • 23 Sep 2021 • Donghyeon Kim, Kyungdeuk Ko, Jeonggi Kwak, David K. Han, Hanseok Ko
Keyword Spotting (KWS) from speech signals is widely applied to perform fully hands-free speech recognition.
1 code implementation • 12 Aug 2021 • Youngsaeng Jin, David K. Han, Hanseok Ko
In this paper, a built-in memory module for semantic segmentation is proposed to overcome these problems.
1 code implementation • 27 May 2021 • Sangwook Park, David K. Han, Mounya Elhilali
Sound event detection is an important facet of audio tagging that aims to identify sounds of interest and define both the sound category and time boundaries for each sound event in a continuous recording.
no code implementations • 1 Feb 2021 • Yifan Jiang, Han Chen, David K. Han, Hanseok Ko
To compensate for the sparseness of labeled data, the proposed method utilizes a large amount of synthetic COVID-19 CT images and adjusts the networks from the source domain (synthetic data) to the target domain (real data) with a cross-domain training mechanism.
1 code implementation • ECCV 2020 • Jeong-gi Kwak, David K. Han, Hanseok Ko
The goal of face attribute editing is altering a facial image according to given target attributes such as hair color, mustache, gender, etc.
no code implementations • 26 Jul 2019 • Alzahra Badi, Sangwook Park, David K. Han, Hanseok Ko
The performance of learning-based Automatic Speech Recognition (ASR) is susceptible to noise, especially when the noise appears in the test data but is absent from the training data.
no code implementations • 7 Jan 2019 • Sangwook Park, David K. Han, Hanseok Ko
Audio waveform generation can then be performed using the proposed network.