Search Results for author: David K. Han

Found 10 papers, 3 papers with code

3d human motion generation from the text via gesture action classification and the autoregressive model

no code implementations18 Nov 2022 Gwantae Kim, Youngsuk Ryu, Junyeop Lee, David K. Han, Jeongmin Bae, Hanseok Ko

To achieve the goal, the proposed method predicts expression from the sentences using a text classification model based on a pretrained language model and generates gestures using the gate recurrent unit-based autoregressive model.

Action Classification Action Recognition +4

Discriminatory and orthogonal feature learning for noise robust keyword spotting

no code implementations20 Oct 2022 Donghyeon Kim, Kyungdeuk Ko, David K. Han, Hanseok Ko

In order to train the network for more robust performance in noisy environments, we introduce the LOw Variant Orthogonal (LOVO) loss.

Keyword Spotting

Efficient dynamic filter for robust and low computational feature extraction

no code implementations3 May 2022 Donghyeon Kim, Gwantae Kim, Bokyeung Lee, Jeong-gi Kwak, David K. Han, Hanseok Ko

However, the performance of the dynamic filter might be degraded since simple feature pooling is used to reduce the computational resource in the IDF part.

Keyword Spotting Speaker Verification

A Lightweight dynamic filter for keyword spotting

no code implementations23 Sep 2021 Donghyeon Kim, Kyungdeuk Ko, Jeonggi Kwak, David K. Han, Hanseok Ko

Keyword Spotting (KWS) from speech signals is widely applied to perform fully hands-free speech recognition.

Keyword Spotting speech-recognition +1

Memory-based Semantic Segmentation for Off-road Unstructured Natural Environments

1 code implementation12 Aug 2021 Youngsaeng Jin, David K. Han, Hanseok Ko

In this paper, a built-in memory module for semantic segmentation is proposed to overcome these problems.

Autonomous Driving Segmentation +1

Cross-Referencing Self-Training Network for Sound Event Detection in Audio Mixtures

1 code implementation27 May 2021 Sangwook Park, David K. Han, Mounya Elhilali

Sound event detection is an important facet of audio tagging that aims to identify sounds of interest and define both the sound category and time boundaries for each sound event in a continuous recording.

Audio Tagging Event Detection +1

Few-shot Learning for CT Scan based COVID-19 Diagnosis

no code implementations1 Feb 2021 Yifan Jiang, Han Chen, David K. Han, Hanseok Ko

To compensate for the sparseness of labeled data, the proposed method utilizes a large amount of synthetic COVID-19 CT images and adjusts the networks from the source domain (synthetic data) to the target domain (real data) with a cross-domain training mechanism.

Computed Tomography (CT) COVID-19 Diagnosis +2

CAFE-GAN: Arbitrary Face Attribute Editing with Complementary Attention Feature

1 code implementation ECCV 2020 Jeong-gi Kwak, David K. Han, Hanseok Ko

The goal of face attribute editing is altering a facial image according to given target attributes such as hair color, mustache, gender, etc.

Attribute Generative Adversarial Network

Correlation Distance Skip Connection Denoising Autoencoder (CDSK-DAE) for Speech Feature Enhancement

no code implementations26 Jul 2019 Alzahra Badi, Sangwook Park, David K. Han, Hanseok Ko

Performance of learning based Automatic Speech Recognition (ASR) is susceptible to noise, especially when it is introduced in the testing data while not presented in the training data.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Cannot find the paper you are looking for? You can Submit a new open access paper.