Search Results for author: Kyuwoong Hwang

Found 7 papers, 0 papers with code

Knowledge Distillation from Non-streaming to Streaming ASR Encoder using Auxiliary Non-streaming Layer

no code implementations31 Aug 2023 Kyuhong Shim, Jinkyu Lee, Simyung Chang, Kyuwoong Hwang

Streaming automatic speech recognition (ASR) models are restricted from accessing future context, which results in worse performance compared to the non-streaming models.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

SubSpectral Normalization for Neural Audio Data Processing

no code implementations25 Mar 2021 Simyung Chang, Hyoungwoo Park, Janghoon Cho, Hyunsin Park, Sungrack Yun, Kyuwoong Hwang

In this work, we introduce SubSpectral Normalization (SSN), which splits the input frequency dimension into several groups (sub-bands) and performs a different normalization for each group.

Keyword Spotting

Acoustic Scene Classification Based on a Large-margin Factorized CNN

no code implementations14 Oct 2019 Janghoon Cho, Sungrack Yun, Hyoungwoo Park, Jungyun Eum, Kyuwoong Hwang

With this loss function, the samples from the same audio scene are clustered independently of the environment, and thus we can get the classifier with better generalization ability in an unseen environment.

Acoustic Scene Classification Classification +2

Weakly Labeled Sound Event Detection Using Tri-training and Adversarial Learning

no code implementations14 Oct 2019 Hyoungwoo Park, Sungrack Yun, Jungyun Eum, Janghoon Cho, Kyuwoong Hwang

This paper considers a semi-supervised learning framework for weakly labeled polyphonic sound event detection problems for the DCASE 2019 challenge's task4 by combining both the tri-training and adversarial learning.

Event Detection Sound Event Detection

Query-by-example on-device keyword spotting

no code implementations11 Oct 2019 Byeonggeun Kim, Mingu Lee, Jinkyu Lee, Yeonseok Kim, Kyuwoong Hwang

A keyword spotting (KWS) system determines the existence of, usually predefined, keyword in a continuous speech stream.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Orthogonality Constrained Multi-Head Attention For Keyword Spotting

no code implementations10 Oct 2019 Mingu Lee, Jinkyu Lee, Hye Jin Jang, Byeonggeun Kim, Wonil Chang, Kyuwoong Hwang

Augmenting regularization terms which penalize positional and contextual non-orthogonality between the attention heads encourages to output different representations from separate subsequences, which in turn enables leveraging structured information without explicit sequence models such as hidden Markov models.

Keyword Spotting

An End-to-End Text-independent Speaker Verification Framework with a Keyword Adversarial Network

no code implementations6 Aug 2019 Sungrack Yun, Janghoon Cho, Jungyun Eum, Wonil Chang, Kyuwoong Hwang

In training our speaker verification framework, we consider both the triplet loss minimization and adversarial gradient of the ASR network to obtain more discriminative and text-independent speaker embedding vectors.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Cannot find the paper you are looking for? You can Submit a new open access paper.