Search Results for author: Seong-Hu Kim

Found 8 papers, 6 papers with code

Data Augmentation and Squeeze-and-Excitation Network on Multiple Dimension for Sound Event Localization and Detection in Real Scenes

no code implementations • 24 Jun 2022 • Byeong-Yun Ko, Hyeonuk Nam, Seong-Hu Kim, Deokki Min, Seung-Deok Choi, Yong-Hwa Park

Performance of sound event localization and detection (SELD) in real scenes is limited by small size of SELD dataset, due to difficulty in obtaining sufficient amount of realistic multi-channel audio data recordings with accurate label.

Data Augmentation Sound Event Localization and Detection

Paper
Add Code

Frequency Dependent Sound Event Detection for DCASE 2022 Challenge Task 4

1 code implementation • 23 Jun 2022 • Hyeonuk Nam, Seong-Hu Kim, Deokki Min, Byeong-Yun Ko, Seung-Deok Choi, Yong-Hwa Park

While many deep learning methods on other domains have been applied to sound event detection (SED), differences between original domains of the methods and SED have not been appropriately considered so far.

Event Detection Sound Event Detection

Paper
Code

Frequency Dynamic Convolution: Frequency-Adaptive Pattern Recognition for Sound Event Detection

1 code implementation • 29 Mar 2022 • Hyeonuk Nam, Seong-Hu Kim, Byeong-Yun Ko, Yong-Hwa Park

2D convolution is widely used in sound event detection (SED) to recognize two dimensional time-frequency patterns of sound events.

Ranked #3 on Sound Event Detection on DESED

Event Detection Sound Event Detection +1

Paper
Code

Decomposed Temporal Dynamic CNN: Efficient Time-Adaptive Network for Text-Independent Speaker Verification Explained with Speaker Activation Map

1 code implementation • 29 Mar 2022 • Seong-Hu Kim, Hyeonuk Nam, Yong-Hwa Park

To extract accurate speaker information for text-independent speaker verification, temporal dynamic CNNs (TDY-CNNs) adapting kernels to each time bin was proposed.

Data Augmentation Text-Independent Speaker Verification

Paper
Code

Temporal Dynamic Convolutional Neural Network for Text-Independent Speaker Verification and Phonemetic Analysis

1 code implementation • 7 Oct 2021 • Seong-Hu Kim, Hyeonuk Nam, Yong-Hwa Park

The temporal dynamic model adapts itself to phonemes without explicitly given phoneme information during training, and results show the necessity to consider phoneme variation within utterances for more accurate and robust text-independent speaker verification.

Speaker Recognition Text-Independent Speaker Recognition +1

Paper
Code

FilterAugment: An Acoustic Environmental Data Augmentation Method

1 code implementation • 7 Oct 2021 • Hyeonuk Nam, Seong-Hu Kim, Yong-Hwa Park

Thus, training acoustic models for audio and speech tasks requires regularization on various acoustic environments in order to achieve robust performance in real life applications.

Data Augmentation Event Detection +2

Paper
Code

Deep learning based cough detection camera using enhanced features

no code implementations • 28 Jul 2021 • Gyeong-Tae Lee, Hyeonuk Nam, Seong-Hu Kim, Sang-Min Choi, Youngkey Kim, Yong-Hwa Park

Finally, a test F1 score of 91. 9% (test accuracy of 97. 2%) was achieved from G-net with the MFCC-V-A feature (named Spectroflow), an acoustic feature effective for use in cough detection.

Data Augmentation

Paper
Add Code

Heavily Augmented Sound Event Detection utilizing Weak Predictions

1 code implementation • 8 Jul 2021 • Hyeonuk Nam, Byeong-Yun Ko, Gyeong-Tae Lee, Seong-Hu Kim, Won-Ho Jung, Sang-Min Choi, Yong-Hwa Park

In this work, we used two main approaches to overcome the lack of strongly labeled data.

Ranked #6 on Sound Event Detection on DESED

Data Augmentation Event Detection +1

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.