Search Results for author: Ju-ho Kim

Found 19 papers, 11 papers with code

Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Probabilistic Models

1 code implementation14 Sep 2023 Ju-ho Kim, Jungwoo Heo, Hyun-seo Shin, Chan-yeong Lim, Ha-Jin Yu

Diff-SV unifies a DPM-based speech enhancement system with a speaker embedding extractor, and yields a discriminative and noise-tolerable speaker representation through a hierarchical structure.

Speaker Verification Speech Enhancement

PAS: Partial Additive Speech Data Augmentation Method for Noise Robust Speaker Verification

1 code implementation20 Jul 2023 Wonbin Kim, Hyun-seo Shin, Ju-ho Kim, Jungwoo Heo, Chan-yeong Lim, Ha-Jin Yu

In this paper, we propose a new additive noise method, partial additive speech (PAS), which aims to train SV systems to be less affected by noisy environments.

Data Augmentation Speaker Verification

Integrated Parameter-Efficient Tuning for General-Purpose Audio Models

1 code implementation4 Nov 2022 Ju-ho Kim, Jungwoo Heo, Hyun-seo Shin, Chan-yeong Lim, Ha-Jin Yu

To overcome these limitations, the present study explores and applies efficient transfer learning methods in the audio domain.

Genre classification Keyword Spotting +3

Two Methods for Spoofing-Aware Speaker Verification: Multi-Layer Perceptron Score Fusion Model and Integrated Embedding Projector

no code implementations28 Jun 2022 Jungwoo Heo, Ju-ho Kim, Hyun-seo Shin

The use of deep neural networks (DNN) has dramatically elevated the performance of automatic speaker verification (ASV) over the last decade.

Speaker Verification

Extended U-Net for Speaker Verification in Noisy Environments

1 code implementation27 Jun 2022 Ju-ho Kim, Jungwoo Heo, Hye-jin Shim, Ha-Jin Yu

Background noise is a well-known factor that deteriorates the accuracy and reliability of speaker verification (SV) systems by blurring speech intelligibility.

Denoising Speaker Identification +1

RawNeXt: Speaker verification system for variable-duration utterances with deep layer aggregation and extended dynamic scaling policies

1 code implementation15 Dec 2021 Ju-ho Kim, Hye-jin Shim, Jungwoo Heo, Ha-Jin Yu

Despite achieving satisfactory performance in speaker verification using deep neural networks, variable-duration utterances remain a challenge that threatens the robustness of systems.

Speaker Verification

Attentive max feature map and joint training for acoustic scene classification

no code implementations15 Apr 2021 Hye-jin Shim, Jee-weon Jung, Ju-ho Kim, Ha-Jin Yu

Furthermore, adopting the proposed attentive max feature map, our team placed fourth in the recent DCASE 2021 challenge.

Acoustic Scene Classification Multi-Task Learning +1

Learning Metrics from Mean Teacher: A Supervised Learning Method for Improving the Generalization of Speaker Verification System

no code implementations14 Apr 2021 Ju-ho Kim, Hye-jin Shim, Jee-weon Jung, Ha-Jin Yu

By learning the reliable intermediate representation of the mean teacher network, we expect that the proposed method can explore more discriminatory embedding spaces and improve the generalization performance of the speaker verification system.

Speaker Verification

DCASENET: A joint pre-trained deep neural network for detecting and classifying acoustic scenes and events

1 code implementation21 Sep 2020 Jee-weon Jung, Hye-jin Shim, Ju-ho Kim, Ha-Jin Yu

Single task deep neural networks that perform a target task among diverse cross-related tasks in the acoustic scene and event literature are being developed.

Acoustic Scene Classification Audio Tagging +3

Capturing scattered discriminative information using a deep architecture in acoustic scene classification

no code implementations9 Jul 2020 Hye-jin Shim, Jee-weon Jung, Ju-ho Kim, Ha-Jin Yu

Various experiments are conducted using the detection and classification of acoustic scenes and events 2020 task1-a dataset to validate the proposed methods.

Acoustic Scene Classification General Classification +1

Integrated Replay Spoofing-aware Text-independent Speaker Verification

no code implementations10 Jun 2020 Hye-jin Shim, Jee-weon Jung, Ju-ho Kim, Seung-bin Kim, Ha-Jin Yu

In this paper, we propose two approaches for building an integrated system of speaker verification and presentation attack detection: an end-to-end monolithic approach and a back-end modular approach.

Multi-Task Learning Speaker Identification +1

Segment Aggregation for short utterances speaker verification using raw waveforms

1 code implementation7 May 2020 Seung-bin Kim, Jee-weon Jung, Hye-jin Shim, Ju-ho Kim, Ha-Jin Yu

The proposed method segments an input utterance into several short utterances and then aggregates the segment embeddings extracted from the segmented inputs to compose a speaker embedding.

Speaker Verification

Improved RawNet with Feature Map Scaling for Text-independent Speaker Verification using Raw Waveforms

2 code implementations1 Apr 2020 Jee-weon Jung, Seung-bin Kim, Hye-jin Shim, Ju-ho Kim, Ha-Jin Yu

Recent advances in deep learning have facilitated the design of speaker verification systems that directly input raw waveforms.

Text-Independent Speaker Verification

RawNet: Advanced end-to-end deep neural network using raw waveforms for text-independent speaker verification

4 code implementations17 Apr 2019 Jee-weon Jung, Hee-Soo Heo, Ju-ho Kim, Hye-jin Shim, Ha-Jin Yu

In this study, we explore end-to-end deep neural networks that input raw waveforms to improve various aspects: front-end speaker embedding extraction including model architecture, pre-training scheme, additional objective functions, and back-end classification.

Classification Data Augmentation +2

Teaching Syntax by Adversarial Distraction

no code implementations WS 2018 Ju-ho Kim, Christopher Malon, Asim Kadav

Existing entailment datasets mainly pose problems which can be answered without attention to grammar or word order.

General Classification

Cannot find the paper you are looking for? You can Submit a new open access paper.