Search Results for author: Kwanghee Choi

Found 18 papers, 14 papers with code

Wav2Gloss: Generating Interlinear Glossed Text from Speech

no code implementations19 Mar 2024 Taiqi He, Kwanghee Choi, Lindia Tjuatja, Nathaniel R. Robinson, Jiatong Shi, Shinji Watanabe, Graham Neubig, David R. Mortensen, Lori Levin

Thousands of the world's languages are in danger of extinction--a tremendous threat to cultural identities and human language diversity.

Understanding Probe Behaviors through Variational Bounds of Mutual Information

1 code implementation15 Dec 2023 Kwanghee Choi, Jee-weon Jung, Shinji Watanabe

With the success of self-supervised representations, researchers seek a better understanding of the information encapsulated within a representation.

Speech Intelligibility Assessment of Dysarthric Speech by using Goodness of Pronunciation with Uncertainty Quantification

1 code implementation28 May 2023 Eun Jung Yeo, Kwanghee Choi, Sunhee Kim, Minhwa Chung

This paper proposes an improved Goodness of Pronunciation (GoP) that utilizes Uncertainty Quantification (UQ) for automatic speech intelligibility assessment for dysarthric speech.

Uncertainty Quantification

Correcting Faulty Road Maps by Image Inpainting

1 code implementation12 Nov 2022 Soojung Hong, Kwanghee Choi

As maintaining road networks is labor-intensive, many automatic road extraction approaches have been introduced to solve this real-world problem, fueled by the abundance of large-scale high-resolution satellite imagery and advances in computer vision.

Image Inpainting

Opening the Black Box of wav2vec Feature Encoder

1 code implementation27 Oct 2022 Kwanghee Choi, Eun Jung Yeo

Self-supervised models, namely, wav2vec and its variants, have shown promising results in various downstream tasks in the speech domain.

Automatic Severity Classification of Dysarthric speech by using Self-supervised Model with Multi-task Learning

1 code implementation27 Oct 2022 Eun Jung Yeo, Kwanghee Choi, Sunhee Kim, Minhwa Chung

To tackle the problem, we propose a novel automatic severity assessment method for dysarthric speech, using the self-supervised model in conjunction with multi-task learning.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

TiDAL: Learning Training Dynamics for Active Learning

1 code implementation ICCV 2023 Seong Min Kye, Kwanghee Choi, Hyeongmin Byun, Buru Chang

Active learning (AL) aims to select the most useful data samples from an unlabeled data pool and annotate them to expand the labeled dataset under a limited budget.

Active Learning

Reliable Decision from Multiple Subtasks through Threshold Optimization: Content Moderation in the Wild

1 code implementation16 Aug 2022 Donghyun Son, Byounggyu Lew, Kwanghee Choi, Yongsu Baek, Seungwoo Choi, Beomjun Shin, Sungjoo Ha, Buru Chang

In this study, we formulate real-world scenarios of content moderation and introduce a simple yet effective threshold optimization method that searches the optimal thresholds of the multiple subtasks to make a reliable moderation decision in a cost-effective way.

Distilling a Pretrained Language Model to a Multilingual ASR Model

1 code implementation25 Jun 2022 Kwanghee Choi, Hyung-Min Park

Hence, we are motivated to distill the rich knowledge embedded inside a well-trained teacher text model to the student speech model.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Temporal Knowledge Distillation for On-device Audio Classification

no code implementations27 Oct 2021 Kwanghee Choi, Martin Kersner, Jacob Morton, Buru Chang

Improving the performance of on-device audio classification models remains a challenge given the computational limits of the mobile environment.

Audio Classification Event Detection +2

Disentangling Label Distribution for Long-tailed Visual Recognition

2 code implementations CVPR 2021 Youngkyu Hong, Seungju Han, Kwanghee Choi, Seokjun Seo, Beomsu Kim, Buru Chang

Although this method surpasses state-of-the-art methods on benchmark datasets, it can be further improved by directly disentangling the source label distribution from the model prediction in the training phase.

Image Classification Long-tail Learning

Combating the Instability of Mutual Information-based Losses via Regularization

1 code implementation16 Nov 2020 Kwanghee Choi, Siyeong Lee

Notable progress has been made in numerous fields of machine learning based on neural network-driven mutual information (MI) bounds.

Contrastive Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.