Search Results for author: Kwanghee Choi

Found 18 papers, 14 papers with code

Wav2Gloss: Generating Interlinear Glossed Text from Speech

no code implementations • 19 Mar 2024 • Taiqi He, Kwanghee Choi, Lindia Tjuatja, Nathaniel R. Robinson, Jiatong Shi, Shinji Watanabe, Graham Neubig, David R. Mortensen, Lori Levin

Thousands of the world's languages are in danger of extinction--a tremendous threat to cultural identities and human language diversity.

Paper
Add Code

OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer

no code implementations • 30 Jan 2024 • Yifan Peng, Jinchuan Tian, William Chen, Siddhant Arora, Brian Yan, Yui Sudo, Muhammad Shakeel, Kwanghee Choi, Jiatong Shi, Xuankai Chang, Jee-weon Jung, Shinji Watanabe

In this work, we aim to improve the performance and efficiency of OWSM without extra training data.

Paper
Add Code

Understanding Probe Behaviors through Variational Bounds of Mutual Information

1 code implementation • 15 Dec 2023 • Kwanghee Choi, Jee-weon Jung, Shinji Watanabe

With the success of self-supervised representations, researchers seek a better understanding of the information encapsulated within a representation.

Paper
Code

Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study

no code implementations • 27 Sep 2023 • Xuankai Chang, Brian Yan, Kwanghee Choi, Jeeweon Jung, Yichen Lu, Soumi Maiti, Roshan Sharma, Jiatong Shi, Jinchuan Tian, Shinji Watanabe, Yuya Fujita, Takashi Maekaku, Pengcheng Guo, Yao-Fei Cheng, Pavel Denisov, Kohei Saijo, Hsiu-Hsuan Wang

Speech signals, typically sampled at rates in the tens of thousands per second, contain redundancies, evoking inefficiencies in sequence modeling.

Automatic Speech Recognition Self-Supervised Learning +3

Paper
Add Code

Speech Intelligibility Assessment of Dysarthric Speech by using Goodness of Pronunciation with Uncertainty Quantification

1 code implementation • 28 May 2023 • Eun Jung Yeo, Kwanghee Choi, Sunhee Kim, Minhwa Chung

This paper proposes an improved Goodness of Pronunciation (GoP) that utilizes Uncertainty Quantification (UQ) for automatic speech intelligibility assessment for dysarthric speech.

Uncertainty Quantification

Paper
Code

Unsupervised Speech Representation Pooling Using Vector Quantization

1 code implementation • 8 Apr 2023 • Jeongkyun Park, Kwanghee Choi, Hyunjun Heo, Hyung-Min Park

However, the pooling problem remains; the length of speech representations is inherently variable.

Emotion Recognition intent-classification +4

Paper
Code

OLKAVS: An Open Large-Scale Korean Audio-Visual Speech Dataset

1 code implementation • 16 Jan 2023 • Jeongkyun Park, Jung-Wook Hwang, Kwanghee Choi, Seung-Hyun Lee, Jun Hwan Ahn, Rae-Hong Park, Hyung-Min Park

Inspired by humans comprehending speech in a multi-modal manner, various audio-visual datasets have been constructed.

Audio-Visual Speech Recognition Lip Reading +3

Paper
Code

Correcting Faulty Road Maps by Image Inpainting

1 code implementation • 12 Nov 2022 • Soojung Hong, Kwanghee Choi

As maintaining road networks is labor-intensive, many automatic road extraction approaches have been introduced to solve this real-world problem, fueled by the abundance of large-scale high-resolution satellite imagery and advances in computer vision.

Image Inpainting

Paper
Code

Opening the Black Box of wav2vec Feature Encoder

1 code implementation • 27 Oct 2022 • Kwanghee Choi, Eun Jung Yeo

Self-supervised models, namely, wav2vec and its variants, have shown promising results in various downstream tasks in the speech domain.

Paper
Code

Automatic Severity Classification of Dysarthric speech by using Self-supervised Model with Multi-task Learning

1 code implementation • 27 Oct 2022 • Eun Jung Yeo, Kwanghee Choi, Sunhee Kim, Minhwa Chung

To tackle the problem, we propose a novel automatic severity assessment method for dysarthric speech, using the self-supervised model in conjunction with multi-task learning.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Paper
Code

TiDAL: Learning Training Dynamics for Active Learning

1 code implementation • ICCV 2023 • Seong Min Kye, Kwanghee Choi, Hyeongmin Byun, Buru Chang

Active learning (AL) aims to select the most useful data samples from an unlabeled data pool and annotate them to expand the labeled dataset under a limited budget.

Active Learning

Paper
Code

Reliable Decision from Multiple Subtasks through Threshold Optimization: Content Moderation in the Wild

1 code implementation • 16 Aug 2022 • Donghyun Son, Byounggyu Lew, Kwanghee Choi, Yongsu Baek, Seungwoo Choi, Beomjun Shin, Sungjoo Ha, Buru Chang

In this study, we formulate real-world scenarios of content moderation and introduce a simple yet effective threshold optimization method that searches the optimal thresholds of the multiple subtasks to make a reliable moderation decision in a cost-effective way.

Paper
Code

Distilling a Pretrained Language Model to a Multilingual ASR Model

1 code implementation • 25 Jun 2022 • Kwanghee Choi, Hyung-Min Park

Hence, we are motivated to distill the rich knowledge embedded inside a well-trained teacher text model to the student speech model.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Code

Learning with Noisy Labels by Efficient Transition Matrix Estimation to Combat Label Miscorrection

1 code implementation • 29 Nov 2021 • Seong Min Kye, Kwanghee Choi, Joonyoung Yi, Buru Chang

Recent studies on learning with noisy labels have shown remarkable performance by exploiting a small clean dataset.

Ranked #6 on Image Classification on Clothing1M (using clean data)

Learning with noisy labels Meta-Learning

Paper
Code

Temporal Knowledge Distillation for On-device Audio Classification

no code implementations • 27 Oct 2021 • Kwanghee Choi, Martin Kersner, Jacob Morton, Buru Chang

Improving the performance of on-device audio classification models remains a challenge given the computational limits of the mobile environment.

Ranked #7 on Audio Classification on FSD50K

Audio Classification Event Detection +2

Paper
Add Code

Pretraining Neural Architecture Search Controllers with Locality-based Self-Supervised Learning

1 code implementation • 15 Mar 2021 • Kwanghee Choi, Minyoung Choe, Hyelee Lee

Neural architecture search (NAS) has fostered various fields of machine learning.

Metric Learning Neural Architecture Search +1

Paper
Code

Disentangling Label Distribution for Long-tailed Visual Recognition

2 code implementations • CVPR 2021 • Youngkyu Hong, Seungju Han, Kwanghee Choi, Seokjun Seo, Beomsu Kim, Buru Chang

Although this method surpasses state-of-the-art methods on benchmark datasets, it can be further improved by directly disentangling the source label distribution from the model prediction in the training phase.

Ranked #20 on Long-tail Learning on Places-LT

Image Classification Long-tail Learning

Paper
Code

Combating the Instability of Mutual Information-based Losses via Regularization

1 code implementation • 16 Nov 2020 • Kwanghee Choi, Siyeong Lee

Notable progress has been made in numerous fields of machine learning based on neural network-driven mutual information (MI) bounds.

Contrastive Learning

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.