no code implementations • 19 Mar 2024 • Taiqi He, Kwanghee Choi, Lindia Tjuatja, Nathaniel R. Robinson, Jiatong Shi, Shinji Watanabe, Graham Neubig, David R. Mortensen, Lori Levin
Thousands of the world's languages are in danger of extinction--a tremendous threat to cultural identities and human language diversity.
no code implementations • 30 Jan 2024 • Yifan Peng, Jinchuan Tian, William Chen, Siddhant Arora, Brian Yan, Yui Sudo, Muhammad Shakeel, Kwanghee Choi, Jiatong Shi, Xuankai Chang, Jee-weon Jung, Shinji Watanabe
In this work, we aim to improve the performance and efficiency of OWSM without extra training data.
1 code implementation • 15 Dec 2023 • Kwanghee Choi, Jee-weon Jung, Shinji Watanabe
With the success of self-supervised representations, researchers seek a better understanding of the information encapsulated within a representation.
no code implementations • 27 Sep 2023 • Xuankai Chang, Brian Yan, Kwanghee Choi, Jeeweon Jung, Yichen Lu, Soumi Maiti, Roshan Sharma, Jiatong Shi, Jinchuan Tian, Shinji Watanabe, Yuya Fujita, Takashi Maekaku, Pengcheng Guo, Yao-Fei Cheng, Pavel Denisov, Kohei Saijo, Hsiu-Hsuan Wang
Speech signals, typically sampled at rates in the tens of thousands per second, contain redundancies, evoking inefficiencies in sequence modeling.
1 code implementation • 28 May 2023 • Eun Jung Yeo, Kwanghee Choi, Sunhee Kim, Minhwa Chung
This paper proposes an improved Goodness of Pronunciation (GoP) that utilizes Uncertainty Quantification (UQ) for automatic speech intelligibility assessment for dysarthric speech.
1 code implementation • 8 Apr 2023 • Jeongkyun Park, Kwanghee Choi, Hyunjun Heo, Hyung-Min Park
However, the pooling problem remains; the length of speech representations is inherently variable.
1 code implementation • 16 Jan 2023 • Jeongkyun Park, Jung-Wook Hwang, Kwanghee Choi, Seung-Hyun Lee, Jun Hwan Ahn, Rae-Hong Park, Hyung-Min Park
Inspired by humans comprehending speech in a multi-modal manner, various audio-visual datasets have been constructed.
1 code implementation • 12 Nov 2022 • Soojung Hong, Kwanghee Choi
As maintaining road networks is labor-intensive, many automatic road extraction approaches have been introduced to solve this real-world problem, fueled by the abundance of large-scale high-resolution satellite imagery and advances in computer vision.
1 code implementation • 27 Oct 2022 • Kwanghee Choi, Eun Jung Yeo
Self-supervised models, namely, wav2vec and its variants, have shown promising results in various downstream tasks in the speech domain.
1 code implementation • 27 Oct 2022 • Eun Jung Yeo, Kwanghee Choi, Sunhee Kim, Minhwa Chung
To tackle the problem, we propose a novel automatic severity assessment method for dysarthric speech, using the self-supervised model in conjunction with multi-task learning.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +3
1 code implementation • ICCV 2023 • Seong Min Kye, Kwanghee Choi, Hyeongmin Byun, Buru Chang
Active learning (AL) aims to select the most useful data samples from an unlabeled data pool and annotate them to expand the labeled dataset under a limited budget.
1 code implementation • 16 Aug 2022 • Donghyun Son, Byounggyu Lew, Kwanghee Choi, Yongsu Baek, Seungwoo Choi, Beomjun Shin, Sungjoo Ha, Buru Chang
In this study, we formulate real-world scenarios of content moderation and introduce a simple yet effective threshold optimization method that searches the optimal thresholds of the multiple subtasks to make a reliable moderation decision in a cost-effective way.
1 code implementation • 25 Jun 2022 • Kwanghee Choi, Hyung-Min Park
Hence, we are motivated to distill the rich knowledge embedded inside a well-trained teacher text model to the student speech model.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
1 code implementation • 29 Nov 2021 • Seong Min Kye, Kwanghee Choi, Joonyoung Yi, Buru Chang
Recent studies on learning with noisy labels have shown remarkable performance by exploiting a small clean dataset.
no code implementations • 27 Oct 2021 • Kwanghee Choi, Martin Kersner, Jacob Morton, Buru Chang
Improving the performance of on-device audio classification models remains a challenge given the computational limits of the mobile environment.
Ranked #7 on Audio Classification on FSD50K
1 code implementation • 15 Mar 2021 • Kwanghee Choi, Minyoung Choe, Hyelee Lee
Neural architecture search (NAS) has fostered various fields of machine learning.
2 code implementations • CVPR 2021 • Youngkyu Hong, Seungju Han, Kwanghee Choi, Seokjun Seo, Beomsu Kim, Buru Chang
Although this method surpasses state-of-the-art methods on benchmark datasets, it can be further improved by directly disentangling the source label distribution from the model prediction in the training phase.
Ranked #20 on Long-tail Learning on Places-LT
1 code implementation • 16 Nov 2020 • Kwanghee Choi, Siyeong Lee
Notable progress has been made in numerous fields of machine learning based on neural network-driven mutual information (MI) bounds.