Search Results for author: Kyuhong Shim

Found 17 papers, 3 papers with code

Expand-and-Quantize: Unsupervised Semantic Segmentation Using High-Dimensional Space and Product Quantization

no code implementations12 Dec 2023 Jiyoung Kim, Kyuhong Shim, Insu Lee, Byonghyo Shim

In this paper, we propose a novel USS framework called Expand-and-Quantize Unsupervised Semantic Segmentation (EQUSS), which combines the benefits of high-dimensional spaces for better clustering and product quantization for effective information compression.

Clustering Dimensionality Reduction +3

Knowledge Distillation from Non-streaming to Streaming ASR Encoder using Auxiliary Non-streaming Layer

no code implementations31 Aug 2023 Kyuhong Shim, Jinkyu Lee, Simyung Chang, Kyuwoong Hwang

Streaming automatic speech recognition (ASR) models are restricted from accessing future context, which results in worse performance compared to the non-streaming models.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Improving Small Footprint Few-shot Keyword Spotting with Supervision on Auxiliary Data

no code implementations31 Aug 2023 Seunghan Yang, Byeonggeun Kim, Kyuhong Shim, Simyung Chang

Few-shot keyword spotting (FS-KWS) models usually require large-scale annotated datasets to generalize to unseen target keywords.

Keyword Spotting Multi-Task Learning +1

Depth-Relative Self Attention for Monocular Depth Estimation

no code implementations25 Apr 2023 Kyuhong Shim, Jiyoung Kim, Gusang Lee, Byonghyo Shim

Monocular depth estimation is very challenging because clues to the exact depth are incomplete in a single RGB image.

Monocular Depth Estimation

Semantic-Preserving Augmentation for Robust Image-Text Retrieval

no code implementations10 Mar 2023 Sunwoo Kim, Kyuhong Shim, Luong Trung Nguyen, Byonghyo Shim

Image text retrieval is a task to search for the proper textual descriptions of the visual world and vice versa.

Retrieval Text Retrieval

Teacher Intervention: Improving Convergence of Quantization Aware Training for Ultra-Low Precision Transformers

1 code implementation23 Feb 2023 Minsoo Kim, Kyuhong Shim, Seongmin Park, Wonyong Sung, Jungwook Choi

Pre-trained Transformer models such as BERT have shown great success in a wide range of applications, but at the cost of substantial increases in model complexity.

Knowledge Distillation Quantization

Vision Transformer-based Feature Extraction for Generalized Zero-Shot Learning

no code implementations2 Feb 2023 Jiseob Kim, Kyuhong Shim, Junhan Kim, Byonghyo Shim

In AAM, the correlation between each patch feature and the synthetic image attribute is used as the importance weight for each patch.

Attribute Generalized Zero-Shot Learning

Exploring Attention Map Reuse for Efficient Transformer Neural Networks

no code implementations29 Jan 2023 Kyuhong Shim, Jungwook Choi, Wonyong Sung

In this paper, we provide a comprehensive study on attention map reuse focusing on its ability to accelerate inference.

speech-recognition Speech Recognition

A Comparison of Transformer, Convolutional, and Recurrent Neural Networks on Phoneme Recognition

no code implementations1 Oct 2022 Kyuhong Shim, Wonyong Sung

Our analyses show that Transformer and Conformer models benefit from the long-range accessibility of self-attention through input frames.

speech-recognition Speech Recognition

Towards Intelligent Millimeter and Terahertz Communication for 6G: Computer Vision-aided Beamforming

no code implementations6 Sep 2022 Yongjun Ahn, jinhong Kim, Seungnyun Kim, Kyuhong Shim, Jiyoung Kim, Sangtae Kim, Byonghyo Shim

Beamforming technique realized by the multiple-input-multiple-output (MIMO) antenna arrays has been widely used to compensate for the severe path loss in the millimeter wave (mmWave) bands.

Management Quantization

Similarity and Content-based Phonetic Self Attention for Speech Recognition

no code implementations19 Mar 2022 Kyuhong Shim, Wonyong Sung

Especially, SA heads in lower layers capture various phonetic characteristics by the query-key dot product, which is designed to compute the pairwise relationship between frames.

speech-recognition Speech Recognition

Korean Tokenization for Beam Search Rescoring in Speech Recognition

no code implementations22 Feb 2022 Kyuhong Shim, Hyewon Bae, Wonyong Sung

Although the common approach is to use the same tokenization method for external LM as the ASR model, we show that it may not be the best choice for Korean.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Semantic Feature Extraction for Generalized Zero-shot Learning

no code implementations29 Dec 2021 Junhan Kim, Kyuhong Shim, Byonghyo Shim

Key idea of the proposed approach, henceforth referred to as semantic feature extraction-based GZSL (SE-GZSL), is to use the semantic feature containing only attribute-related information in learning the relationship between the image and the attribute.

Attribute Generalized Zero-Shot Learning

TernGEMM: GEneral Matrix Multiply Library with Ternary Weights for Fast DNN Inference

1 code implementation 2021 IEEE Workshop on Signal Processing Systems (SiPS) 2021 Seokhyeon Choi, Kyuhong Shim, Jungwook Choi, Wonyong Sung, Byonghyo Shim

We propose TernGEMM, a special GEMM library using SIMD instructions for Deep Neural Network (DNN) inference with ternary weights and activations under 8-bit.

Layer-wise Pruning of Transformer Attention Heads for Efficient Language Modeling

1 code implementation 2021 18th International SoC Design Conference (ISOCC) 2021 Kyuhong Shim, Iksoo Choi, Wonyong Sung, Jungwook Choi

While Transformer-based models have shown impressive language modeling performance, the large computation cost is often prohibitive for practical use.

Language Modelling

SVD-Softmax: Fast Softmax Approximation on Large Vocabulary Neural Networks

no code implementations NeurIPS 2017 Kyuhong Shim, Minjae Lee, Iksoo Choi, Yoonho Boo, Wonyong Sung

The approximate probability of each word can be estimated with only a small part of the weight matrix by using a few large singular values and the corresponding elements for most of the words.

Language Modelling Machine Translation +1

Cannot find the paper you are looking for? You can Submit a new open access paper.