Search Results for author: Geonmin Kim

Found 8 papers, 2 papers with code

Shortened LLaMA: A Simple Depth Pruning for Large Language Models

no code implementations5 Feb 2024 Bo-Kyeong Kim, Geonmin Kim, Tae-Ho Kim, Thibault Castells, Shinkook Choi, Junho Shin, Hyoung-Kyu Song

Structured pruning of modern large language models (LLMs) has emerged as a way of decreasing their high computational needs.

Encoder-decoder multimodal speaker change detection

no code implementations1 Jun 2023 Jee-weon Jung, Soonshin Seo, Hee-Soo Heo, Geonmin Kim, You Jin Kim, Young-ki Kwon, Minjae Lee, Bong-Jin Lee

The task of speaker change detection (SCD), which detects points where speakers change in an input, is essential for several applications.

Automatic Speech Recognition Change Detection +2

Back from the future: bidirectional CTC decoding using future information in speech recognition

no code implementations7 Oct 2021 Namkyu Jung, Geonmin Kim, Han-Gyu Kim

In this paper, we propose a simple but effective method to decode the output of Connectionist Temporal Classifier (CTC) model using a bi-directional neural language model.

Language Modelling speech-recognition +1

Spell my name: keyword boosted speech recognition

no code implementations6 Oct 2021 Namkyu Jung, Geonmin Kim, Joon Son Chung

Recognition of uncommon words such as names and technical terminology is important to understanding conversations in context.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Unpaired Speech Enhancement by Acoustic and Adversarial Supervision for Speech Recognition

1 code implementation6 Nov 2018 Geonmin Kim, Hwaran Lee, Bo-Kyeong Kim, Sang-Hoon Oh, Soo-Young Lee

Many speech enhancement methods try to learn the relationship between noisy and clean speech, obtained using an acoustic room simulator.

Generative Adversarial Network Speech Enhancement +2

Deep CNNs along the Time Axis with Intermap Pooling for Robustness to Spectral Variations

no code implementations10 Jun 2016 Hwaran Lee, Geonmin Kim, Ho-Gyeong Kim, Sang-Hoon Oh, Soo-Young Lee

Convolutional neural networks (CNNs) with convolutional and pooling operations along the frequency axis have been proposed to attain invariance to frequency shifts of features.

Compositional Sentence Representation from Character within Large Context Text

no code implementations2 May 2016 Geonmin Kim, Hwaran Lee, Jisu Choi, Soo-Young Lee

In the HCRN, word representations are built from characters, thus resolving the data-sparsity problem, and inter-sentence dependency is embedded into sentence representation at the level of sentence composition.

Dialogue Act Classification General Classification +1

Cannot find the paper you are looking for? You can Submit a new open access paper.