Search Results for author: Geonmin Kim

Found 8 papers, 2 papers with code

Compositional Sentence Representation from Character within Large Context Text

no code implementations • 2 May 2016 • Geonmin Kim, Hwaran Lee, Jisu Choi, Soo-Young Lee

In the HCRN, word representations are built from characters, thus resolving the data-sparsity problem, and inter-sentence dependency is embedded into sentence representation at the level of sentence composition.

Dialogue Act Classification General Classification +1

Paper
Add Code

Deep CNNs along the Time Axis with Intermap Pooling for Robustness to Spectral Variations

no code implementations • 10 Jun 2016 • Hwaran Lee, Geonmin Kim, Ho-Gyeong Kim, Sang-Hoon Oh, Soo-Young Lee

Convolutional neural networks (CNNs) with convolutional and pooling operations along the frequency axis have been proposed to attain invariance to frequency shifts of features.

Paper
Add Code

Unpaired Speech Enhancement by Acoustic and Adversarial Supervision for Speech Recognition

1 code implementation • 6 Nov 2018 • Geonmin Kim, Hwaran Lee, Bo-Kyeong Kim, Sang-Hoon Oh, Soo-Young Lee

Many speech enhancement methods try to learn the relationship between noisy and clean speech, obtained using an acoustic room simulator.

Generative Adversarial Network Speech Enhancement +2

Paper
Code

Semi-supervised Disentanglement with Independent Vector Variational Autoencoders

1 code implementation • 14 Mar 2020 • Bo-Kyeong Kim, Sungjin Park, Geonmin Kim, Soo-Young Lee

We aim to separate the generative factors of data into two latent vectors in a variational autoencoder.

Disentanglement General Classification

Paper
Code

Spell my name: keyword boosted speech recognition

no code implementations • 6 Oct 2021 • Namkyu Jung, Geonmin Kim, Joon Son Chung

Recognition of uncommon words such as names and technical terminology is important to understanding conversations in context.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Paper
Add Code

Back from the future: bidirectional CTC decoding using future information in speech recognition

no code implementations • 7 Oct 2021 • Namkyu Jung, Geonmin Kim, Han-Gyu Kim

In this paper, we propose a simple but effective method to decode the output of Connectionist Temporal Classifier (CTC) model using a bi-directional neural language model.

Language Modelling speech-recognition +1

Paper
Add Code

Encoder-decoder multimodal speaker change detection

no code implementations • 1 Jun 2023 • Jee-weon Jung, Soonshin Seo, Hee-Soo Heo, Geonmin Kim, You Jin Kim, Young-ki Kwon, Minjae Lee, Bong-Jin Lee

The task of speaker change detection (SCD), which detects points where speakers change in an input, is essential for several applications.

Automatic Speech Recognition Change Detection +2

Paper
Add Code

Shortened LLaMA: A Simple Depth Pruning for Large Language Models

no code implementations • 5 Feb 2024 • Bo-Kyeong Kim, Geonmin Kim, Tae-Ho Kim, Thibault Castells, Shinkook Choi, Junho Shin, Hyoung-Kyu Song

Structured pruning of modern large language models (LLMs) has emerged as a way of decreasing their high computational needs.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.