no code implementations • NAACL 2016 • Paul Crook, Alex Marin, Vipul Agarwal, Khushboo Aggarwal, Tasos Anastasakos, Ravi Bikkula, Daniel Boies, Asli Celikyilmaz, Senthilkumar Chandramohan, Zhaleh Feizollahi, Roman Holenstein, Minwoo Jeong, Omar Khan, Young-Bum Kim, Elizabeth Krawczyk, Xiaohu Liu, Danko Panic, Vasiliy Radostev, Nikhil Ramesh, Jean-Philippe Robichaud, Alexandre Rochette, Logan Stromberg, Ruhi Sarikaya
no code implementations • COLING 2016 • Young-Bum Kim, Karl Stratos, Ruhi Sarikaya
In many applications such as personal digital assistants, there is a constant need for new domains to increase the system's coverage of user queries.
no code implementations • COLING 2016 • Young-Bum Kim, Karl Stratos, Ruhi Sarikaya
Popular techniques for domain adaptation such as the feature augmentation method of Daumé III (2009) have mostly been considered for sparse binary-valued features, but not for dense real-valued features such as those used in neural networks.
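The feature augmentation method mentioned above duplicates each feature vector into a shared copy plus one copy per domain, zeroing out the copies for non-matching domains. A minimal sketch (the function name and domain labels are illustrative, not from the paper):

```python
import numpy as np

def augment(x, domain, domains):
    """Feature augmentation a la Daume III: concatenate a shared copy of
    the feature vector with one per-domain block, where only the block
    for the example's own domain is non-zero."""
    blocks = [x]  # shared copy, active for every domain
    for d in domains:
        blocks.append(x if d == domain else np.zeros_like(x))
    return np.concatenate(blocks)

x = np.array([1.0, 2.0])
# shared block + "music" block populated, "weather" block zeroed
aug = augment(x, "music", ["music", "weather"])
```

A shared weight learns domain-general behavior while the per-domain blocks absorb domain-specific deviations.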
no code implementations • ACL 2017 • Young-Bum Kim, Karl Stratos, Dongchan Kim
When given domain K + 1, our model uses a weighted combination of the K domain experts' feedback along with its own opinion to make predictions on the new domain.
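The weighted combination described above can be sketched as an attention-weighted mixture over the experts' label distributions plus the model's own; the softmax weighting and the score inputs here are illustrative assumptions, not the paper's exact formulation:

```python
import numpy as np

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

def expert_combined_prediction(expert_probs, own_probs, scores):
    """Mix K domain experts' label distributions with the new domain's
    own prediction, weighting each by a (learned) attention score.
    `scores` has K + 1 entries: one per expert plus one for self."""
    weights = softmax(np.asarray(scores))
    stacked = np.vstack(expert_probs + [own_probs])
    return weights @ stacked  # still a valid probability distribution

expert_probs = [np.array([0.7, 0.3]), np.array([0.2, 0.8])]
own = np.array([0.5, 0.5])
combined = expert_combined_prediction(expert_probs, own, [1.0, 0.0, 2.0])
```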
no code implementations • ACL 2017 • Young-Bum Kim, Karl Stratos, Dongchan Kim
Both cause a distribution mismatch between training and evaluation, leading to a model that overfits the flawed training data and performs poorly on the test data.
no code implementations • EMNLP 2017 • Joo-Kyung Kim, Young-Bum Kim, Ruhi Sarikaya, Eric Fosler-Lussier
Evaluating on POS datasets from 14 languages in the Universal Dependencies corpus, we show that the proposed transfer learning model improves the POS tagging performance of the target languages without exploiting any linguistic knowledge between the source language and the target language.
no code implementations • 29 Nov 2017 • Young-Bum Kim, Sungjin Lee, Ruhi Sarikaya
In multi-turn dialogs, natural language understanding models can make avoidable errors when they are blind to contextual information.
no code implementations • 16 Jan 2018 • Young-Bum Kim, Sungjin Lee, Karl Stratos
In practice, most spoken language understanding systems process user input in a pipelined manner: first the domain is predicted, then the intent and semantic slots are inferred according to the semantic frames of the predicted domain.
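The pipelined flow described above can be sketched as follows; the classifier objects, domain names, and intent labels are hypothetical stand-ins for trained models:

```python
def pipeline(utterance, domain_clf, intent_clfs, slot_taggers):
    """Pipelined SLU: predict the domain first, then run only that
    domain's intent classifier and slot tagger on the utterance."""
    domain = domain_clf(utterance)
    intent = intent_clfs[domain](utterance)
    slots = slot_taggers[domain](utterance)
    return {"domain": domain, "intent": intent, "slots": slots}

# toy stand-ins for trained models
domain_clf = lambda u: "music" if "play" in u else "weather"
intent_clfs = {"music": lambda u: "PlayMusic", "weather": lambda u: "GetWeather"}
slot_taggers = {"music": lambda u: {"song": u.split()[-1]}, "weather": lambda u: {}}

result = pipeline("play yesterday", domain_clf, intent_clfs, slot_taggers)
```

A known weakness of this design, which motivates joint modeling, is that a domain misprediction propagates: the wrong intent classifier and slot tagger are consulted downstream.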
no code implementations • NAACL 2018 • Young-Bum Kim, Dongchan Kim, Joo-Kyung Kim, Ruhi Sarikaya
Intelligent personal digital assistants (IPDAs), a popular real-life application with spoken language understanding capabilities, can cover potentially thousands of overlapping domains for natural language understanding, and the task of finding the best domain to handle an utterance becomes a challenging problem on a large scale.
no code implementations • 22 Apr 2018 • Young-Bum Kim, Dongchan Kim, Anjishnu Kumar, Ruhi Sarikaya
In this paper, we explore the task of mapping spoken language utterances to one of thousands of natural language understanding domains in intelligent personal digital assistants (IPDAs).
no code implementations • COLING 2018 • Chanhee Lee, Young-Bum Kim, Dongyub Lee, Heuiseok Lim
Generating character-level features is an important step for achieving good results in various natural language processing tasks.
no code implementations • COLING 2018 • Andrew Matteson, Chanhee Lee, Young-Bum Kim, Heuiseok Lim
Because Korean is a highly agglutinative, character-rich language, previous work on Korean morphological analysis typically employs sub-character features known as graphemes or otherwise relies on comprehensive prior linguistic knowledge (i.e., a dictionary of known morphological transformation forms, or actions).
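The grapheme features mentioned above come from decomposing each precomposed Hangul syllable into its constituent jamo, which follows directly from the Unicode Hangul composition formula (this decomposition is standard Unicode arithmetic; whether the paper uses exactly this routine is an assumption):

```python
# Jamo inventories in Unicode composition order
LEADS = list("ㄱㄲㄴㄷㄸㄹㅁㅂㅃㅅㅆㅇㅈㅉㅊㅋㅌㅍㅎ")
VOWELS = list("ㅏㅐㅑㅒㅓㅔㅕㅖㅗㅘㅙㅚㅛㅜㅝㅞㅟㅠㅡㅢㅣ")
TAILS = [""] + list("ㄱㄲㄳㄴㄵㄶㄷㄹㄺㄻㄼㄽㄾㄿㅀㅁㅂㅄㅅㅆㅇㅈㅊㅋㅌㅍㅎ")

def to_graphemes(syllable):
    """Decompose one precomposed Hangul syllable (U+AC00..U+D7A3) into
    its lead consonant, vowel, and optional tail consonant."""
    code = ord(syllable) - 0xAC00
    lead, rest = divmod(code, 588)   # 588 = 21 vowels * 28 tail slots
    vowel, tail = divmod(rest, 28)
    return LEADS[lead] + VOWELS[vowel] + TAILS[tail]
```

For example, the syllable 한 decomposes into the three graphemes ㅎ, ㅏ, ㄴ, giving a model sub-character units to work with.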
no code implementations • 29 Jun 2018 • Joo-Kyung Kim, Young-Bum Kim
In domain classification for spoken dialog systems, correct detection of out-of-domain (OOD) utterances is crucial because it reduces confusion and unnecessary interaction costs between users and the systems.
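A common baseline for the OOD detection task described above (not necessarily the method this paper proposes) is to reject an utterance as out-of-domain when the top in-domain confidence falls below a threshold:

```python
def route(utterance, classify, threshold=0.7):
    """Confidence-threshold OOD detection: accept the top-scoring
    domain only if its probability clears the threshold, otherwise
    label the utterance out-of-domain."""
    probs = classify(utterance)  # dict mapping domain -> probability
    best, p = max(probs.items(), key=lambda kv: kv[1])
    return best if p >= threshold else "OOD"

# toy classifier stand-in
classify = lambda u: {"music": 0.9, "weather": 0.1} if "play" in u else {"music": 0.4, "weather": 0.45}
```

The threshold trades off false accepts (OOD utterances handled by some domain) against false rejects (in-domain utterances bounced back to the user).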
no code implementations • ACL 2018 • Young-Bum Kim, Dongchan Kim, Anjishnu Kumar, Ruhi Sarikaya
In this paper, we explore the task of mapping spoken language utterances to one of thousands of natural language understanding domains in intelligent personal digital assistants (IPDAs).
no code implementations • 13 Dec 2018 • Jihwan Lee, Dongchan Kim, Ruhi Sarikaya, Young-Bum Kim
Our proposed model learns the vector representation of intents based on the slots tied to these intents by aggregating the representations of the slots.
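The slot-based intent representation described above can be sketched as pooling the embeddings of an intent's slots; mean pooling and the slot names here are illustrative choices, not necessarily the paper's aggregation function:

```python
import numpy as np

def intent_embedding(slot_names, slot_vectors):
    """Represent an intent as the mean of the embeddings of the slots
    tied to it, so intents sharing slots land near each other."""
    return np.mean([slot_vectors[s] for s in slot_names], axis=0)

slot_vectors = {
    "artist": np.array([1.0, 0.0]),
    "song":   np.array([0.0, 1.0]),
}
play_music = intent_embedding(["artist", "song"], slot_vectors)
```

A practical appeal of this construction is that a new intent gets a usable vector immediately from its slot schema, before any utterance-level training data exists for it.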
no code implementations • EMNLP 2018 • Joo-Kyung Kim, Young-Bum Kim
The attention weights are explicitly encouraged to be similar to the corresponding elements of the ground-truth's one-hot vector by supervised attention, and the attention information of the other enabled domains is leveraged through self-distillation.
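The supervised-attention objective described above can be expressed as a cross-entropy term pulling the attention distribution toward the ground-truth one-hot vector; this loss form is an illustrative sketch, and the self-distillation part is omitted:

```python
import numpy as np

def supervised_attention_loss(attn, gold_index):
    """Cross-entropy between the attention distribution and the
    one-hot ground truth, which reduces to the negative log of the
    attention weight on the gold element."""
    return -np.log(attn[gold_index] + 1e-12)

sharp = np.array([0.90, 0.05, 0.05])   # attention concentrated on gold
flat  = np.array([0.34, 0.33, 0.33])   # attention spread out
```

Minimizing this term pushes the gold element's attention weight toward 1, i.e., toward the one-hot target.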
no code implementations • NAACL 2019 • Jihwan Lee, Ruhi Sarikaya, Young-Bum Kim
In this paper, we introduce an approach for leveraging available data across multiple locales sharing the same language to 1) improve domain classification model accuracy in Spoken Language Understanding and user experience even if new locales do not have sufficient data and 2) reduce the cost of scaling the domain classifier to a large number of locales.
no code implementations • NAACL 2019 • Han Li, Jihwan Lee, Sidharth Mudgal, Ruhi Sarikaya, Young-Bum Kim
This is a major component in mainstream IPDAs in industry.
no code implementations • 8 Mar 2020 • Joo-Kyung Kim, Young-Bum Kim
In large-scale domain classification, an utterance can be handled by multiple domains with overlapped capabilities.
no code implementations • 29 May 2020 • Dookun Park, Hao Yuan, Dongmin Kim, Yinglei Zhang, Spyros Matsoukas, Young-Bum Kim, Ruhi Sarikaya, Edward Guo, Yuan Ling, Kevin Quinn, Pham Hung, Benjamin Yao, Sungjin Lee
A widely used approach to tackle this is to collect human annotation data and use it for evaluation or modeling.
no code implementations • NAACL 2021 • Mohammad Kachuee, Hao Yuan, Young-Bum Kim, Sungjin Lee
Moreover, a powerful satisfaction model can be used as an objective function that a conversational agent continuously optimizes for.
no code implementations • EMNLP 2021 • Sunghyun Park, Han Li, Ameen Patel, Sidharth Mudgal, Sungjin Lee, Young-Bum Kim, Spyros Matsoukas, Ruhi Sarikaya
Natural Language Understanding (NLU) is an established component within a conversational AI or digital assistant system, and it is responsible for producing semantic understanding of a user request.
no code implementations • 1 Mar 2021 • Ziming Li, Dookun Park, Julia Kiseleva, Young-Bum Kim, Sungjin Lee
Digital assistants are experiencing rapid growth due to their ability to assist users with day-to-day tasks, where most dialogues span multiple turns.
no code implementations • 4 Mar 2021 • Han Li, Sunghyun Park, Aswarth Dara, Jinseok Nam, Sungjin Lee, Young-Bum Kim, Spyros Matsoukas, Ruhi Sarikaya
Ensuring model robustness or resilience in the skill routing component is an important problem since skills may dynamically change their subscription in the ontology after the skill routing model has been deployed to production.
no code implementations • 26 Apr 2021 • Cheng Wang, Sun Kim, Taiwoo Park, Sajal Choudhary, Sunghyun Park, Young-Bum Kim, Ruhi Sarikaya, Sungjin Lee
Conversational AI systems such as Siri and Alexa have demonstrated their usefulness, directly impacting our daily lives.
no code implementations • Findings (ACL) 2021 • Cheng Wang, Sungjin Lee, Sunghyun Park, Han Li, Young-Bum Kim, Ruhi Sarikaya
Real-world machine learning systems achieve remarkable performance on coarse-grained metrics such as overall accuracy and F1 score.
1 code implementation • ACL 2021 • Xinnuo Xu, Guoyin Wang, Young-Bum Kim, Sungjin Lee
Natural Language Generation (NLG) is a key component in a task-oriented dialogue system, which converts a structured meaning representation (MR) into natural language.
no code implementations • 25 Sep 2021 • Joo-Kyung Kim, Guoyin Wang, Sungjin Lee, Young-Bum Kim
A large-scale conversational agent can suffer from understanding user utterances with various ambiguities such as ASR ambiguity, intent ambiguity, and hypothesis ambiguity.