Search Results for author: Seunghyun Yoon

Found 32 papers, 19 papers with code

Multimodal Intent Discovery from Livestream Videos

no code implementations Findings (NAACL) 2022 Adyasha Maharana, Quan Tran, Franck Dernoncourt, Seunghyun Yoon, Trung Bui, Walter Chang, Mohit Bansal

We construct and present a new multimodal dataset consisting of software instructional livestreams and containing manual annotations for both detailed and abstract procedural intent that enable training and evaluation of joint video and text understanding models.

Intent Discovery Video Summarization +1

Multimodal Speech Emotion Recognition using Cross Attention with Aligned Audio and Text

no code implementations26 Jul 2022 Yoonhyung Lee, Seunghyun Yoon, Kyomin Jung

Then, the attention weights of each modality are applied directly to the other modality in a crossed way, so that the CAN gathers the audio and text information from the same time steps based on each modality.

Speech Emotion Recognition

PiC: A Phrase-in-Context Dataset for Phrase Understanding and Semantic Search

1 code implementation19 Jul 2022 Thang M. Pham, Seunghyun Yoon, Trung Bui, Anh Nguyen

However, the progress of learning contextualized phrase embeddings is hindered by the lack of a human-annotated, phrase-in-context benchmark.

Information Retrieval Natural Language Understanding +4

Fine-grained Image Captioning with CLIP Reward

1 code implementation Findings (NAACL) 2022 Jaemin Cho, Seunghyun Yoon, Ajinkya Kale, Franck Dernoncourt, Trung Bui, Mohit Bansal

Toward more descriptive and distinctive caption generation, we propose using CLIP, a multimodal encoder trained on huge image-text pairs from web, to calculate multimodal similarity and use it as a reward function.

Image Captioning Image Retrieval +2

How does fake news use a thumbnail? CLIP-based Multimodal Detection on the Unrepresentative News Image

1 code implementation CONSTRAINT (ACL) 2022 Hyewon Choi, Yejun Yoon, Seunghyun Yoon, Kunwoo Park

This study investigates how fake news uses a thumbnail for a news article with a focus on whether a news article's thumbnail represents the news content correctly.

Misinformation

CAISE: Conversational Agent for Image Search and Editing

1 code implementation24 Feb 2022 Hyounghun Kim, Doo Soon Kim, Seunghyun Yoon, Franck Dernoncourt, Trung Bui, Mohit Bansal

To our knowledge, this is the first dataset that provides conversational image search and editing annotations, where the agent holds a grounded conversation with users and helps them to search and edit images according to their requests.

Image Retrieval

MACRONYM: A Large-Scale Dataset for Multilingual and Multi-Domain Acronym Extraction

no code implementations19 Feb 2022 Amir Pouran Ben Veyseh, Nicole Meister, Seunghyun Yoon, Rajiv Jain, Franck Dernoncourt, Thien Huu Nguyen

Acronym extraction is the task of identifying acronyms and their expanded forms in texts that is necessary for various NLP applications.

Simple Questions Generate Named Entity Recognition Datasets

1 code implementation16 Dec 2021 Hyunjae Kim, Jaehyo Yoo, Seunghyun Yoon, Jinhyuk Lee, Jaewoo Kang

Recent named entity recognition (NER) models often rely on human-annotated datasets requiring the vast engagement of professional knowledge on the target domain and entities.

few-shot-ner Few-shot NER +3

UMIC: An Unreferenced Metric for Image Captioning via Contrastive Learning

1 code implementation ACL 2021 Hwanhee Lee, Seunghyun Yoon, Franck Dernoncourt, Trung Bui, Kyomin Jung

Also, we observe critical problems of the previous benchmark dataset (i. e., human annotations) on image captioning metric, and introduce a new collection of human annotations on the generated captions.

Contrastive Learning Image Captioning +1

Collaborative Training of GANs in Continuous and Discrete Spaces for Text Generation

no code implementations16 Oct 2020 Yanghoon Kim, Seungpil Won, Seunghyun Yoon, Kyomin Jung

Applying generative adversarial networks (GANs) to text-related tasks is challenging due to the discrete nature of language.

Text Generation

Fast and Accurate Deep Bidirectional Language Representations for Unsupervised Learning

1 code implementation ACL 2020 Joongbo Shin, Yoonhyung Lee, Seunghyun Yoon, Kyomin Jung

Even though BERT achieves successful performance improvements in various supervised learning tasks, applying BERT for unsupervised tasks still holds a limitation that it requires repetitive inference for computing contextual language representations.

Language Modelling Semantic Similarity +1

DSTC8-AVSD: Multimodal Semantic Transformer Network with Retrieval Style Word Generator

no code implementations1 Apr 2020 Hwanhee Lee, Seunghyun Yoon, Franck Dernoncourt, Doo Soon Kim, Trung Bui, Kyomin Jung

Audio Visual Scene-aware Dialog (AVSD) is the task of generating a response for a question with a given scene, video, audio, and the history of previous turns in the dialog.

Word Embeddings

BaitWatcher: A lightweight web interface for the detection of incongruent news headlines

no code implementations23 Mar 2020 Kunwoo Park, Taegyun Kim, Seunghyun Yoon, Meeyoung Cha, Kyomin Jung

In digital environments where substantial amounts of information are shared online, news headlines play essential roles in the selection and diffusion of news articles.

Misinformation

Attentive Modality Hopping Mechanism for Speech Emotion Recognition

1 code implementation29 Nov 2019 Seunghyun Yoon, Subhadeep Dey, Hwanhee Lee, Kyomin Jung

In this work, we explore the impact of visual modality in addition to speech and text for improving the accuracy of the emotion detection system.

Emotion Classification Multimodal Emotion Recognition +1

Propagate-Selector: Detecting Supporting Sentences for Question Answering via Graph Neural Networks

1 code implementation LREC 2020 Seunghyun Yoon, Franck Dernoncourt, Doo Soon Kim, Trung Bui, Kyomin Jung

In this study, we propose a novel graph neural network called propagate-selector (PS), which propagates information over sentences to understand information that cannot be inferred when considering sentences in isolation.

Answer Selection

A Compare-Aggregate Model with Latent Clustering for Answer Selection

no code implementations30 May 2019 Seunghyun Yoon, Franck Dernoncourt, Doo Soon Kim, Trung Bui, Kyomin Jung

In this paper, we propose a novel method for a sentence-level answer-selection task that is a fundamental problem in natural language processing.

Answer Selection Language Modelling +2

Detecting Incongruity Between News Headline and Body Text via a Deep Hierarchical Encoder

2 code implementations17 Nov 2018 Seunghyun Yoon, Kunwoo Park, Joongbo Shin, Hongjun Lim, Seungpil Won, Meeyoung Cha, Kyomin Jung

Some news headlines mislead readers with overrated or false information, and identifying them in advance will better assist readers in choosing proper news stories to consume.

Data Augmentation Fake News Detection +2

Multimodal Speech Emotion Recognition Using Audio and Text

4 code implementations10 Oct 2018 Seunghyun Yoon, Seokhyun Byun, Kyomin Jung

Speech emotion recognition is a challenging task, and extensive reliance has been placed on models that use audio features in building well-performing classifiers.

Emotion Classification Multimodal Emotion Recognition +2

Learning to Rank Question-Answer Pairs using Hierarchical Recurrent Encoder with Latent Topic Clustering

3 code implementations NAACL 2018 Seunghyun Yoon, Joongbo Shin, Kyomin Jung

In this paper, we propose a novel end-to-end neural architecture for ranking candidate answers, that adapts a hierarchical recurrent neural network and a latent topic clustering module.

Answer Selection Learning-To-Rank

Efficient Transfer Learning Schemes for Personalized Language Modeling using Recurrent Neural Network

no code implementations13 Jan 2017 Seunghyun Yoon, Hyeongu Yun, Yuna Kim, Gyu-tae Park, Kyomin Jung

In this paper, we propose an efficient transfer leaning methods for training a personalized language model using a recurrent neural network with long short-term memory architecture.

Language Modelling Transfer Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.