Search Results for author: Sungjin Park

Found 11 papers, 6 papers with code

FreeTalky: Don’t Be Afraid! Conversations Made Easier by a Humanoid Robot using Persona-based Dialogue

no code implementations LREC 2022 Chanjun Park, Yoonna Jang, Seolhwa Lee, Sungjin Park, Heuiseok Lim

We propose a deep learning-based foreign language learning platform, named FreeTalky, for people who experience anxiety dealing with foreign languages, by employing a humanoid robot NAO and various deep learning models.

Multimodal Transformer With a Low-Computational-Cost Guarantee

no code implementations23 Feb 2024 Sungjin Park, Edward Choi

Transformer-based models have significantly improved performance across a range of multimodal understanding tasks, such as visual question answering and action recognition.

Action Recognition Question Answering +1

FactKG: Fact Verification via Reasoning on Knowledge Graphs

1 code implementation11 May 2023 Jiho Kim, Sungjin Park, Yeonsu Kwon, Yohan Jo, James Thorne, Edward Choi

KGs can be a valuable knowledge source in fact verification due to their reliability and broad applicability.

Fact Verification Knowledge Graphs +1

Do Language Models Understand Measurements?

no code implementations23 Oct 2022 Sungjin Park, Seungwoo Ryu, Edward Choi

Recent success of pre-trained language models (PLMs) has stimulated interest in their ability to understand and work with numbers.

Language Modelling

Unconditional Image-Text Pair Generation with Multimodal Cross Quantizer

1 code implementation15 Apr 2022 Hyungyung Lee, Sungjin Park, Joonseok Lee, Edward Choi

To learn a multimodal semantic correlation in a quantized space, we combine VQ-VAE with a Transformer encoder and apply an input masking strategy.

multimodal generation Quantization

Graph-Text Multi-Modal Pre-training for Medical Representation Learning

1 code implementation18 Mar 2022 Sungjin Park, Seongsu Bae, Jiho Kim, Tackeun Kim, Edward Choi

MedGTX uses a novel graph encoder to exploit the graphical nature of structured EHR data, and a text encoder to handle unstructured text, and a cross-modal encoder to learn a joint representation space.

Representation Learning

FreeTalky: Don't Be Afraid! Conversations Made Easier by a Humanoid Robot using Persona-based Dialogue

no code implementations8 Dec 2021 Chanjun Park, Yoonna Jang, Seolhwa Lee, Sungjin Park, Heuiseok Lim

We propose a deep learning-based foreign language learning platform, named FreeTalky, for people who experience anxiety dealing with foreign languages, by employing a humanoid robot NAO and various deep learning models.

Real-time Denoising and Dereverberation with Tiny Recurrent U-Net

1 code implementation5 Feb 2021 Hyeong-Seok Choi, Sungjin Park, Jie Hwan Lee, Hoon Heo, Dongsuk Jeon, Kyogu Lee

Modern deep learning-based models have seen outstanding performance improvement with speech enhancement tasks.

Denoising Speech Enhancement

Multi-View Attention Network for Visual Dialog

1 code implementation29 Apr 2020 Sungjin Park, Taesun Whang, Yeochan Yoon, Heuiseok Lim

To resolve the visual dialog task, a high-level understanding of various multimodal inputs (e. g., question, dialog history, and image) is required.

Visual Dialog

Cannot find the paper you are looking for? You can Submit a new open access paper.