1 code implementation • 12 Mar 2024 • Byung-Kwan Lee, Beomchan Park, Chae Won Kim, Yong Man Ro
Therefore, we present a new large language and vision model (LLVM), Mixture of All Intelligence (MoAI), which leverages auxiliary visual information obtained from the outputs of external segmentation, detection, scene graph generation (SGG), and OCR models.
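The snippet only names the idea of leveraging external model outputs; a minimal illustration of verbalizing segmentation, detection, SGG, and OCR results into one auxiliary context block is sketched below (all function names, field names, and the example data are hypothetical, not from the paper):

```python
def build_auxiliary_context(segments, detections, relations, ocr_text):
    """Verbalize outputs of external CV models into auxiliary text.

    Hypothetical sketch of MoAI-style auxiliary visual information:
    segmentation labels, detected objects, SGG (subject, predicate,
    object) triples, and OCR text, joined into one prompt block.
    """
    lines = []
    if segments:
        lines.append("Segmented regions: " + ", ".join(segments))
    if detections:
        lines.append("Detected objects: " + ", ".join(detections))
    if relations:
        lines.append("Relations: " + "; ".join(
            f"{s} {p} {o}" for s, p, o in relations))
    if ocr_text:
        lines.append("OCR text: " + ocr_text)
    return "\n".join(lines)

# Invented example inputs, standing in for external model outputs.
ctx = build_auxiliary_context(
    segments=["sky", "road"],
    detections=["car", "stop sign"],
    relations=[("car", "near", "stop sign")],
    ocr_text="STOP",
)
```

Such a text block could then be appended to the visual prompt; the paper's actual fusion mechanism is not described in this snippet.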
Ranked #26 on Visual Question Answering on MM-Vet
no code implementations • 7 Mar 2024 • Seunghee Han, Se Jin Park, Chae Won Kim, Yong Man Ro
We devise a completeness loss and a consistency loss based on semantic similarity scores.
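The snippet does not say how these losses are computed; one plausible reading, sketched below under the assumption that "semantic similarity" means cosine similarity between sentence embeddings, is a completeness term that asks every reference embedding to be covered by some prediction, and a consistency term that aligns paired embeddings (function names and the matching scheme are hypothetical):

```python
import numpy as np

def cosine_sim(a, b):
    # Row-wise cosine similarity: a is (m, d), b is (n, d) -> (m, n).
    a_n = a / np.linalg.norm(a, axis=1, keepdims=True)
    b_n = b / np.linalg.norm(b, axis=1, keepdims=True)
    return a_n @ b_n.T

def completeness_loss(pred, ref):
    # Hypothetical: each reference embedding should be matched by its
    # most similar predicted embedding (penalize uncovered references).
    return float(np.mean(1.0 - cosine_sim(ref, pred).max(axis=1)))

def consistency_loss(pred, ref):
    # Hypothetical: paired prediction/reference embeddings should agree
    # (assumes equal count and aligned order).
    sims = np.diag(cosine_sim(pred, ref))
    return float(np.mean(1.0 - sims))
```

Both terms are zero when predictions exactly match references and grow as semantic similarity drops; the paper's actual formulation may differ.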
1 code implementation • 17 Feb 2024 • Byung-Kwan Lee, Beomchan Park, Chae Won Kim, Yong Man Ro
Our findings reveal that the image understanding capabilities of current VLMs are strongly correlated with their zero-shot performance on vision language (VL) tasks.
Ranked #34 on Visual Question Answering on MM-Vet
no code implementations • 27 Feb 2023 • Minsu Kim, Chae Won Kim, Yong Man Ro
The proposed DVFA can align the input transcription (i.e., sentence) with the talking face video without accessing the speech audio.
Automatic Speech Recognition (ASR) +3