Search Results for author: Seil Kang

Found 5 papers, 0 papers with code

See What You Are Told: Visual Attention Sink in Large Multimodal Models

no code implementations5 Mar 2025 Seil Kang, Jinyeong Kim, Junhyeok Kim, Seong Jae Hwang

Large multimodal models (LMMs) "see" images by leveraging the attention mechanism between text and visual tokens in the transformer decoder.

Hallucination

FALCON: Frequency Adjoint Link with CONtinuous Density Mask for Fast Single Image Dehazing

no code implementations1 Jul 2024 Donghyun Kim, Seil Kang, Seong Jae Hwang

This work introduces FALCON (Frequency Adjoint Link with CONtinuous density mask), a single-image dehazing system achieving state-of-the-art performance on both quality and speed.

Autonomous Driving Image Dehazing +2

WoLF: Wide-scope Large Language Model Framework for CXR Understanding

no code implementations19 Mar 2024 Seil Kang, Donghyun Kim, Junhyeok Kim, Hyo Kyung Lee, Seong Jae Hwang

(1) Previous methods solely use CXR reports, which are insufficient for comprehensive Visual Question Answering (VQA), especially when additional health-related data like medication history and prior diagnoses are needed.

Anatomy Instruction Following +5

Cannot find the paper you are looking for? You can Submit a new open access paper.