Search Results for author: Soonyoung Lee

Found 8 papers, 4 papers with code

ChatEXAONEPath: An Expert-level Multimodal Large Language Model for Histopathology Using Whole Slide Images

no code implementations17 Apr 2025 SangWook Kim, Soonyoung Lee, Jongseong Jang

We demonstrate the ability of diagnosing the given histopathology images using ChatEXAONEPath with the acceptance rate of 62. 9% from 1, 134 pairs of WSIs and reports.

MASH-VLM: Mitigating Action-Scene Hallucination in Video-LLMs through Disentangled Spatial-Temporal Representations

no code implementations20 Mar 2025 Kyungho Bae, Jinhyung Kim, Sihaeng Lee, Soonyoung Lee, GunHee Lee, Jinwoo Choi

Our approach includes two key innovations: (1) DST-attention, a novel attention mechanism that disentangles the spatial and temporal tokens within the LLM by using masked attention to restrict direct interactions between the spatial and temporal tokens; (2) Harmonic-RoPE, which extends the dimensionality of the positional IDs, allowing the spatial and temporal tokens to maintain balanced positions relative to the text tokens.

Hallucination Video Understanding

Bi-directional Contextual Attention for 3D Dense Captioning

no code implementations13 Aug 2024 Minjung Kim, Hyung Suk Lim, Soonyoung Lee, Bumsoo Kim, Gunhee Kim

3D dense captioning is a task involving the localization of objects and the generation of descriptions for each object in a 3D scene.

3D dense captioning Attribute +3

EXAONEPath 1.0 Patch-level Foundation Model for Pathology

1 code implementation1 Aug 2024 Juseung Yun, Yi Hu, Jinhyung Kim, Jongseong Jang, Soonyoung Lee

To address this issue, we introduce EXAONEPath, a novel foundational model trained on patches that have undergone stain normalization.

model Self-Supervised Learning +1

Exploring the Spectrum of Visio-Linguistic Compositionality and Recognition

1 code implementation13 Jun 2024 Youngtaek Oh, Pyunghwan Ahn, Jinhyung Kim, Gwangmo Song, Soonyoung Lee, In So Kweon, Junmo Kim

Vision and language models (VLMs) such as CLIP have showcased remarkable zero-shot recognition abilities yet face challenges in visio-linguistic compositionality, particularly in linguistic comprehension and fine-grained image-text alignment.

Retrieval Zero-Shot Learning

Universal Noise Annotation: Unveiling the Impact of Noisy annotation on Object Detection

1 code implementation21 Dec 2023 Kwangrok Ryoo, Yeonsik Jo, Seungjun Lee, Mira Kim, Ahra Jo, Seung Hwan Kim, Seungryong Kim, Soonyoung Lee

For object detection task with noisy labels, it is important to consider not only categorization noise, as in image classification, but also localization noise, missing annotations, and bogus bounding boxes.

Image Classification Object +2

Cannot find the paper you are looking for? You can Submit a new open access paper.