no code implementations • 17 Apr 2025 • SangWook Kim, Soonyoung Lee, Jongseong Jang
We demonstrate the ability of diagnosing the given histopathology images using ChatEXAONEPath with the acceptance rate of 62. 9% from 1, 134 pairs of WSIs and reports.
no code implementations • 20 Mar 2025 • Kyungho Bae, Jinhyung Kim, Sihaeng Lee, Soonyoung Lee, GunHee Lee, Jinwoo Choi
Our approach includes two key innovations: (1) DST-attention, a novel attention mechanism that disentangles the spatial and temporal tokens within the LLM by using masked attention to restrict direct interactions between the spatial and temporal tokens; (2) Harmonic-RoPE, which extends the dimensionality of the positional IDs, allowing the spatial and temporal tokens to maintain balanced positions relative to the text tokens.
no code implementations • 14 Aug 2024 • Minjung Kim, Hyung Suk Lim, Seung Hwan Kim, Soonyoung Lee, Bumsoo Kim, Gunhee Kim
SIA simultaneously decodes two sets of queries-context query and instance query.
no code implementations • 13 Aug 2024 • Minjung Kim, Hyung Suk Lim, Soonyoung Lee, Bumsoo Kim, Gunhee Kim
3D dense captioning is a task involving the localization of objects and the generation of descriptions for each object in a 3D scene.
1 code implementation • 1 Aug 2024 • Juseung Yun, Yi Hu, Jinhyung Kim, Jongseong Jang, Soonyoung Lee
To address this issue, we introduce EXAONEPath, a novel foundational model trained on patches that have undergone stain normalization.
1 code implementation • 13 Jun 2024 • Youngtaek Oh, Pyunghwan Ahn, Jinhyung Kim, Gwangmo Song, Soonyoung Lee, In So Kweon, Junmo Kim
Vision and language models (VLMs) such as CLIP have showcased remarkable zero-shot recognition abilities yet face challenges in visio-linguistic compositionality, particularly in linguistic comprehension and fine-grained image-text alignment.
1 code implementation • 21 Dec 2023 • Kwangrok Ryoo, Yeonsik Jo, Seungjun Lee, Mira Kim, Ahra Jo, Seung Hwan Kim, Seungryong Kim, Soonyoung Lee
For object detection task with noisy labels, it is important to consider not only categorization noise, as in image classification, but also localization noise, missing annotations, and bogus bounding boxes.
1 code implementation • CVPR 2022 • TaeHoon Kim, Gwangmo Song, Sihaeng Lee, Sangyun Kim, Yewon Seo, Soonyoung Lee, Seung Hwan Kim, Honglak Lee, Kyunghoon Bae
Unlike other models, BiART can distinguish between image (or text) as a conditional reference and a generation target.
Ranked #1 on
Image Reconstruction
on ImageNet 256x256