1 code implementation • 27 Mar 2025 • Xiaoqin Wang, Xusen Ma, Xianxu Hou, Meidan Ding, Yudong Li, Junliang Chen, WenTing Chen, Xiaoyang Peng, Linlin Shen
Multimodal large language models (MLLMs) have demonstrated remarkable capabilities in various tasks.
no code implementations • 19 Dec 2024 • Gui Wang, Yuexiang Li, WenTing Chen, Meidan Ding, Wooi Ping Cheah, Rong Qu, Jianfeng Ren, Linlin Shen
Specifically, an Enhanced Visual State Space block is designed to focus on small lesions: multiple residual connections preserve local features, and channel-wise attention selectively amplifies important details while suppressing irrelevant ones.
no code implementations • 3 Dec 2024 • Yuci Liang, Xinheng Lyu, Meidan Ding, WenTing Chen, Jipeng Zhang, Yuexiang Ren, Xiangjian He, Song Wu, Sen Yang, Xiyue Wang, Xiaohan Xing, Linlin Shen
Recent advancements in computational pathology have produced patch-level Multi-modal Large Language Models (MLLMs), but these models are limited by their inability to analyze whole slide images (WSIs) comprehensively and their tendency to bypass crucial morphological features that pathologists rely on for diagnosis.