Search Results for author: WenTing Chen

Found 19 papers, 5 papers with code

{S$^3$-Mamba}: Small-Size-Sensitive Mamba for Lesion Segmentation

no code implementations19 Dec 2024 Gui Wang, Yuexiang Li, WenTing Chen, Meidan Ding, Wooi Ping Cheah, Rong Qu, Jianfeng Ren, Linlin Shen

Specifically, an Enhanced Visual State Space block is designed to focus on small lesions through multiple residual connections to preserve local features, and selectively amplify important details while suppressing irrelevant ones through channel-wise attention.

Image Segmentation Lesion Segmentation +2

DAMPER: A Dual-Stage Medical Report Generation Framework with Coarse-Grained MeSH Alignment and Fine-Grained Hypergraph Matching

no code implementations19 Dec 2024 Xiaofei Huang, WenTing Chen, Jie Liu, Qisheng Lu, Xiaoling Luo, Linlin Shen

In the first stage, a MeSH-Guided Coarse-Grained Alignment (MCG) stage that aligns chest X-ray (CXR) image features with medical subject headings (MeSH) features to generate a rough keyphrase representation of the overall impression.

Hypergraph Matching Medical Report Generation

WSI-LLaVA: A Multimodal Large Language Model for Whole Slide Image

no code implementations3 Dec 2024 Yuci Liang, Xinheng Lyu, Meidan Ding, WenTing Chen, Jipeng Zhang, Yuexiang Ren, Xiangjian He, Song Wu, Sen yang, Xiyue Wang, Xiaohan Xing, Linlin Shen

Recent advancements in computational pathology have produced patch-level Multi-modal Large Language Models (MLLMs), but these models are limited by their inability to analyze whole slide images (WSIs) comprehensively and their tendency to bypass crucial morphological features that pathologists rely on for diagnosis.

Language Modeling Language Modelling +5

GEM: Context-Aware Gaze EstiMation with Visual Search Behavior Matching for Chest Radiograph

1 code implementation10 Aug 2024 Shaonan Liu, WenTing Chen, Jie Liu, Xiaoling Luo, Linlin Shen

To understand the attention allocation and cognitive behavior of radiologists during the medical image interpretation process, we propose a context-aware Gaze EstiMation (GEM) network that utilizes eye gaze data collected from radiologists to simulate their visual search behavior patterns throughout the image interpretation process.

Gaze Estimation graph construction

Multi-Dataset Multi-Task Learning for COVID-19 Prognosis

no code implementations22 May 2024 Filippo Ruffini, Lorenzo Tronchin, Zhuoru Wu, WenTing Chen, Paolo Soda, Linlin Shen, Valerio Guarrasi

In the fight against the COVID-19 pandemic, leveraging artificial intelligence to predict disease outcomes from chest radiographic images represents a significant scientific aim.

Multi-Task Learning Prognosis

Medical Image Synthesis via Fine-Grained Image-Text Alignment and Anatomy-Pathology Prompting

no code implementations11 Mar 2024 WenTing Chen, Pengyu Wang, Hui Ren, Lichao Sun, Quanzheng Li, Yixuan Yuan, Xiang Li

To address these challenges, we propose a novel medical image synthesis model that leverages fine-grained image-text alignment and anatomy-pathology prompts to generate highly detailed and accurate synthetic medical images.

Anatomy Descriptive +1

A Spectrum Evaluation Benchmark for Medical Multi-Modal Large Language Models

no code implementations17 Feb 2024 Jie Liu, Wenxuan Wang, Yihang Su, Jingyuan Huan, WenTing Chen, Yudi Zhang, Cheng-Yi Li, Kao-Jung Chang, Xiaohan Xin, Linlin Shen, Michael R. Lyu

The significant breakthroughs of Medical Multi-Modal Large Language Models (Med-MLLMs) renovate modern healthcare with robust information synthesis and medical decision support.

Visual Question Answering (VQA)

Fine-Grained Image-Text Alignment in Medical Imaging Enables Explainable Cyclic Image-Report Generation

no code implementations13 Dec 2023 WenTing Chen, Linlin Shen, Jingyang Lin, Jiebo Luo, Xiang Li, Yixuan Yuan

To address these issues, we propose a novel Adaptive patch-word Matching (AdaMatch) model to correlate chest X-ray (CXR) image regions with words in medical reports and apply it to CXR-report generation to provide explainability for the generation process.

Language Modeling Language Modelling +1

Two-Stream Regression Network for Dental Implant Position Prediction

no code implementations17 May 2023 Xinquan Yang, Xuguang Li, Xuechen Li, WenTing Chen, Linlin Shen, Xin Li, Yongqiang Deng

In this paper, we develop a two-stream implant position regression framework (TSIPR), which consists of an implant region detector (IRD) and a multi-scale patch embedding regression network (MSPENet), to address this issue.

Position Position regression +2

Gated SwitchGAN for multi-domain facial image translation

no code implementations28 Nov 2021 Xiaokang Zhang, Yuanlue Zhu, WenTing Chen, Wenshuang Liu, Linlin Shen

The existing methods generally provide a discriminator with an auxiliary classifier to impose domain translation.

Attribute feature selection +3

Translate the Facial Regions You Like Using Region-Wise Normalization

no code implementations29 Jul 2020 Wenshuang Liu, Wenting Chen, Linlin Shen

We propose in this paper a region-wise normalization framework, for region level face translation.

Image Generation MORPH +1

TR-GAN: Topology Ranking GAN with Triplet Loss for Retinal Artery/Vein Classification

no code implementations29 Jul 2020 Wenting Chen, Shuang Yu, Junde Wu, Kai Ma, Cheng Bian, Chunyan Chu, Linlin Shen, Yefeng Zheng

A topology ranking discriminator based on ordinal regression is proposed to rank the topological connectivity level of the ground-truth, the generated A/V mask and the intentionally shuffled mask.

Classification General Classification +2

Leveraging Undiagnosed Data for Glaucoma Classification with Teacher-Student Learning

1 code implementation22 Jul 2020 Junde Wu, Shuang Yu, WenTing Chen, Kai Ma, Rao Fu, Hanruo Liu, Xiaoguang Di, Yefeng Zheng

Recently, deep learning has been adopted to the glaucoma classification task with performance comparable to that of human experts.

Classification General Classification +1

Texture Deformation Based Generative Adversarial Networks for Face Editing

no code implementations24 Dec 2018 WenTing Chen, Xinpeng Xie, Xi Jia, Linlin Shen

We also evaluate our approach qualitatively and quantitatively on facial attribute and facial expression synthesis.

Attribute Image-to-Image Translation +1

Cannot find the paper you are looking for? You can Submit a new open access paper.