1 code implementation • 5 Jan 2024 • Gang Liu, Jinlong He, Pengfei Li, Genrong He, Zhaolin Chen, Shenjun Zhong
In this paper, we propose a parameter efficient framework for fine-tuning MLLMs, specifically validated on medical visual question answering (Med-VQA) and medical report generation (MRG) tasks, using public benchmark datasets.
Ranked #1 on Medical Visual Question Answering on VQA-RAD (using extra training data)
Medical Report Generation Medical Visual Question Answering +4
1 code implementation • 11 Jul 2023 • Pengfei Li, Gang Liu, Jinlong He, Zixu Zhao, Shenjun Zhong
Medical visual question answering (VQA) is a challenging task that requires answering clinical questions of a given medical image, by taking consider of both visual and language information.
Ranked #1 on Medical Visual Question Answering on PathVQA