Search Results for author: Chenyi Zhou

Found 2 papers, 1 paper with code

MedThink: Explaining Medical Visual Question Answering via Multimodal Decision-Making Rationale

no code implementations • 18 Apr 2024 • Xiaotang Gai, Chenyi Zhou, Jiaxiang Liu, Yang Feng, Jian Wu, Zuozhu Liu

Moreover, we design a novel framework which finetunes lightweight pretrained generative models by incorporating medical decision-making rationales into the training process.

Decision Making, Medical Visual Question Answering, +2

Joint Visual and Text Prompting for Improved Object-Centric Perception with Multimodal Large Language Models

2 code implementations • 6 Apr 2024 • Songtao Jiang, Yan Zhang, Chenyi Zhou, Yeying Jin, Yang Feng, Jian Wu, Zuozhu Liu

In this paper, we present a novel approach, Joint Visual and Text Prompting (VTPrompt), that employs fine-grained visual information to enhance the capability of MLLMs in VQA, especially for object-oriented perception.

Object, Question Answering, +1
