Search Results for author: Cuong Nhat Ha

Found 1 papers, 0 papers with code

Fusion of Domain-Adapted Vision and Language Models for Medical Visual Question Answering

no code implementations • 24 Apr 2024 • Cuong Nhat Ha, Shima Asaadi, Sanjeev Kumar Karn, Oladimeji Farri, Tobias Heimann, Thomas Runkler

Vision-language models, while effective in general domains and showing strong performance in diverse multi-modal applications like visual question-answering (VQA), struggle to maintain the same level of effectiveness in more specialized domains, e. g., medical.

Language Modelling Medical Visual Question Answering +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.