1 code implementation • 10 Jun 2024 • Yunsoo Kim, Jinge Wu, Yusuf Abdulle, Honghan Wu
This paper introduces MedExQA, a novel benchmark in medical question-answering, to evaluate large language models' (LLMs) understanding of medical knowledge through explanations.
no code implementations • 3 Apr 2024 • Yunsoo Kim, Jinge Wu, Yusuf Abdulle, Yue Gao, Honghan Wu
This work proposes a novel approach to enhance human-computer interaction in chest X-ray analysis using Vision-Language Models (VLMs) enhanced with radiologists' attention by incorporating eye gaze data alongside textual prompts.
no code implementations • 20 Dec 2023 • Emily Groves, Minhong Wang, Yusuf Abdulle, Holger Kunz, Jason Hoelscher-Obermaier, Ronin Wu, Honghan Wu
Five setups were designed to assess ML and FT model performance across different data availability scenarios. Datasets for curation tasks included: task 1 (620, 386), task 2 (611, 430), and task 3 (617, 381), maintaining a 50:50 positive versus negative ratio.