no code implementations • 13 May 2023 • Ke Zhang, Yan Yang, Jun Yu, Hanliang Jiang, Jianping Fan, Qingming Huang, Weidong Han
To address this limitation, we propose a unified Med-VLP framework based on Multi-task Paired Masking with Alignment (MPMA), which integrates the cross-modal alignment task into the joint image-text reconstruction framework to achieve more comprehensive cross-modal interaction. A Global and Local Alignment (GLA) module is designed to help the self-supervised paradigm obtain semantic representations with rich domain knowledge.
1 code implementation • 19 Jan 2021 • Hanliang Jiang, Fuhao Shen, Fei Gao, Weidong Han
Moreover, an empirical study shows that the reasoning process of the learned networks conforms to physicians' diagnosis.
Ranked #1 on Neural Architecture Search on LIDC-IDRI