1 code implementation • 8 May 2024 • Wentao Tan, Changxing Ding, Jiayu Jiang, Fei Wang, Yibing Zhan, Dapeng Tao
Thus, we propose a novel method that uses MLLMs to caption images according to various templates.
Language Modelling Large Language Model +1