no code implementations • 29 Jun 2023 • Jinhong Ni, Yalong Bai, Wei zhang, Ting Yao, Tao Mei
Multimodal fusion integrates the complementary information present in multiple modalities and has gained much attention recently.
Visual Question Answering (VQA)