1 code implementation • 15 Mar 2024 • Rocktim Jyoti Das, Simeon Emilov Hristov, Haonan Li, Dimitar Iliyanov Dimitrov, Ivan Koychev, Preslav Nakov
Solving the problems in the dataset requires advanced perception and joint reasoning over the text and the visual content of the image.