no code implementations • 5 Apr 2024 • Lili Liang, Guanglu Sun, Jin Qiu, Lizhong Zhang
Compositional spatio-temporal reasoning poses a significant challenge in the field of video question answering (VideoQA).
Question Answering Video Question Answering