no code implementations • CVPR 2022 • Mona Gandhi, Mustafa Omer Gul, Eva Prakash, Madeleine Grunde-McLaughlin, Ranjay Krishna, Maneesh Agrawala
Recent video question answering benchmarks indicate that state-of-the-art models struggle to answer compositional questions.