Search Results for author: Raiymbek Akshulakov

Found 2 papers, 2 papers with code

Do Vision and Language Encoders Represent the World Similarly?

1 code implementation 10 Jan 2024 Mayug Maniparambil, Raiymbek Akshulakov, Yasser Abdelaziz Dahou Djilali, Sanath Narayan, Mohamed El Amine Seddik, Karttikeya Mangalam, Noel E. O'Connor

In the absence of statistical similarity in aligned encoders like CLIP, we show that a possible matching of unaligned encoders exists without any training.

Graph Matching, Image Classification +3

EgoSchema: A Diagnostic Benchmark for Very Long-form Video Language Understanding

1 code implementation NeurIPS 2023 Karttikeya Mangalam, Raiymbek Akshulakov, Jitendra Malik

We introduce EgoSchema, a very long-form video question-answering dataset, and benchmark to evaluate long video understanding capabilities of modern vision and language systems.

Multiple-choice Question Answering +2
