Search Results for author: Ellis Brown

Found 5 papers, 5 papers with code

SAT: Dynamic Spatial Aptitude Training for Multimodal Language Models

1 code implementation 10 Dec 2024 Arijit Ray, Jiafei Duan, Ellis Brown, Reuben Tan, Dina Bashkirova, Rose Hendrix, Kiana Ehsani, Aniruddha Kembhavi, Bryan A. Plummer, Ranjay Krishna, Kuo-Hao Zeng, Kate Saenko

While many studies highlight that large multimodal language models (MLMs) struggle to reason about space, they only focus on static spatial relationships, and not dynamic awareness of motion and space, i.e., reasoning about the effect of egocentric and object motions on spatial relationships.

Action Recognition, Spatial Reasoning

V-IRL: Grounding Virtual Intelligence in Real Life

1 code implementation 5 Feb 2024 Jihan Yang, Runyu Ding, Ellis Brown, Xiaojuan Qi, Saining Xie

There is a sensory gulf between the Earth that humans inhabit and the digital realms in which modern AI agents are created.

Decision Making

Your Diffusion Model is Secretly a Zero-Shot Classifier

4 code implementations ICCV 2023 Alexander C. Li, Mihir Prabhudesai, Shivam Duggal, Ellis Brown, Deepak Pathak

Our generative approach to classification, which we call Diffusion Classifier, attains strong results on a variety of benchmarks and outperforms alternative methods of extracting knowledge from diffusion models.

Domain Generalization, Fine-Grained Image Classification, +5

Internet Explorer: Targeted Representation Learning on the Open Web

1 code implementation 27 Feb 2023 Alexander C. Li, Ellis Brown, Alexei A. Efros, Deepak Pathak

Modern vision models typically rely on fine-tuning general-purpose models pre-trained on large, static datasets.

Classification, Representation Learning, +1
