Search Results for author: Arjun R. Akula

Found 8 papers, 3 papers with code

Question Generation for Evaluating Cross-Dataset Shifts in Multi-modal Grounding

no code implementations • 24 Jan 2022 • Arjun R. Akula

Visual question answering (VQA) is the multi-modal task of answering natural language questions about an input image.

Question Answering Question Generation +2

Words aren't enough, their order matters: On the Robustness of Grounding Visual Referring Expressions

1 code implementation • ACL 2020 • Arjun R. Akula, Spandana Gella, Yaser Al-Onaizan, Song-Chun Zhu, Siva Reddy

To measure the true progress of existing models, we split the test set into two subsets: one that requires reasoning over linguistic structure and one that does not.

Contrastive Learning Multi-Task Learning +2

Discourse Parsing in Videos: A Multi-modal Approach

1 code implementation • 6 Mar 2019 • Arjun R. Akula, Song-Chun Zhu

We propose the task of Visual Discourse Parsing, which requires understanding discourse relations among scenes in a video.

Discourse Parsing Visual Dialog +1
