Search Results for author: Bhavan Jasani

Found 6 papers, 2 papers with code

Synthesize Step-by-Step: Tools, Templates and LLMs as Data Generators for Reasoning-Based Chart VQA

no code implementations • 25 Mar 2024 • Zhuowan Li, Bhavan Jasani, Peng Tang, Shabnam Ghadar

In particular, our approach improves on the accuracy of the previous state-of-the-art approach, from 38% to 54%, on the human-written questions in the ChartQA dataset, which require strong reasoning.

Data Augmentation, Question Answering, +1

YORO -- Lightweight End to End Visual Grounding

1 code implementation • 15 Nov 2022 • Chih-Hui Ho, Srikar Appalaraju, Bhavan Jasani, R. Manmatha, Nuno Vasconcelos

We present YORO, a multi-modal, encoder-only transformer architecture for the Visual Grounding (VG) task.

Natural Language Queries, Visual Grounding

Skeleton based Zero Shot Action Recognition in Joint Pose-Language Semantic Space

no code implementations • 26 Nov 2019 • Bhavan Jasani, Afshaan Mazagonwalla

In this work, we present a body-pose-based zero-shot action recognition network and demonstrate its performance on the NTU RGB-D dataset.

Action Recognition, Temporal Action Localization, +1

Are we asking the right questions in MovieQA?

no code implementations • 8 Nov 2019 • Bhavan Jasani, Rohit Girdhar, Deva Ramanan

Joint vision and language tasks like visual question answering are fascinating because they explore high-level understanding, but at the same time, can be more prone to language biases.

Question Answering, Visual Question Answering

Learning Sampling Policies for Domain Adaptation

no code implementations • 19 May 2018 • Yash Patel, Kashyap Chitta, Bhavan Jasani

We address the problem of semi-supervised domain adaptation of classification algorithms through deep Q-learning.

Classification, Domain Adaptation, +3
