no code implementations • 25 Mar 2024 • Zhuowan Li, Bhavan Jasani, Peng Tang, Shabnam Ghadar
In particular, our approach improves the accuracy of the previous state-of-the-art approach from 38% to 54% on the human-written questions in the ChartQA dataset, which require strong reasoning.
1 code implementation • 15 Nov 2022 • Chih-Hui Ho, Srikar Appalaraju, Bhavan Jasani, R. Manmatha, Nuno Vasconcelos
We present YORO - a multi-modal transformer encoder-only architecture for the Visual Grounding (VG) task.
1 code implementation • ICCV 2021 • Srikar Appalaraju, Bhavan Jasani, Bhargava Urala Kota, Yusheng Xie, R. Manmatha
DocFormer uses text, vision and spatial features and combines them using a novel multi-modal self-attention layer.
Ranked #3 on Document Image Classification on RVL-CDIP
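The entry above describes combining text, vision, and spatial features via multi-modal self-attention. A minimal sketch of that idea, using NumPy: per-token features from the three modalities are fused by summation and fed through one scaled dot-product attention head. The fusion-by-addition and the random weight matrices are illustrative assumptions, not DocFormer's exact design.

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def multimodal_self_attention(text, vision, spatial, w_q, w_k, w_v):
    """Fuse per-token text, visual, and spatial features by summation,
    then apply a single scaled dot-product self-attention head."""
    x = text + vision + spatial           # (seq_len, d) fused token features
    q, k, v = x @ w_q, x @ w_k, x @ w_v   # project to queries/keys/values
    d = q.shape[-1]
    attn = softmax(q @ k.T / np.sqrt(d))  # (seq_len, seq_len) attention weights
    return attn @ v                       # attended token representations

rng = np.random.default_rng(0)
seq, d = 4, 8
out = multimodal_self_attention(
    rng.normal(size=(seq, d)),  # text embeddings (toy values)
    rng.normal(size=(seq, d)),  # visual features (toy values)
    rng.normal(size=(seq, d)),  # spatial/layout embeddings (toy values)
    *(rng.normal(size=(d, d)) for _ in range(3)),
)
print(out.shape)  # (4, 8)
```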
no code implementations • 26 Nov 2019 • Bhavan Jasani, Afshaan Mazagonwalla
In this work, we present a body-pose-based zero-shot action recognition network and demonstrate its performance on the NTU RGB-D dataset.
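Zero-shot action recognition of the kind described above is commonly done by projecting a pose feature into a semantic space and matching it against embeddings of unseen class names. The sketch below illustrates only that generic matching step with random stand-in vectors; the projection and embeddings are hypothetical, not the paper's trained components.

```python
import numpy as np

def zero_shot_classify(pose_feat, class_embs, proj):
    """Project a pooled pose-sequence feature into the semantic space and
    return the index of the most cosine-similar class embedding."""
    z = pose_feat @ proj                 # map pose feature to semantic space
    z = z / np.linalg.norm(z)
    c = class_embs / np.linalg.norm(class_embs, axis=1, keepdims=True)
    sims = c @ z                         # cosine similarity to each class
    return int(np.argmax(sims))

rng = np.random.default_rng(1)
pose_feat = rng.normal(size=64)         # pooled skeleton-sequence feature (toy)
class_embs = rng.normal(size=(10, 32))  # embeddings of 10 unseen action names (toy)
proj = rng.normal(size=(64, 32))        # projection matrix (random stand-in)
pred = zero_shot_classify(pose_feat, class_embs, proj)
print(pred)
```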
no code implementations • 8 Nov 2019 • Bhavan Jasani, Rohit Girdhar, Deva Ramanan
Joint vision and language tasks like visual question answering are fascinating because they explore high-level understanding, but at the same time, can be more prone to language biases.
no code implementations • 19 May 2018 • Yash Patel, Kashyap Chitta, Bhavan Jasani
We address the problem of semi-supervised domain adaptation of classification algorithms through deep Q-learning.
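The entry above frames semi-supervised domain adaptation as a deep Q-learning problem. The toy tabular version below only shows the Q-update mechanic on a label-or-skip decision; the confidence buckets, reward values, and action set are all illustrative assumptions, not the paper's actual deep agent.

```python
import numpy as np

def q_learning_sample_selection(rewards, episodes=500, alpha=0.1, eps=0.2, seed=0):
    """Toy Q-learning sketch: learn, per confidence bucket of unlabeled
    target samples, whether pseudo-labeling them pays off.
    States are buckets; action 1 = pseudo-label, action 0 = skip."""
    rng = np.random.default_rng(seed)
    q = np.zeros((len(rewards), 2))            # Q[state, action] table
    for _ in range(episodes):
        s = rng.integers(len(rewards))         # random bucket this episode
        a = rng.integers(2) if rng.random() < eps else int(np.argmax(q[s]))
        r = rewards[s] if a == 1 else 0.0      # skipping yields no reward
        q[s, a] += alpha * (r - q[s, a])       # one-step (bandit-style) update
    return q

# Assumed expected rewards for low / medium / high confidence buckets.
q = q_learning_sample_selection(rewards=[-0.8, 0.2, 0.9])
print(np.argmax(q, axis=1))  # learned label/skip policy per bucket
```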