1 code implementation • 31 Dec 2023 • Vardaan Pahuja, Weidi Luo, Yu Gu, Cheng-Hao Tu, Hong-You Chen, Tanya Berger-Wolf, Charles Stewart, Song Gao, Wei-Lun Chao, Yu Su
In this work, we exploit the structured context linked to camera trap images to boost out-of-distribution generalization for species classification tasks in camera traps.
no code implementations • 6 Jun 2023 • Vardaan Pahuja, AJ Piergiovanni, Anelia Angelova
Building joint representations across images and text is an essential step for tasks such as Visual Question Answering and Video Question Answering.
1 code implementation • 19 Dec 2022 • Vardaan Pahuja, Boshi Wang, Hugo Latapie, Jayanth Srinivasa, Yu Su
To address the limitations of existing KG link prediction frameworks, we propose a novel retrieve-and-read framework, which first retrieves a relevant subgraph context for the query and then jointly reasons over the context and the query with a high-capacity reader.
Ranked #2 on Link Prediction on FB15k-237
no code implementations • 12 Sep 2022 • Yu Gu, Vardaan Pahuja, Gong Cheng, Yu Su
In this survey, we situate KBQA in the broader literature of semantic parsing and give a comprehensive account of how existing KBQA approaches attempt to address the unique challenges.
1 code implementation • ACL 2021 • Vardaan Pahuja, Yu Gu, Wenhu Chen, Mehdi Bahrami, Lei Liu, Wei-Peng Chen, Yu Su
Knowledge bases (KBs) and text often contain complementary knowledge: KBs store structured knowledge that can support long range reasoning, while text stores more comprehensive and timely knowledge in an unstructured way.
no code implementations • 19 Sep 2019 • Vardaan Pahuja, Jie Fu, Christopher J. Pal
We aim to tackle this issue for the specific task of Visual Question Answering (VQA).
no code implementations • WS 2019 • Vardaan Pahuja, Jie Fu, Sarath Chandar, Christopher J. Pal
In current formulations of such networks, only the parameters of the neural modules and/or the order of their execution are learned.
1 code implementation • 28 May 2018 • Shagun Sodhani, Vardaan Pahuja
Self-play is an unsupervised training procedure which enables the reinforcement learning agents to explore the environment without requiring any external rewards.
no code implementations • 21 May 2018 • Shagun Sodhani, Vardaan Pahuja
This is the reproducibility report for the paper "Learning To Count Objects In Natural Images For Visual Question Answering".
1 code implementation • 31 Jan 2018 • Amrita Saha, Vardaan Pahuja, Mitesh M. Khapra, Karthik Sankaranarayanan, Sarath Chandar
Further, unlike existing large scale QA datasets which contain simple questions that can be answered from a single tuple, the questions in our dialogs require a larger subgraph of the KG.
no code implementations • 14 Mar 2017 • Vardaan Pahuja, Anirban Laha, Shachar Mirkin, Vikas Raykar, Lili Kotlerman, Guy Lev
The stream of words produced by Automatic Speech Recognition (ASR) systems is typically devoid of punctuation and formatting.