Search Results for author: Abhishek Das

Found 46 papers, 27 papers with code

Visual Dialog

11 code implementations • CVPR 2017 • Abhishek Das, Satwik Kottur, Khushi Gupta, Avi Singh, Deshraj Yadav, José M. F. Moura, Devi Parikh, Dhruv Batra

We introduce the task of Visual Dialog, which requires an AI agent to hold a meaningful dialog with humans in natural, conversational language about visual content.

Ranked #15 on Visual Dialog on VisDial v0.9 val

Chatbot Retrieval +1

10,425

Grad-CAM: Visual Explanations from Deep Networks via Gradient-based Localization

124 code implementations • ICCV 2017 • Ramprasaath R. Selvaraju, Michael Cogswell, Abhishek Das, Ramakrishna Vedantam, Devi Parikh, Dhruv Batra

For captioning and VQA, we show that even non-attention based models can localize inputs.

General Classification Image Classification +2

9,389

Grad-CAM: Why did you say that?

2 code implementations • 22 Nov 2016 • Ramprasaath R. Selvaraju, Abhishek Das, Ramakrishna Vedantam, Michael Cogswell, Devi Parikh, Dhruv Batra

We propose a technique for making Convolutional Neural Network (CNN)-based models more transparent by visualizing input regions that are 'important' for predictions -- or visual explanations.

Image Captioning Visual Question Answering

9,389

Embodied Question Answering

4 code implementations • CVPR 2018 • Abhishek Das, Samyak Datta, Georgia Gkioxari, Stefan Lee, Devi Parikh, Dhruv Batra

We present a new AI task -- Embodied Question Answering (EmbodiedQA) -- where an agent is spawned at a random location in a 3D environment and asked a question ("What color is the car?").

Embodied Question Answering Navigate +3

1,177

Neural Modular Control for Embodied Question Answering

2 code implementations • 26 Oct 2018 • Abhishek Das, Georgia Gkioxari, Stefan Lee, Devi Parikh, Dhruv Batra

We use imitation learning to warm-start policies at each level of the hierarchy, dramatically increasing sample efficiency, followed by reinforcement learning.

Embodied Question Answering Imitation Learning +3

1,177

The Open Catalyst 2020 (OC20) Dataset and Community Challenges

5 code implementations • 20 Oct 2020 • Lowik Chanussot, Abhishek Das, Siddharth Goyal, Thibaut Lavril, Muhammed Shuaibi, Morgane Riviere, Kevin Tran, Javier Heras-Domingo, Caleb Ho, Weihua Hu, Aini Palizhati, Anuroop Sriram, Brandon Wood, Junwoong Yoon, Devi Parikh, C. Lawrence Zitnick, Zachary Ulissi

Catalyst discovery and optimization is key to solving many societal and energy challenges including solar fuels synthesis, long-term energy storage, and renewable fertilizer production.

3D Pose Estimation

595

Rotation Invariant Graph Neural Networks using Spin Convolutions

1 code implementation • 17 Jun 2021 • Muhammed Shuaibi, Adeesh Kolluru, Abhishek Das, Aditya Grover, Anuroop Sriram, Zachary Ulissi, C. Lawrence Zitnick

We introduce a novel approach to modeling angular information between sets of neighboring atoms in a graph neural network.

Ranked #3 on Initial Structure to Relaxed Energy (IS2RE) on OC20

Initial Structure to Relaxed Energy (IS2RE)

595

GemNet-OC: Developing Graph Neural Networks for Large and Diverse Molecular Simulation Datasets

1 code implementation • 6 Apr 2022 • Johannes Gasteiger, Muhammed Shuaibi, Anuroop Sriram, Stephan Günnemann, Zachary Ulissi, C. Lawrence Zitnick, Abhishek Das

This work investigates this question by first developing the GemNet-OC model based on the large Open Catalyst 2020 (OC20) dataset.

Ranked #1 on Initial Structure to Relaxed Energy (IS2RE) on OC20

Initial Structure to Relaxed Energy (IS2RE)

595

The Open Catalyst 2022 (OC22) Dataset and Challenges for Oxide Electrocatalysts

2 code implementations • 17 Jun 2022 • Richard Tran, Janice Lan, Muhammed Shuaibi, Brandon M. Wood, Siddharth Goyal, Abhishek Das, Javier Heras-Domingo, Adeesh Kolluru, Ammar Rizvi, Nima Shoghi, Anuroop Sriram, Felix Therrien, Jehad Abed, Oleksandr Voznyy, Edward H. Sargent, Zachary Ulissi, C. Lawrence Zitnick

The development of machine learning models for electrocatalysts requires a broad set of training data to enable their use across a wide variety of materials.

BIG-bench Machine Learning Property Prediction +1

595

Spherical Channels for Modeling Atomic Interactions

2 code implementations • 29 Jun 2022 • C. Lawrence Zitnick, Abhishek Das, Adeesh Kolluru, Janice Lan, Muhammed Shuaibi, Anuroop Sriram, Zachary Ulissi, Brandon Wood

We propose the Spherical Channel Network (SCN) to model atomic energies and forces.

595

AdsorbML: A Leap in Efficiency for Adsorption Energy Calculations using Generalizable Machine Learning Potentials

2 code implementations • 29 Nov 2022 • Janice Lan, Aini Palizhati, Muhammed Shuaibi, Brandon M. Wood, Brook Wander, Abhishek Das, Matt Uyttendaele, C. Lawrence Zitnick, Zachary W. Ulissi

Computational catalysis is playing an increasingly significant role in the design of catalysts across a wide range of applications.

Benchmarking

595

The Open DAC 2023 Dataset and Challenges for Sorbent Discovery in Direct Air Capture

2 code implementations • 1 Nov 2023 • Anuroop Sriram, Sihoon Choi, Xiaohan Yu, Logan M. Brabson, Abhishek Das, Zachary Ulissi, Matt Uyttendaele, Andrew J. Medford, David S. Sholl

We also trained state-of-the-art ML models on this dataset to approximate calculations at the DFT level.

595

Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning

7 code implementations • ICCV 2017 • Abhishek Das, Satwik Kottur, José M. F. Moura, Stefan Lee, Dhruv Batra

Specifically, we pose a cooperative 'image guessing' game between two agents -- Qbot and Abot -- who communicate in natural language dialog so that Qbot can select an unseen image from a lineup of images.

reinforcement-learning Reinforcement Learning (RL) +2

189

EquiformerV2: Improved Equivariant Transformer for Scaling to Higher-Degree Representations

1 code implementation • 21 Jun 2023 • Yi-Lun Liao, Brandon Wood, Abhishek Das, Tess Smidt

Equivariant Transformers such as Equiformer have demonstrated the efficacy of applying Transformers to the domain of 3D atomistic systems.

138

Towards Training Billion Parameter Graph Neural Networks for Atomic Simulations

1 code implementation • ICLR 2022 • Anuroop Sriram, Abhishek Das, Brandon M. Wood, Siddharth Goyal, C. Lawrence Zitnick

Recent progress in Graph Neural Networks (GNNs) for modeling atomic simulations has the potential to revolutionize catalyst discovery, which is a key step in making progress towards the energy breakthroughs needed to combat climate change.

Ranked #2 on Initial Structure to Relaxed Energy (IS2RE) on OC20

Initial Structure to Relaxed Energy (IS2RE)

119

Large-scale Pretraining for Visual Dialog: A Simple State-of-the-Art Baseline

2 code implementations • ECCV 2020 • Vishvak Murahari, Dhruv Batra, Devi Parikh, Abhishek Das

Next, we find that additional finetuning using "dense" annotations in VisDial leads to even higher NDCG -- more than 10% over our base model -- but hurts MRR -- more than 17% below our base model!

Language Modelling Representation Learning +2

Feel The Music: Automatically Generating A Dance For An Input Song

1 code implementation • 21 Jun 2020 • Purva Tendulkar, Abhishek Das, Aniruddha Kembhavi, Devi Parikh

We encode intuitive, flexible heuristics for what a 'good' dance is: the structure of the dance should align with the structure of the music.

Audio Visual Scene-Aware Dialog (AVSD) Challenge at DSTC7

4 code implementations • 1 Jun 2018 • Huda Alamri, Vincent Cartillier, Raphael Gontijo Lopes, Abhishek Das, Jue Wang, Irfan Essa, Dhruv Batra, Devi Parikh, Anoop Cherian, Tim K. Marks, Chiori Hori

Scene-aware dialog systems will be able to have conversations with users about the objects and events around them.

Video Description Visual Dialog

End-to-End Audio Visual Scene-Aware Dialog using Multimodal Attention-Based Video Features

2 code implementations • 21 Jun 2018 • Chiori Hori, Huda Alamri, Jue Wang, Gordon Wichern, Takaaki Hori, Anoop Cherian, Tim K. Marks, Vincent Cartillier, Raphael Gontijo Lopes, Abhishek Das, Irfan Essa, Dhruv Batra, Devi Parikh

We introduce a new dataset of dialogs about videos of human behaviors.

Question Answering Video Description +1

PIRLNav: Pretraining with Imitation and RL Finetuning for ObjectNav

1 code implementation • CVPR 2023 • Ram Ramrakhya, Dhruv Batra, Erik Wijmans, Abhishek Das

We find that BC→RL on human demonstrations outperforms BC→RL on SP and FE trajectories, even when controlled for same BC-pretraining success on train, and even on a subset of val episodes where BC-pretraining success favors the SP or FE policies.

Imitation Learning Navigate +1

Multi-Image Steganography Using Deep Neural Networks

2 code implementations • 2 Jan 2021 • Abhishek Das, Japsimar Singh Wahi, Mansi Anand, Yugant Rana

Steganography is the science of hiding a secret message within an ordinary public message.

Image Steganography

Auxiliary Tasks and Exploration Enable ObjectNav

1 code implementation • 8 Apr 2021 • Joel Ye, Dhruv Batra, Abhishek Das, Erik Wijmans

We instead re-enable a generic learned agent by adding auxiliary learning tasks and an exploration reward.

Ranked #2 on Robot Navigation on Habitat 2020 Object Nav test-std

Auxiliary Learning Navigate +1

Audio-Visual Scene-Aware Dialog

2 code implementations • 25 Jan 2019 • Huda Alamri, Vincent Cartillier, Abhishek Das, Jue Wang, Anoop Cherian, Irfan Essa, Dhruv Batra, Tim K. Marks, Chiori Hori, Peter Anderson, Stefan Lee, Devi Parikh

We introduce the task of scene-aware dialog.

Scene-Aware Dialogue

Improving Generative Visual Dialog by Answering Diverse Questions

1 code implementation • IJCNLP 2019 • Vishvak Murahari, Prithvijit Chattopadhyay, Dhruv Batra, Devi Parikh, Abhishek Das

Prior work on training generative Visual Dialog models with reinforcement learning (Das et al.) has explored a Qbot-Abot image-guessing game and shown that this 'self-talk' approach can lead to improved performance at the downstream dialog-conditioned image-guessing task.

Representation Learning Visual Dialog

Auxiliary Tasks Speed Up Learning PointGoal Navigation

1 code implementation • 9 Jul 2020 • Joel Ye, Dhruv Batra, Erik Wijmans, Abhishek Das

PointGoal Navigation is an embodied task that requires agents to navigate to a specified point in an unseen environment.

Navigate PointGoal Navigation

Detecting Hate Speech in Multi-modal Memes

1 code implementation • 29 Dec 2020 • Abhishek Das, Japsimar Singh Wahi, SiYao Li

A crucial characteristic of the challenge is that it includes "benign confounders" to counter the possibility of models exploiting unimodal priors.

Binary Classification Hate Speech Detection +6

Smart Refrigerator using Internet of Things and Android

1 code implementation • 18 Dec 2020 • Abhishek Das, Vivek Dhuri, Ranjushree Pal

The kitchen is regarded as the central unit of traditional as well as modern homes.

Human-Computer Interaction

Evaluating Visual Conversational Agents via Cooperative Human-AI Games

no code implementations • 17 Aug 2017 • Prithvijit Chattopadhyay, Deshraj Yadav, Viraj Prabhu, Arjun Chandrasekaran, Abhishek Das, Stefan Lee, Dhruv Batra, Devi Parikh

This suggests a mismatch between benchmarking of AI in isolation and in the context of human-AI teams.

Benchmarking

Human Attention in Visual Question Answering: Do Humans and Deep Networks Look at the Same Regions?

no code implementations • 17 Jun 2016 • Abhishek Das, Harsh Agrawal, C. Lawrence Zitnick, Devi Parikh, Dhruv Batra

We conduct large-scale studies on 'human attention' in Visual Question Answering (VQA) to understand where humans choose to look to answer questions about images.

Question Answering Visual Question Answering

Human Attention in Visual Question Answering: Do Humans and Deep Networks Look at the Same Regions?

no code implementations • EMNLP 2016 • Abhishek Das, Harsh Agrawal, C. Lawrence Zitnick, Devi Parikh, Dhruv Batra

We conduct large-scale studies on 'human attention' in Visual Question Answering (VQA) to understand where humans choose to look to answer questions about images.

Question Answering Visual Question Answering

TarMAC: Targeted Multi-Agent Communication

no code implementations • ICLR 2019 • Abhishek Das, Théophile Gervet, Joshua Romoff, Dhruv Batra, Devi Parikh, Michael Rabbat, Joelle Pineau

We propose a targeted communication architecture for multi-agent reinforcement learning, where agents learn both what messages to send and whom to address them to while performing cooperative tasks in partially-observable environments.

Multi-agent Reinforcement Learning

Impact of Data Normalization on Deep Neural Network for Time Series Forecasting

no code implementations • 13 Dec 2018 • Samit Bhanja, Abhishek Das

Time series forecasting has a great impact on our socio-economic environment.

Image Classification speech-recognition +3

Connecting Language and Vision to Actions

no code implementations • ACL 2018 • Peter Anderson, Abhishek Das, Qi Wu

A long-term goal of AI research is to build intelligent agents that can see the rich visual environment around us, communicate this understanding in natural language to humans and other agents, and act in a physical or embodied environment.

Image Captioning Language Modelling +3

Response to "Visual Dialogue without Vision or Dialogue" (Massiceti et al., 2018)

no code implementations • 16 Jan 2019 • Abhishek Das, Devi Parikh, Dhruv Batra

In a recent workshop paper, Massiceti et al. presented a baseline model and subsequent critique of Visual Dialog (Das et al., CVPR 2017) that raises what we believe to be unfounded concerns about the dataset and evaluation.

Visual Dialog

Embodied Question Answering in Photorealistic Environments with Point Cloud Perception

no code implementations • CVPR 2019 • Erik Wijmans, Samyak Datta, Oleksandr Maksymets, Abhishek Das, Georgia Gkioxari, Stefan Lee, Irfan Essa, Devi Parikh, Dhruv Batra

To help bridge the gap between internet vision-style problems and the goal of vision for embodied perception, we instantiate a large-scale navigation task -- Embodied Question Answering [1] in photo-realistic environments (Matterport 3D).

Embodied Question Answering Question Answering

IR-VIC: Unsupervised Discovery of Sub-goals for Transfer in RL

no code implementations • 24 Jul 2019 • Nirbhay Modhe, Prithvijit Chattopadhyay, Mohit Sharma, Abhishek Das, Devi Parikh, Dhruv Batra, Ramakrishna Vedantam

We propose a novel framework to identify sub-goals useful for exploration in sequential decision making tasks under partial observability.

Decision Making Hierarchical Reinforcement Learning

Probing Emergent Semantics in Predictive Agents via Question Answering

no code implementations • ICML 2020 • Abhishek Das, Federico Carnevale, Hamza Merzic, Laura Rimell, Rosalia Schneider, Josh Abramson, Alden Hung, Arun Ahuja, Stephen Clark, Gregory Wayne, Felix Hill

Recent work has shown how predictive modeling can endow agents with rich knowledge of their surroundings, improving their ability to act in complex environments.

Question Answering

ForceNet: A Graph Neural Network for Large-Scale Quantum Chemistry Simulation

no code implementations • 1 Jan 2021 • Weihua Hu, Muhammed Shuaibi, Abhishek Das, Siddharth Goyal, Anuroop Sriram, Jure Leskovec, Devi Parikh, Larry Zitnick

We use ForceNet to perform quantum chemistry simulations, where ForceNet is able to achieve 4x higher success rate than existing ML models.

Atomic Forces

An Introduction to Electrocatalyst Design using Machine Learning for Renewable Energy Storage

no code implementations • 14 Oct 2020 • C. Lawrence Zitnick, Lowik Chanussot, Abhishek Das, Siddharth Goyal, Javier Heras-Domingo, Caleb Ho, Weihua Hu, Thibaut Lavril, Aini Palizhati, Morgane Riviere, Muhammed Shuaibi, Anuroop Sriram, Kevin Tran, Brandon Wood, Junwoong Yoon, Devi Parikh, Zachary Ulissi

As we increase our reliance on renewable energy sources such as wind and solar, which produce intermittent power, storage is needed to transfer power from times of peak generation to peak demand.

BIG-bench Machine Learning

ForceNet: A Graph Neural Network for Large-Scale Quantum Calculations

no code implementations • 2 Mar 2021 • Weihua Hu, Muhammed Shuaibi, Abhishek Das, Siddharth Goyal, Anuroop Sriram, Jure Leskovec, Devi Parikh, C. Lawrence Zitnick

By not imposing explicit physical constraints, we can flexibly design expressive models while maintaining their computational efficiency.

Atomic Forces Computational Efficiency +1

Auxiliary Tasks and Exploration Enable ObjectGoal Navigation

no code implementations • ICCV 2021 • Joel Ye, Dhruv Batra, Abhishek Das, Erik Wijmans

We instead re-enable a generic learned agent by adding auxiliary learning tasks and an exploration reward.

Auxiliary Learning Navigate

ABSA-Bench: Towards the Unified Evaluation of Aspect-based Sentiment Analysis Research

no code implementations • ALTA 2020 • Abhishek Das, Wei Emma Zhang

Aspect-Based Sentiment Analysis (ABSA) has gained much attention in recent years.

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +1

NarrationBot and InfoBot: A Hybrid System for Automated Video Description

no code implementations • 7 Nov 2021 • Shasta Ihorn, Yue-Ting Siu, Aditya Bodi, Lothar Narins, Jose M. Castanon, Yash Kant, Abhishek Das, Ilmi Yoon, Pooyan Fazli

To overcome the increasing gaps in video accessibility, we developed a hybrid system of two tools to 1) automatically generate descriptions for videos and 2) provide answers or additional descriptions in response to user queries on a video.

Video Description

DS-VIC: Unsupervised Discovery of Decision States for Transfer in RL

no code implementations • 25 Sep 2019 • Nirbhay Modhe, Prithvijit Chattopadhyay, Mohit Sharma, Abhishek Das, Devi Parikh, Dhruv Batra, Ramakrishna Vedantam

We learn to identify decision states, namely the parsimonious set of states where decisions meaningfully affect the future states an agent can reach in an environment.

Habitat-Web: Learning Embodied Object-Search Strategies from Human Demonstrations at Scale

no code implementations • CVPR 2022 • Ram Ramrakhya, Eric Undersander, Dhruv Batra, Abhishek Das

We present a large-scale study of imitating human demonstrations on tasks that require a virtual robot to search for objects in new environments -- (1) ObjectGoal Navigation (e.g. 'find & go to a chair') and (2) Pick&Place (e.g. 'find mug, pick mug, find counter, place mug on counter').

Imitation Learning Reinforcement Learning (RL)

Generalizing Denoising to Non-Equilibrium Structures Improves Equivariant Force Fields

no code implementations • 14 Mar 2024 • Yi-Lun Liao, Tess Smidt, Abhishek Das

We study the effectiveness of training equivariant networks with DeNS on OC20, OC22 and MD17 datasets and demonstrate that DeNS can achieve new state-of-the-art results on OC20 and OC22 and significantly improve training efficiency on MD17.

Denoising
