Search Results for author: Homanga Bharadhwaj

Found 28 papers, 6 papers with code

D2RL: Deep Dense Architectures in Reinforcement Learning

4 code implementations19 Oct 2020 Samarth Sinha, Homanga Bharadhwaj, Aravind Srinivas, Animesh Garg

While improvements in deep learning architectures have played a crucial role in improving the state of supervised and unsupervised learning in computer vision and natural language processing, neural network architecture choices for reinforcement learning remain relatively under-explored.

reinforcement-learning Reinforcement Learning (RL)

Model-Predictive Control via Cross-Entropy and Gradient-Based Optimization

1 code implementation L4DC 2020 Homanga Bharadhwaj, Kevin Xie, Florian Shkurti

Recent works in high-dimensional model-predictive control and model-based reinforcement learning with learned dynamics and reward models have resorted to population-based optimization methods, such as the Cross-Entropy Method (CEM), for planning a sequence of actions.

Model-based Reinforcement Learning Model Predictive Control

A Generative Framework for Zero-Shot Learning with Adversarial Domain Adaptation

1 code implementation7 Jun 2019 Varun Khare, Divyat Mahajan, Homanga Bharadhwaj, Vinay Verma, Piyush Rai

Our approach is based on end-to-end learning of the class distributions of seen classes and unseen classes.

 Ranked #1 on Zero-Shot Learning on CUB-200 - 0-Shot Learning (using extra training data)

Attribute Domain Adaptation +1

Continual Model-Based Reinforcement Learning with Hypernetworks

1 code implementation25 Sep 2020 Yizhou Huang, Kevin Xie, Homanga Bharadhwaj, Florian Shkurti

Effective planning in model-based reinforcement learning (MBRL) and model-predictive control (MPC) relies on the accuracy of the learned dynamics model.

Continual Learning Model-based Reinforcement Learning +3

Diversity inducing Information Bottleneck in Model Ensembles

1 code implementation10 Mar 2020 Samarth Sinha, Homanga Bharadhwaj, Anirudh Goyal, Hugo Larochelle, Animesh Garg, Florian Shkurti

Although deep learning models have achieved state-of-the-art performance on a number of vision tasks, generalization over high dimensional multi-modal data, and reliable predictive uncertainty estimation are still active areas of research.

Out-of-Distribution Detection

Explanations for Temporal Recommendations

no code implementations17 Jul 2018 Homanga Bharadhwaj, Shruti Joshi

Recommendation systems are an integral part of Artificial Intelligence (AI) and have become increasingly important in the growing age of commercialization in AI.

Recommendation Systems

Layer-wise Relevance Propagation for Explainable Recommendations

no code implementations17 Jul 2018 Homanga Bharadhwaj

Relationships between the images are identified by the model and layer-wise relevance propagation is used to infer pixel-level details of the images that may have significantly informed the model's choice.

A Data-Efficient Framework for Training and Sim-to-Real Transfer of Navigation Policies

no code implementations11 Oct 2018 Homanga Bharadhwaj, Zihan Wang, Yoshua Bengio, Liam Paull

Learning effective visuomotor policies for robots purely from data is challenging, but also appealing since a learning-based system should not require manual tuning or calibration.

Meta-Learning

MANGA: Method Agnostic Neural-policy Generalization and Adaptation

no code implementations19 Nov 2019 Homanga Bharadhwaj, Shoichiro Yamaguchi, Shin-ichi Maeda

Efficiently transferring learned policies to an unknown environment with changes in dynamics configurations in the presence of motor noise is very important for operating robots in the real world, and our work is a novel attempt in that direction.

Imitation Learning Reinforcement Learning (RL)

LEAF: Latent Exploration Along the Frontier

no code implementations21 May 2020 Homanga Bharadhwaj, Animesh Garg, Florian Shkurti

We target the challenging problem of policy learning from initial and goal states specified as images, and do not assume any access to the underlying ground-truth states of the robot and the environment.

Generalized Adversarially Learned Inference

no code implementations15 Jun 2020 Yatin Dandi, Homanga Bharadhwaj, Abhishek Kumar, Piyush Rai

Recent approaches, such as ALI and BiGAN frameworks, develop methods of inference of latent variables in GANs by adversarially training an image generator along with an encoder to match two joint distributions of image and latent vector pairs.

De-anonymization of authors through arXiv submissions during double-blind review

no code implementations1 Jul 2020 Homanga Bharadhwaj, Dylan Turpin, Animesh Garg, Ashton Anderson

Under two conditions: papers that are released on arXiv before the review phase and papers that are not, we examine the correlation between the reputation of their authors with the review scores and acceptance decisions.

A Bayesian Approach with Type-2 Student-tMembership Function for T-S Model Identification

no code implementations2 Sep 2020 Vikas Singh, Homanga Bharadhwaj, Nishchal K. Verma

Clustering techniques have been proved highly suc-cessful for Takagi-Sugeno (T-S) fuzzy model identification.

Clustering regression

Offline Policy Optimization with Variance Regularization

no code implementations1 Jan 2021 Riashat Islam, Samarth Sinha, Homanga Bharadhwaj, Samin Yeasar Arnob, Zhuoran Yang, Zhaoran Wang, Animesh Garg, Lihong Li, Doina Precup

Learning policies from fixed offline datasets is a key challenge to scale up reinforcement learning (RL) algorithms towards practical applications.

Continuous Control Offline RL +1

Conservative Safety Critics for Exploration

no code implementations ICLR 2021 Homanga Bharadhwaj, Aviral Kumar, Nicholas Rhinehart, Sergey Levine, Florian Shkurti, Animesh Garg

Safe exploration presents a major challenge in reinforcement learning (RL): when active data collection requires deploying partially trained policies, we must ensure that these policies avoid catastrophically unsafe regions, while still enabling trial and error learning.

Reinforcement Learning (RL) Safe Exploration

Latent Skill Planning for Exploration and Transfer

no code implementations ICLR 2021 Kevin Xie, Homanga Bharadhwaj, Danijar Hafner, Animesh Garg, Florian Shkurti

To quickly solve new tasks in complex environments, intelligent agents need to build up reusable knowledge.

Learning by Watching: Physical Imitation of Manipulation Skills from Human Videos

no code implementations18 Jan 2021 Haoyu Xiong, Quanzhou Li, Yun-Chun Chen, Homanga Bharadhwaj, Samarth Sinha, Animesh Garg

Learning from visual data opens the potential to accrue a large range of manipulation behaviors by leveraging human demonstrations without specifying each of them mathematically, but rather through natural task specification.

Keypoint Detection Robot Manipulation +1

Auditing AI models for Verified Deployment under Semantic Specifications

no code implementations25 Sep 2021 Homanga Bharadhwaj, De-An Huang, Chaowei Xiao, Anima Anandkumar, Animesh Garg

We enable such unit tests through variations in a semantically-interpretable latent space of a generative model.

Face Recognition

Auditing Robot Learning for Safety and Compliance during Deployment

no code implementations12 Oct 2021 Homanga Bharadhwaj

They are also likely going to fail and be incompliant with human preferences in increasingly subtle ways.

INFOrmation Prioritization through EmPOWERment in Visual Model-Based RL

no code implementations ICLR 2022 Homanga Bharadhwaj, Mohammad Babaeizadeh, Dumitru Erhan, Sergey Levine

We propose a modified objective for model-based RL that, in combination with mutual information maximization, allows us to learn representations and dynamics for visual model-based RL without reconstruction in a way that explicitly prioritizes functionally relevant factors.

Model-based Reinforcement Learning Reinforcement Learning (RL)

Simplifying Model-based RL: Learning Representations, Latent-space Models, and Policies with One Objective

no code implementations18 Sep 2022 Raj Ghugare, Homanga Bharadhwaj, Benjamin Eysenbach, Sergey Levine, Ruslan Salakhutdinov

In this work, we propose a single objective which jointly optimizes a latent-space model and policy to achieve high returns while remaining self-consistent.

Reinforcement Learning (RL) Value prediction

CACTI: A Framework for Scalable Multi-Task Multi-Scene Visual Imitation Learning

no code implementations12 Dec 2022 Zhao Mandi, Homanga Bharadhwaj, Vincent Moens, Shuran Song, Aravind Rajeswaran, Vikash Kumar

On a real robot setup, CACTI enables efficient training of a single policy that can perform 10 manipulation tasks involving kitchen objects, and is robust to varying layouts of distractors.

Data Augmentation Image Generation +3

Offline Policy Optimization in RL with Variance Regularizaton

no code implementations29 Dec 2022 Riashat Islam, Samarth Sinha, Homanga Bharadhwaj, Samin Yeasar Arnob, Zhuoran Yang, Animesh Garg, Zhaoran Wang, Lihong Li, Doina Precup

Learning policies from fixed offline datasets is a key challenge to scale up reinforcement learning (RL) algorithms towards practical applications.

Continuous Control Offline RL +1

Zero-Shot Robot Manipulation from Passive Human Videos

no code implementations3 Feb 2023 Homanga Bharadhwaj, Abhinav Gupta, Shubham Tulsiani, Vikash Kumar

Can we learn robot manipulation for everyday tasks, only by watching videos of humans doing arbitrary tasks in different unstructured settings?

Robot Manipulation

Visual Affordance Prediction for Guiding Robot Exploration

no code implementations28 May 2023 Homanga Bharadhwaj, Abhinav Gupta, Shubham Tulsiani

Motivated by the intuitive understanding humans have about the space of possible interactions, and the ease with which they can generalize this understanding to previously unseen scenes, we develop an approach for learning visual affordances for guiding robot exploration.

RoboAgent: Generalization and Efficiency in Robot Manipulation via Semantic Augmentations and Action Chunking

no code implementations5 Sep 2023 Homanga Bharadhwaj, Jay Vakil, Mohit Sharma, Abhinav Gupta, Shubham Tulsiani, Vikash Kumar

The grand aim of having a single robot that can manipulate arbitrary objects in diverse settings is at odds with the paucity of robotics datasets.

Chunking Robot Manipulation

Towards Generalizable Zero-Shot Manipulation via Translating Human Interaction Plans

no code implementations1 Dec 2023 Homanga Bharadhwaj, Abhinav Gupta, Vikash Kumar, Shubham Tulsiani

We pursue the goal of developing robots that can interact zero-shot with generic unseen objects via a diverse repertoire of manipulation skills and show how passive human videos can serve as a rich source of data for learning such generalist robots.

Robot Manipulation Translation

Cannot find the paper you are looking for? You can Submit a new open access paper.