Search Results for author: David Meger

Found 29 papers, 19 papers with code

Trajectory-Constrained Deep Latent Visual Attention for Improved Local Planning in Presence of Heterogeneous Terrain

no code implementations 9 Dec 2021 Stefan Wapnick, Travis Manderson, David Meger, Gregory Dudek

We present a reward-predictive, model-based deep learning method featuring trajectory-constrained visual attention for use in mapless, local visual navigation tasks.

Visual Navigation

Active 3D Shape Reconstruction from Vision and Touch

1 code implementation NeurIPS 2021 Edward J. Smith, David Meger, Luis Pineda, Roberto Calandra, Jitendra Malik, Adriana Romero, Michal Drozdzal

In this paper, we focus on this problem and introduce a system composed of: 1) a haptic simulator leveraging high spatial resolution vision-based tactile sensors for active touching of 3D objects; 2) a mesh-based 3D shape reconstruction model that relies on tactile or visuotactile signals; and 3) a set of data-driven solutions with either tactile or visuotactile priors to guide the shape exploration.

3D Reconstruction 3D Shape Reconstruction

A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation

1 code implementation 12 Jun 2021 Scott Fujimoto, David Meger, Doina Precup

We bridge the gap between MIS and deep reinforcement learning by observing that the density ratio can be computed from the successor representation of the target policy.

Learning Intuitive Physics with Multimodal Generative Models

1 code implementation 12 Jan 2021 Sahand Rezaei-Shoshtari, Francois Robert Hogan, Michael Jenkin, David Meger, Gregory Dudek

Predicting the future interaction of objects when they come into contact with their environment is key for autonomous agents to take intelligent and anticipatory actions.

Practical Marginalized Importance Sampling with the Successor Representation

no code implementations 1 Jan 2021 Scott Fujimoto, David Meger, Doina Precup

We bridge the gap between MIS and deep reinforcement learning by observing that the density ratio can be computed from the successor representation of the target policy.

Intervention Design for Effective Sim2Real Transfer

no code implementations 3 Dec 2020 Melissa Mozifian, Amy Zhang, Joelle Pineau, David Meger

The goal of this work is to address the recent success of domain randomization and data augmentation for the sim2real setting.

Causal Inference Data Augmentation

Learning the Latent Space of Robot Dynamics for Cutting Interaction Inference

1 code implementation 22 Jul 2020 Sahand Rezaei-Shoshtari, David Meger, Inna Sharf

Utilization of latent space to capture a lower-dimensional representation of a complex dynamics model is explored in this work.

An Equivalence between Loss Functions and Non-Uniform Sampling in Experience Replay

1 code implementation NeurIPS 2020 Scott Fujimoto, David Meger, Doina Precup

Prioritized Experience Replay (PER) is a deep reinforcement learning technique in which agents learn from transitions sampled with non-uniform probability proportionate to their temporal-difference error.
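The non-uniform sampling described in this abstract can be illustrated with a minimal Python sketch. It assumes priorities are simply the absolute temporal-difference errors plus a small floor; the function names and the `eps` parameter are hypothetical, and refinements used in practice (such as a priority exponent or importance-sampling corrections) are left out. It is not the paper's implementation.

```python
import random


def sampling_probabilities(td_errors, eps=1e-6):
    """Probabilities proportional to |TD error|.

    `eps` is an assumed small floor so that transitions with zero
    error can still be sampled; it is not taken from the source.
    """
    priorities = [abs(e) + eps for e in td_errors]
    total = sum(priorities)
    return [p / total for p in priorities]


def sample_indices(td_errors, batch_size, rng=None):
    """Draw transition indices (with replacement) using those probabilities."""
    rng = rng or random.Random()
    probs = sampling_probabilities(td_errors)
    return rng.choices(range(len(td_errors)), weights=probs, k=batch_size)
```

For example, `sample_indices([0.1, 2.0, 0.5], batch_size=4)` would tend to return index 1 most often, since that transition has the largest error.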

3D Shape Reconstruction from Vision and Touch

1 code implementation NeurIPS 2020 Edward J. Smith, Roberto Calandra, Adriana Romero, Georgia Gkioxari, David Meger, Jitendra Malik, Michal Drozdzal

When a toddler is presented a new toy, their instinctual behaviour is to pick it up and inspect it with their hand and eyes in tandem, clearly searching over its surface to properly understand what they are playing with.

3D Shape Reconstruction

Learning to Drive Off Road on Smooth Terrain in Unstructured Environments Using an On-Board Camera and Sparse Aerial Images

no code implementations 9 Apr 2020 Travis Manderson, Stefan Wapnick, David Meger, Gregory Dudek

We present a method for learning to drive on smooth terrain while simultaneously avoiding collisions in challenging off-road and unstructured outdoor environments using only visual inputs.

Detecting GAN generated errors

no code implementations 2 Dec 2019 Xiru Zhu, Fengdi Che, Tianzi Yang, Tzuyang Yu, David Meger, Gregory Dudek

This is because the task of evaluating the quality of a generated image differs from deciding if an image is real or fake.

Unifying Variational Inference and PAC-Bayes for Supervised Learning that Scales

1 code implementation 23 Oct 2019 Sanjay Thakur, Herke van Hoof, Gunshi Gupta, David Meger

PAC Bayes is a generalized framework which is more resistant to overfitting and that yields performance bounds that hold with arbitrarily high probability even on the unjustified extrapolations.

Variational Inference

Deep learning for Aerosol Forecasting

no code implementations 14 Oct 2019 Caleb Hoyne, S. Karthik Mukkavilli, David Meger

Reanalysis datasets combining numerical physics models and limited observations to generate a synthesised estimate of variables in an Earth system, are prone to biases against ground truth.

Cascaded Gaussian Processes for Data-efficient Robot Dynamics Learning

no code implementations 5 Oct 2019 Sahand Rezaei-Shoshtari, David Meger, Inna Sharf

Motivated by the recursive Newton-Euler formulation, we propose a novel cascaded Gaussian process learning framework for the inverse dynamics of robot manipulators.

Dimensionality Reduction Gaussian Processes

Learning Domain Randomization Distributions for Training Robust Locomotion Policies

no code implementations 2 Jun 2019 Melissa Mozifian, Juan Camilo Gamboa Higuera, David Meger, Gregory Dudek

We explore the use of gradient-based search methods to learn a domain randomization with the following properties: 1) The trained policy should be successful in environments sampled from the domain randomization distribution; 2) The domain randomization distribution should be wide enough so that the experience similar to the target robot system is observed during training, while addressing the practicality of training finite capacity models.

Human Motion Prediction via Pattern Completion in Latent Representation Space

no code implementations 18 Apr 2019 Yi Tian Xu, Yaqiao Li, David Meger

Inspired by ideas in cognitive science, we propose a novel and general approach to solve human motion understanding via pattern completion on a learned latent representation space.

Action Classification General Classification +3

Uncertainty Aware Learning from Demonstrations in Multiple Contexts using Bayesian Neural Networks

1 code implementation 13 Mar 2019 Sanjay Thakur, Herke van Hoof, Juan Camilo Gamboa Higuera, Doina Precup, David Meger

Learned controllers such as neural networks typically do not have a notion of uncertainty that allows to diagnose an offset between training and testing conditions, and potentially intervene.

Off-Policy Deep Reinforcement Learning without Exploration

10 code implementations 7 Dec 2018 Scott Fujimoto, David Meger, Doina Precup

Many practical applications of reinforcement learning constrain agents to learn from a fixed batch of data which has already been gathered, without offering further possibility for data collection.

Continuous Control

Where Off-Policy Deep Reinforcement Learning Fails

no code implementations 27 Sep 2018 Scott Fujimoto, David Meger, Doina Precup

This work examines batch reinforcement learning--the task of maximally exploiting a given batch of off-policy data, without further data collection.

Continuous Control

Synthesizing Neural Network Controllers with Probabilistic Model based Reinforcement Learning

3 code implementations 6 Mar 2018 Juan Camilo Gamboa Higuera, David Meger, Gregory Dudek

Finally, we assess the performance of the algorithm for learning motor controllers for a six legged autonomous underwater vehicle.

Model-based Reinforcement Learning

Addressing Function Approximation Error in Actor-Critic Methods

46 code implementations ICML 2018 Scott Fujimoto, Herke van Hoof, David Meger

In value-based reinforcement learning methods such as deep Q-learning, function approximation errors are known to lead to overestimated value estimates and suboptimal policies.
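The overestimation effect mentioned in this abstract can be reproduced with a small Python simulation. The sketch assumes every action has a true value of zero and that each estimate carries independent zero-mean Gaussian noise; the function name and all parameters are hypothetical and chosen for illustration only. Because the greedy target takes a maximum over noisy estimates, its average lands above the true maximum of zero.

```python
import random


def mean_max_estimate(num_actions=5, noise_std=1.0, trials=10000, seed=0):
    """Average of max_a Q_hat(a) when every true Q(a) is 0.

    Each estimate is the true value (0) plus zero-mean Gaussian noise,
    standing in for function approximation error (an assumed noise model).
    """
    rng = random.Random(seed)
    total = 0.0
    for _ in range(trials):
        estimates = [rng.gauss(0.0, noise_std) for _ in range(num_actions)]
        total += max(estimates)  # greedy target: max over noisy estimates
    return total / trials
```

With a single action the average stays near zero, while with several actions it is clearly positive, even though no individual estimate is biased.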

OpenAI Gym Q-Learning

Bayesian Policy Gradients via Alpha Divergence Dropout Inference

1 code implementation 6 Dec 2017 Peter Henderson, Thang Doan, Riashat Islam, David Meger

Policy gradient methods have had great success in solving continuous control tasks, yet the stochastic nature of such problems makes deterministic value estimation difficult.

Continuous Control Policy Gradient Methods

Cost Adaptation for Robust Decentralized Swarm Behaviour

1 code implementation 21 Sep 2017 Peter Henderson, Matthew Vertescher, David Meger, Mark Coates

To allay this problem, we use a meta-learning process -- cost adaptation -- which generates the optimization objective for D-RHC to solve based on a set of human-generated priors (cost and constraint functions) and an auxiliary heuristic.


Deep Reinforcement Learning that Matters

6 code implementations 19 Sep 2017 Peter Henderson, Riashat Islam, Philip Bachman, Joelle Pineau, Doina Precup, David Meger

In recent years, significant progress has been made in solving challenging problems across various domains using deep reinforcement learning (RL).

Benchmark Environments for Multitask Learning in Continuous Domains

1 code implementation 14 Aug 2017 Peter Henderson, Wei-Di Chang, Florian Shkurti, Johanna Hansen, David Meger, Gregory Dudek

As demand drives systems to generalize to various domains and problems, the study of multitask, transfer and lifelong learning has become an increasingly important pursuit.

OpenAI Gym

Improved Adversarial Systems for 3D Object Generation and Reconstruction

3 code implementations 29 Jul 2017 Edward Smith, David Meger

This paper describes a new approach for training generative adversarial networks (GAN) to understand the detailed 3D shape of objects.
