no code implementations • ICLR 2019 • Danijar Hafner, Dustin Tran, Timothy Lillicrap, Alex Irpan, James Davidson
NCPs are compatible with any model that can output uncertainty estimates, are easy to scale, and yield reliable uncertainty estimates throughout training.
no code implementations • 30 Nov 2018 • Alexander Pashevich, Danijar Hafner, James Davidson, Rahul Sukthankar, Cordelia Schmid
To achieve this, we study different modulation signals and exploration for hierarchical controllers.
9 code implementations • 12 Nov 2018 • Danijar Hafner, Timothy Lillicrap, Ian Fischer, Ruben Villegas, David Ha, Honglak Lee, James Davidson
Planning has been very successful for control tasks with known environment dynamics.
Ranked #2 on Continuous Control on DeepMind Walker Walk (Images)
no code implementations • ECCV 2018 • Tian Ye, Xiaolong Wang, James Davidson, Abhinav Gupta
In order to demonstrate that our system models these underlying physical properties, we train our model on collisions of different shapes (cube, cone, cylinder, spheres etc.)
2 code implementations • ICLR 2019 • Danijar Hafner, Dustin Tran, Timothy Lillicrap, Alex Irpan, James Davidson
NCPs are compatible with any model that can output uncertainty estimates, are easy to scale, and yield reliable uncertainty estimates throughout training.
3 code implementations • 15 May 2018 • Arsalan Mousavian, Alexander Toshev, Marek Fiser, Jana Kosecka, Ayzaan Wahid, James Davidson
We propose to using high level semantic and contextual features including segmentation and detection masks obtained by off-the-shelf state-of-the-art vision as observations and use deep network to learn the navigation policy.
no code implementations • 11 Oct 2017 • Aleksandra Faust, Oscar Ramirez, Marek Fiser, Kenneth Oslund, Anthony Francis, James Davidson, Lydia Tapia
The RL agents learn short-range, point-to-point navigation policies that capture robot dynamics and task constraints without knowledge of the large-scale topology.
2 code implementations • 8 Sep 2017 • Danijar Hafner, James Davidson, Vincent Vanhoucke
We introduce TensorFlow Agents, an efficient infrastructure paradigm for building parallel reinforcement learning algorithms in TensorFlow.
1 code implementation • 24 Aug 2017 • Xinchen Yan, Jasmine Hsu, Mohi Khansari, Yunfei Bai, Arkanath Pathak, Abhinav Gupta, James Davidson, Honglak Lee
Our contributions are fourfold: (1) To best of our knowledge, we are presenting for the first time a method to learn a 6-DOF grasping net from RGBD input; (2) We build a grasping dataset from demonstrations in virtual reality with rich sensory and interaction annotations.
no code implementations • 13 Jul 2017 • Marc Pickett, Ayush Sekhari, James Davidson
Domain knowledge can often be encoded in the structure of a network, such as convolutional layers for vision, which has been shown to increase generalization and decrease sample complexity, or the number of samples required for successful learning.
no code implementations • NeurIPS 2017 • Danijar Hafner, Alex Irpan, James Davidson, Nicolas Heess
We propose ThalNet, a deep learning model inspired by neocortical communication via the thalamus.
no code implementations • ICLR 2018 • Luke Metz, Julian Ibarz, Navdeep Jaitly, James Davidson
Specifically, we show how Q-values and policies over continuous spaces can be modeled using a next step prediction model over discretized dimensions.
7 code implementations • ICML 2017 • Lerrel Pinto, James Davidson, Rahul Sukthankar, Abhinav Gupta
Deep neural networks coupled with fast simulation and improved computation have led to recent successes in the field of reinforcement learning (RL).
6 code implementations • CVPR 2017 • Saurabh Gupta, Varun Tolani, James Davidson, Sergey Levine, Rahul Sukthankar, Jitendra Malik
The accumulated belief of the world enables the agent to track visited regions of the environment.
1 code implementation • 5 Oct 2016 • Lerrel Pinto, James Davidson, Abhinav Gupta
Due to large number of experiences required for training, most of these approaches use a self-supervised paradigm: using sensors to measure success/failure.