Search Results for author: Bradly C. Stadie

Found 8 papers, 5 papers with code

World Model as a Graph: Learning Latent Landmarks for Planning

1 code implementation25 Nov 2020 Lunjun Zhang, Ge Yang, Bradly C. Stadie

Planning - the ability to analyze the structure of a problem in the large and decompose it into interrelated subproblems - is a hallmark of human intelligence.

Continuous Control Graph Learning +2

Transfer Learning for Estimating Causal Effects using Neural Networks

no code implementations23 Aug 2018 Sören R. Künzel, Bradly C. Stadie, Nikita Vemuri, Varsha Ramakrishnan, Jasjeet S. Sekhon, Pieter Abbeel

We develop new algorithms for estimating heterogeneous treatment effects, combining recent developments in transfer learning for neural networks with insights from the causal inference literature.

Causal Inference Transfer Learning

One-Shot Imitation Learning

no code implementations NeurIPS 2017 Yan Duan, Marcin Andrychowicz, Bradly C. Stadie, Jonathan Ho, Jonas Schneider, Ilya Sutskever, Pieter Abbeel, Wojciech Zaremba

A neural net is trained that takes as input one demonstration and the current state (which initially is the initial state of the other demonstration of the pair), and outputs an action with the goal that the resulting sequence of states and actions matches as closely as possible with the second demonstration.

Feature Engineering Imitation Learning +1

Third-Person Imitation Learning

1 code implementation6 Mar 2017 Bradly C. Stadie, Pieter Abbeel, Ilya Sutskever

A key difficulty in reinforcement learning is specifying a reward function for the agent to optimize.

Imitation Learning reinforcement-learning +1

Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models

1 code implementation3 Jul 2015 Bradly C. Stadie, Sergey Levine, Pieter Abbeel

By parameterizing our learned model with a neural network, we are able to develop a scalable and efficient approach to exploration bonuses that can be applied to tasks with complex, high-dimensional state spaces.

Atari Games reinforcement-learning +2

Cannot find the paper you are looking for? You can Submit a new open access paper.