Search Results for author: Nolan Wagener

Found 7 papers, 2 papers with code

H-GAP: Humanoid Control with a Generalist Planner

no code implementations • 5 Dec 2023 • Zhengyao Jiang, Yingchen Xu, Nolan Wagener, Yicheng Luo, Michael Janner, Edward Grefenstette, Tim Rocktäschel, Yuandong Tian

However, the extensive collection of human motion-captured data and the derived datasets of humanoid trajectories, such as MoCapAct, paves the way to tackle these challenges.

Humanoid Control Model Predictive Control +1

Paper
Add Code

MoCapAct: A Multi-Task Dataset for Simulated Humanoid Control

1 code implementation • 15 Aug 2022 • Nolan Wagener, Andrey Kolobov, Felipe Vieira Frujeri, Ricky Loynd, Ching-An Cheng, Matthew Hausknecht

We demonstrate the utility of MoCapAct by using it to train a single hierarchical policy capable of tracking the entire MoCap dataset within dm_control and show the learned low-level component can be re-used to efficiently learn downstream high-level tasks.

Humanoid Control

131

Paper
Code

Consistent Dropout for Policy Gradient Reinforcement Learning

no code implementations • 23 Feb 2022 • Matthew Hausknecht, Nolan Wagener

Dropout has long been a staple of supervised learning, but is rarely used in reinforcement learning.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Safe Reinforcement Learning Using Advantage-Based Intervention

1 code implementation • 16 Jun 2021 • Nolan Wagener, Byron Boots, Ching-An Cheng

We propose a new algorithm, SAILR, that uses an intervention mechanism based on advantage functions to keep the agent safe throughout training and optimizes the agent's policy using off-the-shelf RL algorithms designed for unconstrained MDPs.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Code

An Online Learning Approach to Model Predictive Control

no code implementations • 24 Feb 2019 • Nolan Wagener, Ching-An Cheng, Jacob Sacks, Byron Boots

In this paper, we show that there exists a close connection between MPC and online learning, an abstract theoretical framework for analyzing online decision making in the optimization literature.

Decision Making Model Predictive Control

Paper
Add Code

Fast Policy Learning through Imitation and Reinforcement

no code implementations • 26 May 2018 • Ching-An Cheng, Xinyan Yan, Nolan Wagener, Byron Boots

We show that if the switching time is properly randomized, LOKI can learn to outperform a suboptimal expert and converge faster than running policy gradient from scratch.

Imitation Learning Reinforcement Learning (RL)

Paper
Add Code

Learning Contact-Rich Manipulation Skills with Guided Policy Search

no code implementations • 22 Jan 2015 • Sergey Levine, Nolan Wagener, Pieter Abbeel

Autonomous learning of object manipulation skills can enable robots to acquire rich behavioral repertoires that scale to the variety of objects found in the real world.

Robotics

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.