no code implementations • 5 Dec 2023 • Zhengyao Jiang, Yingchen Xu, Nolan Wagener, Yicheng Luo, Michael Janner, Edward Grefenstette, Tim Rocktäschel, Yuandong Tian
However, the extensive collection of human motion-captured data and the derived datasets of humanoid trajectories, such as MoCapAct, pave the way to tackle these challenges.
1 code implementation • 15 Aug 2022 • Nolan Wagener, Andrey Kolobov, Felipe Vieira Frujeri, Ricky Loynd, Ching-An Cheng, Matthew Hausknecht
We demonstrate the utility of MoCapAct by using it to train a single hierarchical policy capable of tracking the entire MoCap dataset within dm_control, and we show that the learned low-level component can be reused to efficiently learn downstream high-level tasks.
no code implementations • 23 Feb 2022 • Matthew Hausknecht, Nolan Wagener
Dropout has long been a staple of supervised learning, but is rarely used in reinforcement learning.
1 code implementation • 16 Jun 2021 • Nolan Wagener, Byron Boots, Ching-An Cheng
We propose a new algorithm, SAILR, that uses an intervention mechanism based on advantage functions to keep the agent safe throughout training and optimizes the agent's policy using off-the-shelf RL algorithms designed for unconstrained MDPs.
no code implementations • 24 Feb 2019 • Nolan Wagener, Ching-An Cheng, Jacob Sacks, Byron Boots
In this paper, we show that there exists a close connection between MPC and online learning, an abstract theoretical framework for analyzing online decision making in the optimization literature.
no code implementations • 26 May 2018 • Ching-An Cheng, Xinyan Yan, Nolan Wagener, Byron Boots
We show that if the switching time is properly randomized, LOKI can learn to outperform a suboptimal expert and converge faster than running policy gradient from scratch.
no code implementations • 22 Jan 2015 • Sergey Levine, Nolan Wagener, Pieter Abbeel
Autonomous learning of object manipulation skills can enable robots to acquire rich behavioral repertoires that scale to the variety of objects found in the real world.