no code implementations • 18 Jan 2023 • Megan M. Baker, Alexander New, Mario Aguilar-Simon, Ziad Al-Halah, Sébastien M. R. Arnold, Ese Ben-Iwhiwhu, Andrew P. Brna, Ethan Brooks, Ryan C. Brown, Zachary Daniels, Anurag Daram, Fabien Delattre, Ryan Dellana, Eric Eaton, Haotian Fu, Kristen Grauman, Jesse Hostetler, Shariq Iqbal, Cassandra Kent, Nicholas Ketz, Soheil Kolouri, George Konidaris, Dhireesha Kudithipudi, Erik Learned-Miller, Seungwon Lee, Michael L. Littman, Sandeep Madireddy, Jorge A. Mendez, Eric Q. Nguyen, Christine D. Piatko, Praveen K. Pilly, Aswin Raghavan, Abrar Rahman, Santhosh Kumar Ramakrishnan, Neale Ratzlaff, Andrea Soltoggio, Peter Stone, Indranil Sur, Zhipeng Tang, Saket Tiwari, Kyle Vedder, Felix Wang, Zifan Xu, Angel Yanguas-Gil, Harel Yedidsion, Shangqun Yu, Gautam K. Vallabha
Despite the advancement of machine learning techniques in recent years, state-of-the-art systems lack robustness to "real world" events, where the input distributions and tasks encountered by the deployed systems will not be limited to the original training context, and systems will instead need to adapt to novel distributions and tasks while deployed.
no code implementations • 29 Dec 2022 • Saket Tiwari, Omer Gottesman, George Konidaris
Central to our work is the idea that the transition dynamics induce a low dimensional manifold of reachable states embedded in the high-dimensional nominal state space.
no code implementations • 29 Dec 2022 • Saket Tiwari, George Konidaris
Deep neural networks can approximate functions on different types of data, from images to graphs, with varied underlying structure.
1 code implementation • 7 Jun 2022 • Haotian Fu, Shangqun Yu, Saket Tiwari, Michael Littman, George Konidaris
We propose a novel parameterized skill-learning algorithm that aims to learn transferable parameterized skills and synthesize them into a new action space that supports efficient learning in long-horizon tasks.
no code implementations • 4 Dec 2018 • Saket Tiwari, M. Prannoy
Hierarchical reinforcement learning deals with the problem of breaking down large tasks into meaningful sub-tasks.
Hierarchical Reinforcement Learning
reinforcement-learning
+2
no code implementations • 4 Dec 2018 • Saket Tiwari, Philip S. Thomas
In this paper we show how the option-critic architecture can be extended to estimate the natural gradient of the expected discounted return.