no code implementations • 26 May 2023 • Rajeev Alur, Osbert Bastani, Kishor Jothimurugan, Mateo Perez, Fabio Somenzi, Ashutosh Trivedi
The difficulty of manually specifying reward functions has led to an interest in using linear temporal logic (LTL) to express objectives for reinforcement learning (RL).
1 code implementation • 6 Feb 2023 • Kishor Jothimurugan, Steve Hsu, Osbert Bastani, Rajeev Alur
We formulate the problem as a two agent zero-sum game in which the adversary picks the sequence of subtasks.
no code implementations • 6 Jun 2022 • Kishor Jothimurugan, Suguman Bansal, Osbert Bastani, Rajeev Alur
Our empirical evaluation demonstrates that our algorithm computes equilibrium policies with high social welfare, whereas state-of-the-art baselines either fail to compute Nash equilibria or compute ones with comparatively lower social welfare.
1 code implementation • NeurIPS 2021 • Kishor Jothimurugan, Suguman Bansal, Osbert Bastani, Rajeev Alur
Our approach then incorporates reinforcement learning to learn neural network policies for each edge (sub-task) within a Dijkstra-style planning algorithm to compute a high-level plan in the graph.
no code implementations • 5 May 2021 • Kishor Jothimurugan, Matthew Andrews, Jeongran Lee, Lorenzo Maggi
We study regenerative stopping problems in which the system starts anew whenever the controller decides to stop and the long-term average cost is to be minimized.
no code implementations • 29 Oct 2020 • Kishor Jothimurugan, Osbert Bastani, Rajeev Alur
We propose a novel hierarchical reinforcement learning framework for control with continuous state and action spaces.
Hierarchical Reinforcement Learning reinforcement-learning +1
1 code implementation • NeurIPS 2019 • Kishor Jothimurugan, Rajeev Alur, Osbert Bastani
Reinforcement learning is a promising approach for learning control policies for robot tasks.