no code implementations • 3 Mar 2024 • Pulkit Katdare, Anant Joshi, Katherine Driggs-Campbell
In this work, we argue that this residual term is significant and correcting for it could potentially improve sample-complexity of reinforcement learning methods.
no code implementations • 2 Jul 2021 • Anant Joshi, Amirhossein Taghvaei, Prashant G. Mehta, Sean P. Meyn
This paper is concerned with optimal control problems for control systems in continuous time, and interacting particle system methods designed to construct approximate control solutions.