no code implementations • 15 Aug 2013 • Yury Sokolov, Robert Kozma, Ludmilla D. Werbos, Paul J. Werbos
This paper provides new stability results for Action-Dependent Heuristic Dynamic Programming (ADHDP), using a control algorithm that iteratively improves an internal model of the external world in the autonomous system based on its continuous interaction with the environment.
no code implementations • 5 Mar 2021 • Wilkie Olin-Ammentorp, Yury Sokolov, Maxim Bazhenov
Reinforcement learning (RL) is a foundation of learning in biological systems and provides a framework to address numerous challenges with real-world artificial intelligence applications.