1 code implementation • 23 May 2019 • Domingo Esteban, Leonel Rozo, Darwin G. Caldwell
Moreover, such composition of individual policies is usually performed sequentially, which is not suitable for tasks that require to perform the subtasks concurrently.
Hierarchical Reinforcement Learning reinforcement-learning +1