no code implementations • 1 Jan 2021 • Junyoung Park, Sanzhar Bakhtiyarov, Jinkyoo Park
From the RL perspective, Minmax mTSP raises several significant challenges, such as the cooperation of multiple workers and the need for a well-engineered reward function.
no code implementations • ICLR Workshop DeepDiffEq 2019 • Stefano Massaroli, Michael Poli, Sanzhar Bakhtiyarov, Atsushi Yamashita, Hajime Asama, Jinkyoo Park
Action spaces equipped with parameter sets are a common occurrence in reinforcement learning applications.
Hierarchical Reinforcement Learning, Reinforcement Learning