1 code implementation • 26 Nov 2022 • Max Siebenborn, Boris Belousov, Junning Huang, Jan Peters
On the other hand, the proposed Decision LSTM is able to achieve expert-level performance on these tasks, in addition to learning a swing-up controller on the real system.
no code implementations • 30 Nov 2019 • Junning Huang, Sirui Xie, Jiankai Sun, Qiurui Ma, Chunxiao Liu, Jianping Shi, Dahua Lin, Bolei Zhou
In this work, we propose a hybrid framework to learn neural decisions in the classical modular pipeline through end-to-end imitation learning.
no code implementations • ICLR 2019 • Sirui Xie, Junning Huang, Lanxin Lei, Chunxiao Liu, Zheng Ma, Wei Zhang, Liang Lin
Reinforcement learning agents need exploratory behaviors to escape from local optima.