1 code implementation • 17 May 2019 • Zheng Tian, Ying Wen, Zhichen Gong, Faiz Punakkath, Shihao Zou, Jun Wang
In a single-agent setting, reinforcement learning (RL) tasks can be cast into an inference problem by introducing a binary random variable o, which stands for the "optimality".
Multi-agent Reinforcement Learning reinforcement-learning +1
no code implementations • 3 Mar 2017 • Zhichen Gong, Huanhuan Chen
Thus DSW is able to yield alignment that is semantically more interpretable than that of DTW.