no code implementations • 2 Nov 2020 • Nan Lin, YuXuan Li, Yujun Zhu, Ruolin Wang, Xiayu Zhang, Jianmin Ji, Keke Tang, Xiaoping Chen, Xinming Zhang
Our meta policy tries to manipulate the next optimal state and actual action is produced by the inverse dynamics model.