18 Feb 2023 • Jing-Cheng Pang, Xin-Yu Yang, Si-Hang Yang, Yang Yu
To ease the learning burden of the policy, we investigate an inside-out scheme for natural language-conditioned RL by developing a task language (TL) that is task-related and unique.