no code implementations • SIGDIAL (ACL) 2020 • Cheng-Hsun Hsueh, Wei-Yun Ma
To address this, in this paper, we propose semantic guidance using reinforcement learning to ensure that the generated responses indeed include the given or predicted semantics and that these semantics do not appear repeatedly in the response.