ICLR Workshop drlStructPred 2019 • Jacob Biloki, Chen Liang, Ni Lao
We consider the problem of weakly supervised structured prediction (SP) with reinforcement learning (RL). For example, given a database table and a question, an agent performs a sequence of computation actions on the table to generate a response, and receives a binary success-failure reward for that response.
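The table example above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the action set (`filter_eq`, `select`, `total`), the `run_episode` helper, and the toy table are all hypothetical, chosen only to show a sequence of computation actions producing a response that is scored with a binary reward.

```python
from typing import Any, Callable, Dict, List, Tuple

Row = Dict[str, Any]
Action = Callable[[Any], Any]  # maps the current intermediate state to a new one


def filter_eq(column: str, value: Any) -> Action:
    """Hypothetical action: keep rows whose `column` equals `value`."""
    return lambda rows: [row for row in rows if row[column] == value]


def select(column: str) -> Action:
    """Hypothetical action: project the remaining rows onto one column."""
    return lambda rows: [row[column] for row in rows]


def total() -> Action:
    """Hypothetical action: sum a list of numbers into a single response."""
    return lambda values: sum(values)


def run_episode(table: List[Row], actions: List[Action], gold: Any) -> Tuple[Any, float]:
    """Apply the actions in order, then score the final response.

    The reward is binary: 1.0 if the response matches the gold answer,
    0.0 otherwise. No feedback is given on the intermediate actions,
    which is what makes the supervision weak.
    """
    state: Any = table
    for act in actions:
        state = act(state)
    reward = 1.0 if state == gold else 0.0
    return state, reward


# Toy table with made-up values, for illustration only.
TABLE = [
    {"team": "A", "season": 1, "wins": 3},
    {"team": "B", "season": 1, "wins": 5},
    {"team": "A", "season": 2, "wins": 4},
]

# Question: "How many wins does team A have in total?" (gold answer: 7)
good_program = [filter_eq("team", "A"), select("wins"), total()]
bad_program = [filter_eq("team", "B"), select("wins"), total()]

good_answer, good_reward = run_episode(TABLE, good_program, gold=7)
bad_answer, bad_reward = run_episode(TABLE, bad_program, gold=7)
```

Here `good_program` yields the response 7 and a reward of 1.0, while `bad_program` yields 5 and a reward of 0.0. The agent only ever sees this final success-failure signal, never which individual action was wrong.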