no code implementations • SIGDIAL (ACL) 2020 • Megha Jhunjhunwala, Caleb Bryant, Pararth Shah
We present a novel multi-domain, multi-action dialog policy architecture trained on MultiWOZ, and show that small amounts of online supervision can lead to significant improvement in model performance.