Search Results for author: David Carter

Found 1 papers, 0 papers with code

Batch Policy Gradient Methods for Improving Neural Conversation Models

no code implementations • 10 Feb 2017 • Kirthevasan Kandasamy, Yoram Bachrach, Ryota Tomioka, Daniel Tarlow, David Carter

We study reinforcement learning of chatbots with recurrent neural network architectures when the rewards are noisy and expensive to obtain.

Chatbot Policy Gradient Methods +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.