Search Results for author: Douglas Boubert

Found 1 papers, 0 papers with code

Rewarding Chatbots for Real-World Engagement with Millions of Users

no code implementations10 Mar 2023 Robert Irvine, Douglas Boubert, Vyas Raina, Adian Liusie, Ziyi Zhu, Vineet Mudupalli, Aliaksei Korshuk, Zongyi Liu, Fritz Cremer, Valentin Assassi, Christie-Carol Beauchamp, Xiaoding Lu, Thomas Rialan, William Beauchamp

The proposed approach uses automatic pseudo-labels collected from user interactions to train a reward model that can be used to reject low-scoring sample responses generated by the chatbot model at inference time.

Chatbot Language Modelling

Cannot find the paper you are looking for? You can Submit a new open access paper.