Search Results for author: Saravanan Ganesh

Found 2 papers, 0 papers with code

PERL: Parameter Efficient Reinforcement Learning from Human Feedback

no code implementations • 15 Mar 2024 • Hakim Sidahmed, Samrat Phatale, Alex Hutcheson, Zhuonan Lin, Zhang Chen, Zac Yu, Jarvis Jin, Roman Komarytsia, Christiane Ahlheim, Yonghao Zhu, Simral Chaudhary, Bowen Li, Saravanan Ganesh, Bill Byrne, Jessica Hoffmann, Hassan Mansoor, Wei Li, Abhinav Rastogi, Lucas Dixon

We investigate the setup of "Parameter Efficient Reinforcement Learning" (PERL), in which we perform reward model training and reinforcement learning using LoRA.

reinforcement-learning

TicketTalk: Toward human-level performance with end-to-end, transaction-based dialog systems

no code implementations • ACL 2021 • Bill Byrne, Karthik Krishnamoorthi, Saravanan Ganesh, Mihir Sanjay Kale

In terms of data, we introduce TicketTalk, a movie ticketing dialog dataset with 23,789 annotated conversations.

Response Generation
