Search Results for author: Zac Yu

Found 2 papers, 1 papers with code

PERL: Parameter Efficient Reinforcement Learning from Human Feedback

no code implementations • 15 Mar 2024 • Hakim Sidahmed, Samrat Phatale, Alex Hutcheson, Zhuonan Lin, Zhang Chen, Zac Yu, Jarvis Jin, Roman Komarytsia, Christiane Ahlheim, Yonghao Zhu, Simral Chaudhary, Bowen Li, Saravanan Ganesh, Bill Byrne, Jessica Hoffmann, Hassan Mansoor, Wei Li, Abhinav Rastogi, Lucas Dixon

We investigate the setup of "Parameter Efficient Reinforcement Learning" (PERL), in which we perform reward model training and reinforcement learning using LoRA.

reinforcement-learning

Paper
Add Code

MAVE: A Product Dataset for Multi-source Attribute Value Extraction

1 code implementation • 16 Dec 2021 • Li Yang, Qifan Wang, Zac Yu, Anand Kulkarni, Sumit Sanghai, Bin Shu, Jon Elsas, Bhargav Kanagal

Attribute value extraction refers to the task of identifying values of an attribute of interest from product information.

Attribute Attribute Extraction +2

132

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.