Search Results for author: Huiying Zhong

Found 1 paper, 0 papers with code

Provable Multi-Party Reinforcement Learning with Diverse Human Feedback

no code implementations • 8 Mar 2024 • Huiying Zhong, Zhun Deng, Weijie J. Su, Zhiwei Steven Wu, Linjun Zhang

Our work initiates the theoretical study of multi-party RLHF that explicitly models the diverse preferences of multiple individuals.

Fairness, Meta-Learning, +1

