Search Results for author: Yannick Metz

Found 3 papers, 0 papers with code

RLHF-Blender: A Configurable Interactive Interface for Learning from Diverse Human Feedback

no code implementations8 Aug 2023 Yannick Metz, David Lindner, Raphaël Baur, Daniel Keim, Mennatallah El-Assady

To use reinforcement learning from human feedback (RLHF) in practical applications, it is crucial to learn reward models from diverse sources of human feedback and to consider human factors involved in providing feedback of different types.

Cannot find the paper you are looking for? You can Submit a new open access paper.