no code implementations • 8 Aug 2023 • Yannick Metz, David Lindner, Raphaël Baur, Daniel Keim, Mennatallah El-Assady
To use reinforcement learning from human feedback (RLHF) in practical applications, it is crucial to learn reward models from diverse sources of human feedback and to consider human factors involved in providing feedback of different types.
no code implementations • 7 Oct 2022 • Eugene Bykovets, Yannick Metz, Mennatallah El-Assady, Daniel A. Keim, Joachim M. Buhmann
To overcome this, we formulate a Pareto optimization problem in which we simultaneously optimize for reward and OOD detection performance.
Out of Distribution (OOD) Detection Reinforcement Learning (RL)
no code implementations • 22 Aug 2022 • Eugene Bykovets, Yannick Metz, Mennatallah El-Assady, Daniel A. Keim, Joachim M. Buhmann
Robustness to adversarial perturbations has been explored in many areas of computer vision.