Search Results for author: Yannick Metz

Found 3 papers, 0 papers with code

RLHF-Blender: A Configurable Interactive Interface for Learning from Diverse Human Feedback

no code implementations • 8 Aug 2023 • Yannick Metz, David Lindner, Raphaël Baur, Daniel Keim, Mennatallah El-Assady

To use reinforcement learning from human feedback (RLHF) in practical applications, it is crucial to learn reward models from diverse sources of human feedback and to consider human factors involved in providing feedback of different types.

Paper
Add Code

How to Enable Uncertainty Estimation in Proximal Policy Optimization

no code implementations • 7 Oct 2022 • Eugene Bykovets, Yannick Metz, Mennatallah El-Assady, Daniel A. Keim, Joachim M. Buhmann

To overcome this, we formulate a Pareto optimization problem in which we simultaneously optimize for reward and OOD detection performance.

Out of Distribution (OOD) Detection Reinforcement Learning (RL)

Paper
Add Code

BARReL: Bottleneck Attention for Adversarial Robustness in Vision-Based Reinforcement Learning

no code implementations • 22 Aug 2022 • Eugene Bykovets, Yannick Metz, Mennatallah El-Assady, Daniel A. Keim, Joachim M. Buhmann

Robustness to adversarial perturbations has been explored in many areas of computer vision.

Adversarial Robustness reinforcement-learning +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.