no code implementations • 7 Oct 2022 • Eugene Bykovets, Yannick Metz, Mennatallah El-Assady, Daniel A. Keim, Joachim M. Buhmann
To overcome this, we formulate a Pareto optimization problem in which we simultaneously optimize for reward and OOD detection performance.
Out of Distribution (OOD) Detection Reinforcement Learning (RL)
no code implementations • 22 Aug 2022 • Eugene Bykovets, Yannick Metz, Mennatallah El-Assady, Daniel A. Keim, Joachim M. Buhmann
Robustness to adversarial perturbations has been explored in many areas of computer vision.
no code implementations • 29 Sep 2021 • Ivan Ovinnikov, Eugene Bykovets, Joachim M. Buhmann
Inverse reinforcement learning methods aim to retrieve the reward function of a Markov decision process based on a dataset of expert demonstrations.
no code implementations • 18 Nov 2020 • Luis Haug, Ivan Ovinnikov, Eugene Bykovets
Given an optimality profile and a small amount of additional supervision, our algorithm fits a reward function, modeled as a neural network, by essentially minimizing the Wasserstein distance between the corresponding induced distribution and the optimality profile.