1 code implementation • 13 May 2023 • Patrick Wienhöft, Marnix Suilen, Thiago D. Simão, Clemens Dubslaff, Christel Baier, Nils Jansen
In an offline reinforcement learning setting, the safe policy improvement (SPI) problem aims to improve the performance of a behavior policy according to which sample data has been generated.
no code implementations • 22 Mar 2023 • Christel Baier, Clemens Dubslaff, Patrick Wienhöft, Stefan J. Kiebel
A central task in control theory, artificial intelligence, and formal methods is to synthesize reward-maximizing strategies for agents that operate in partially unknown environments.
no code implementations • 20 Jan 2023 • Christel Baier, Clemens Dubslaff, Holger Hermanns, Nikolai Käfer
Bayesian networks (BNs) are a probabilistic graphical model widely used for representing expert knowledge and reasoning under uncertainty.