Search Results for author: Alessandro G. Bottero

Found 5 papers, 2 papers with code

Information-Theoretic Safe Bayesian Optimization

no code implementations • 23 Feb 2024 • Alessandro G. Bottero, Carlos E. Luis, Julia Vinogradska, Felix Berkenkamp, Jan Peters

In this paper, we propose an information-theoretic safe exploration criterion that directly exploits the GP posterior to identify the most informative safe parameters to evaluate.

Bayesian Optimization Decision Making +1

Paper
Add Code

Model-Based Epistemic Variance of Values for Risk-Aware Policy Optimization

no code implementations • 7 Dec 2023 • Carlos E. Luis, Alessandro G. Bottero, Julia Vinogradska, Felix Berkenkamp, Jan Peters

Previous work upper bounds the posterior variance over values by solving a so-called uncertainty Bellman equation (UBE), but the over-approximation may result in inefficient exploration.

Model-based Reinforcement Learning Offline RL

Paper
Add Code

Value-Distributional Model-Based Reinforcement Learning

no code implementations • 12 Aug 2023 • Carlos E. Luis, Alessandro G. Bottero, Julia Vinogradska, Felix Berkenkamp, Jan Peters

We study the problem from a model-based Bayesian reinforcement learning perspective, where the goal is to learn the posterior distribution over value functions induced by parameter (epistemic) uncertainty of the Markov decision process.

Continuous Control Decision Making +3

Paper
Add Code

Model-Based Uncertainty in Value Functions

1 code implementation • 24 Feb 2023 • Carlos E. Luis, Alessandro G. Bottero, Julia Vinogradska, Felix Berkenkamp, Jan Peters

We consider the problem of quantifying uncertainty over expected cumulative rewards in model-based reinforcement learning.

Continuous Control Model-based Reinforcement Learning +3

Paper
Code

Information-Theoretic Safe Exploration with Gaussian Processes

1 code implementation • 9 Dec 2022 • Alessandro G. Bottero, Carlos E. Luis, Julia Vinogradska, Felix Berkenkamp, Jan Peters

We consider a sequential decision making task where we are not allowed to evaluate parameters that violate an a priori unknown (safety) constraint.

Decision Making Gaussian Processes +1

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.