no code implementations • 5 Mar 2024 • Amaury Gouverneur, Borja Rodríguez-Gálvez, Tobias J. Oechtering, Mikael Skoglund
This paper studies the Bayesian regret of a variant of the Thompson-Sampling algorithm for bandit problems.
no code implementations • 26 Apr 2023 • Amaury Gouverneur, Borja Rodríguez-Gálvez, Tobias J. Oechtering, Mikael Skoglund
In this work, we study the performance of the Thompson Sampling algorithm for Contextual Bandit problems based on the framework introduced by Neu et al. and their concept of lifted information ratio.
no code implementations • 18 Jul 2022 • Amaury Gouverneur, Borja Rodríguez-Gálvez, Tobias J. Oechtering, Mikael Skoglund
Building on the framework introduced by Xu and Raginksy [1] for supervised learning problems, we study the best achievable performance for model-based Bayesian reinforcement learning problems.
no code implementations • 10 Apr 2020 • Dominik Fay, Jens Sjölund, Tobias J. Oechtering
For this reason, we turn our attention to Private Aggregation of Teacher Ensembles (PATE), where all local models can be trained independently without inter-institutional communication.
no code implementations • 19 Jan 2020 • Jaya Prakash Champati, Ramana R. Avula, Tobias J. Oechtering, James Gross
There has been a significant research effort in optimizing this metric in communication and networking systems under different settings.
1 code implementation • 21 Mar 2018 • Ramana R. Avula, Tobias J. Oechtering, Daniel Månsson
In this paper, we present a one-step-ahead predictive control strategy using Bayesian risk to measure and control privacy leakage with an energy storage system.
Signal Processing