Search Results for author: Thomas Kleine Buening

Found 6 papers, 1 paper with code

Bandits Meet Mechanism Design to Combat Clickbait in Online Recommendation

no code implementations | 27 Nov 2023 | Thomas Kleine Buening, Aadirupa Saha, Christos Dimitrakakis, Haifeng Xu

We study a strategic variant of the multi-armed bandit problem, which we coin the strategic click-bandit.

Minimax-Bayes Reinforcement Learning

1 code implementation | 21 Feb 2023 | Thomas Kleine Buening, Christos Dimitrakakis, Hannes Eriksson, Divya Grover, Emilio Jorge

While the Bayesian decision-theoretic framework offers an elegant solution to the problem of decision making under uncertainty, one question is how to appropriately select the prior distribution.

Decision Making, Decision Making Under Uncertainty, +2 more

Environment Design for Inverse Reinforcement Learning

no code implementations | 26 Oct 2022 | Thomas Kleine Buening, Christos Dimitrakakis

The task of learning a reward function from expert demonstrations suffers from high sample complexity as well as inherent limitations to what can be learned from demonstrations in a given environment.

reinforcement-learning, Reinforcement Learning (RL)

ANACONDA: An Improved Dynamic Regret Algorithm for Adaptive Non-Stationary Dueling Bandits

no code implementations | 25 Oct 2022 | Thomas Kleine Buening, Aadirupa Saha

We study the problem of non-stationary dueling bandits and provide the first adaptive dynamic regret algorithm for this problem.

Interactive Inverse Reinforcement Learning for Cooperative Games

no code implementations | 8 Nov 2021 | Thomas Kleine Buening, Anne-Marie George, Christos Dimitrakakis

How should the first agent act in order to learn the joint reward function as quickly as possible and so that the joint policy is as close to optimal as possible?

reinforcement-learning, Reinforcement Learning (RL)
