Search Results for author: Steffen Grunewalder

Found 3 papers, 0 papers with code

Bandits with Delayed, Aggregated Anonymous Feedback

no code implementations ICML 2018 Ciara Pike-Burke, Shipra Agrawal, Csaba Szepesvari, Steffen Grunewalder

In this problem, when the player pulls an arm, a reward is generated, however it is not immediately observed.

Approximations of the Restless Bandit Problem

no code implementations22 Feb 2017 Steffen Grunewalder, Azadeh Khaleghi

The multi-armed restless bandit problem is studied in the case where the pay-off distributions are stationary $\varphi$-mixing.

Modelling transition dynamics in MDPs with RKHS embeddings

no code implementations18 Jun 2012 Steffen Grunewalder, Guy Lever, Luca Baldassarre, Massi Pontil, Arthur Gretton

For policy optimisation we compare with least-squares policy iteration where a Gaussian process is used for value function estimation.

Cannot find the paper you are looking for? You can Submit a new open access paper.