Search Results for author: Benjamin Howson

Found 2 papers, 0 papers with code

Delayed Feedback in Generalised Linear Bandits Revisited

no code implementations21 Jul 2022 Benjamin Howson, Ciara Pike-Burke, Sarah Filippi

However, the stringent requirement for immediate rewards is unmet in many real-world applications where the reward is almost always delayed.

Decision Making

Optimism and Delays in Episodic Reinforcement Learning

no code implementations15 Nov 2021 Benjamin Howson, Ciara Pike-Burke, Sarah Filippi

In this paper, we study the impact of delayed feedback in episodic reinforcement learning from a theoretical perspective and propose two general-purpose approaches to handling the delays.

reinforcement-learning Reinforcement Learning (RL)

Cannot find the paper you are looking for? You can Submit a new open access paper.