Search Results for author: Luc Siecker

Found 2 papers, 0 papers with code

Multi-Armed Bandits with Generalized Temporally-Partitioned Rewards

no code implementations • 1 Mar 2023 • Ronald C. van den Broek, Rik Litjens, Tobias Sagis, Luc Siecker, Nina Verbeeke, Pratik Gajane

In some real-world applications, feedback about a decision is delayed and may arrive via partial rewards that are observed with different delays.

Decision Making, Multi-Armed Bandits

Generalizing distribution of partial rewards for multi-armed bandits with temporally-partitioned rewards

no code implementations • 13 Nov 2022 • Ronald C. van den Broek, Rik Litjens, Tobias Sagis, Luc Siecker, Nina Verbeeke, Pratik Gajane

In this paper, we introduce a general formulation of how an arm's cumulative reward is distributed across several rounds, called the Beta-spread property.

Multi-Armed Bandits
