Search Results for author: Michalis Mamakos

Clustered Linear Contextual Bandits with Knapsacks

Thus, maximizing the total reward requires learning not only models about the reward and the resource consumption, but also cluster memberships.

