Search Results for author: Paul Duff

Short-lived High-volume Multi-A(rmed)/B(andits) Testing

We aim to minimize the loss due to not knowing the mean rewards, averaged over instances generated from a given prior distribution.

