no code implementations • 18 Oct 2023 • Nataša Bolić, Tommaso Cesari, Roberto Colomboni
If the distribution admits a density bounded by some constant $M$, then, for any time horizon $T$: $\bullet$ If the agents' valuations are revealed after each interaction, we provide an algorithm achieving regret $M \log T$ and show this rate is optimal, up to constant factors.