Search Results for author: Martin Zubeldia

Found 2 papers, 0 papers with code

Concentration of Contractive Stochastic Approximation: Additive and Multiplicative Noise

no code implementations28 Mar 2023 Zaiwei Chen, Siva Theja Maguluri, Martin Zubeldia

To demonstrate the applicability of our theoretical results, we use them to provide maximal concentration bounds for a large class of reinforcement learning algorithms, including but not limited to on-policy TD-learning with linear function approximation, off-policy TD-learning with generalized importance sampling factors, and $Q$-learning.

Q-Learning

Collaboratively Learning the Best Option, Using Bounded Memory

no code implementations22 Feb 2018 Lili Su, Martin Zubeldia, Nancy Lynch

We say an individual learns the best option if eventually (as $t \to \infty$) it pulls only the arm with the highest average reward.

Cannot find the paper you are looking for? You can Submit a new open access paper.