Search Results for author: Martin Zubeldia

Found 2 papers, 0 papers with code

Concentration of Contractive Stochastic Approximation: Additive and Multiplicative Noise

no code implementations • 28 Mar 2023 • Zaiwei Chen, Siva Theja Maguluri, Martin Zubeldia

To demonstrate the applicability of our theoretical results, we use them to provide maximal concentration bounds for a large class of reinforcement learning algorithms, including but not limited to on-policy TD-learning with linear function approximation, off-policy TD-learning with generalized importance sampling factors, and $Q$-learning.

Q-Learning

Paper
Add Code

Collaboratively Learning the Best Option, Using Bounded Memory

no code implementations • 22 Feb 2018 • Lili Su, Martin Zubeldia, Nancy Lynch

We say an individual learns the best option if eventually (as $t \to \infty$) it pulls only the arm with the highest average reward.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.