Search Results for author: Mario Bravo

Found 5 papers, 0 papers with code

Stochastic Halpern iteration in normed spaces and applications to reinforcement learning

no code implementations19 Mar 2024 Mario Bravo, Juan Pablo Contreras

We analyze the oracle complexity of the stochastic Halpern iteration with variance reduction, where we aim to approximate fixed-points of nonexpansive and contractive operators in a normed finite-dimensional space.

reinforcement-learning

Bandit Learning in Concave N-Person Games

no code implementations NeurIPS 2018 Mario Bravo, David Leslie, Panayotis Mertikopoulos

This paper examines the long-run behavior of learning with bandit feedback in non-cooperative concave games.

Stochastic Optimization

Bandit learning in concave $N$-person games

no code implementations3 Oct 2018 Mario Bravo, David S. Leslie, Panayotis Mertikopoulos

This paper examines the long-run behavior of learning with bandit feedback in non-cooperative concave games.

Stochastic Optimization

On the robustness of learning in games with stochastically perturbed payoff observations

no code implementations20 Dec 2014 Mario Bravo, Panayotis Mertikopoulos

Motivated by the scarcity of accurate payoff feedback in practical applications of game theory, we examine a class of learning dynamics where players adjust their choices based on past payoff observations that are subject to noise and random disturbances.

Cannot find the paper you are looking for? You can Submit a new open access paper.