no code implementations • 21 Feb 2024 • Gergely Neu, Nneka Okolo
We study the performance of stochastic first-order methods for finding saddle points of convex-concave functions.
no code implementations • 22 May 2023 • Germano Gabbianelli, Gergely Neu, Nneka Okolo, Matteo Papini
Offline Reinforcement Learning (RL) aims to learn a near-optimal policy from a fixed dataset of transitions collected by another policy.
no code implementations • 21 Oct 2022 • Gergely Neu, Nneka Okolo
We propose a new stochastic primal-dual optimization algorithm for planning in a large discounted Markov decision process with a generative model and linear function approximation.