no code implementations • 15 Sep 2023 • Hamza Cherkaoui, Merwan Barlier, Igor Colin
This paper addresses a particular instance of the multi-agent linear stochastic bandit problem, called clustered multi-agent linear bandits.
no code implementations • 15 Sep 2023 • Xuedong Shang, Igor Colin, Merwan Barlier, Hamza Cherkaoui
We introduce the safe best-arm identification framework with linear feedback, where the agent is subject to a stage-wise safety constraint that depends linearly on an unknown parameter vector.
1 code implementation • NeurIPS 2020 • Hamza Cherkaoui, Jeremias Sulam, Thomas Moreau
In this paper, we accelerate such iterative algorithms by unfolding proximal gradient descent solvers in order to learn their parameters for 1D TV regularized problems.
no code implementations • 19 Oct 2020 • Hamza Cherkaoui, Jeremias Sulam, Thomas Moreau
In this paper, we accelerate such iterative algorithms by unfolding proximal gradient descent solvers in order to learn their parameters for 1D TV regularized problems.