no code implementations • 6 Mar 2024 • Antoine Scheid, Daniil Tiapkin, Etienne Boursier, Aymeric Capitaine, El Mahdi El Mhamdi, Eric Moulines, Michael I. Jordan, Alain Durmus
This work considers a repeated principal-agent bandit game, where the principal can only interact with her environment through the agent.
no code implementations • ICLR 2021 • El Mahdi El Mhamdi, Rachid Guerraoui, Sébastien Rouault
We propose a practical method which, despite increasing the variance, reduces the variance-norm ratio, mitigating the identified weakness.
1 code implementation • 22 May 2020 • Andrei Kucharavy, El Mahdi El Mhamdi, Rachid Guerraoui
Generative adversarial networks (GANs) are pairs of artificial neural networks that are trained against each other.
no code implementations • 18 Nov 2019 • El Mahdi El Mhamdi, Rachid Guerraoui, Arsany Guirguis
We moreover show that the throughput gain of LiuBei compared to another state-of-the-art Byzantine-resilient ML algorithm (that assumes network asynchrony) is 70%.
no code implementations • 7 Jun 2018 • El Mahdi El Mhamdi, Rachid Guerraoui, Lê Nguyên Hoang, Alexandre Maurer
We first solve the problem analytically in the case of two populations, with a uniform bonus-malus on the zones where each population is a majority.
no code implementations • 29 May 2018 • Henrik Aslund, El Mahdi El Mhamdi, Rachid Guerraoui, Alexandre Maurer
We show that when a third party, the adversary, steps into the two-party setting (agent and operator) of safely interruptible reinforcement learning, a trade-off has to be made between the probability of following the optimal policy in the limit, and the probability of escaping a dangerous situation created by the adversary.
1 code implementation • ICML 2018 • Georgios Damaskinos, El Mahdi El Mhamdi, Rachid Guerraoui, Rhicheek Patra, Mahsa Taziki
The dampening component bounds the convergence rate by adjusting to stale information through a generic gradient weighting scheme.
1 code implementation • ICML 2018 • El Mahdi El Mhamdi, Rachid Guerraoui, Sébastien Rouault
Based on this leeway, we build a simple attack, and experimentally show its strong to utmost effectiveness on CIFAR-10 and MNIST.
1 code implementation • 21 Feb 2018 • El Mahdi El Mhamdi, Rachid Guerraoui, Alexandre Maurer, Vladislav Tempez
A standard belief on emerging collective behavior is that it emerges from simple individual rules.
1 code implementation • NeurIPS 2017 • Peva Blanchard, El Mahdi El Mhamdi, Rachid Guerraoui, Julien Stainer
We propose Krum, an aggregation rule that satisfies our resilience property, which we argue is the first provably Byzantine-resilient algorithm for distributed SGD.
no code implementations • 25 Jul 2017 • El Mahdi El Mhamdi, Rachid Guerraoui, Sébastien Rouault
This bound involves dependencies on the network parameters that can be seen as being too pessimistic in the average case.
no code implementations • 27 Jun 2017 • El Mahdi El Mhamdi, Rachid Guerraoui
We view a neural network as a distributed system of which neurons can fail independently, and we evaluate its robustness in the absence of any (recovery) learning phase.
no code implementations • NeurIPS 2017 • El Mahdi El Mhamdi, Rachid Guerraoui, Hadrien Hendrikx, Alexandre Maurer
We give realistic sufficient conditions on the learning algorithm to enable dynamic safe interruptibility in the case of joint action learners, yet show that these conditions are not sufficient for independent learners.
no code implementations • 8 Mar 2017 • Peva Blanchard, El Mahdi El Mhamdi, Rachid Guerraoui, Julien Stainer
The growth of data, the need for scalability and the complexity of models used in modern machine learning call for distributed implementations.