no code implementations • 21 Oct 2021 • Mohamed Yassine Boukhari, Akash Dhasade, Anne-Marie Kermarrec, Rafael Pires, Othmane Safsafi, Rishi Sharma
GeL enables constrained edge devices to perform additional learning through guessed updates on top of gradient-based steps.
no code implementations • 29 Sep 2021 • Romain Laroche, Othmane Safsafi, Raphael Feraud, Nicolas Broutin
In Batched Multi-Armed Bandits (BMAB), the policy is not allowed to be updated at each time step.