Finite Regret and Cycles with Fixed Step-Size via Alternating Gradient Descent-Ascent

9 Jul 2019  ·  James P. Bailey, Gauthier Gidel, Georgios Piliouras ·

Gradient descent is arguably one of the most popular online optimization methods with a wide array of applications. However, the standard implementation where agents simultaneously update their strategies yields several undesirable properties; strategies diverge away from equilibrium and regret grows over time. In this paper, we eliminate these negative properties by introducing a different implementation to obtain finite regret via arbitrary fixed step-size. We obtain this surprising property by having agents take turns when updating their strategies. In this setting, we show that an agent that uses gradient descent obtains bounded regret -- regardless of how their opponent updates their strategies. Furthermore, we show that in adversarial settings that agents' strategies are bounded and cycle when both are using the alternating gradient descent algorithm.

PDF Abstract
No code implementations yet. Submit your code now

Categories


Computer Science and Game Theory Dynamical Systems Optimization and Control

Datasets


  Add Datasets introduced or used in this paper