no code implementations • 1 Jan 2022 • Kshitija Taywade, Brent Harrison, Adib Bagh
We investigate the use of a multi-agent multi-armed bandit (MA-MAB) setting for modeling repeated Cournot oligopoly games, where the firms acting as agents choose from the set of arms representing production quantity (a discrete value).