no code implementations • 7 Dec 2023 • Ayush Rai, Shaoshuai Mou
In the proposed approach, the agents sample their individual local functions in a way that benefits the whole network by utilizing a running consensus to estimate the upper confidence bound on the global function.