Distributed Reinforcement Learning via Gossip

28 Oct 2013 · Adwaitvedant S. Mathkar, Vivek S. Borkar ·

We consider the classical TD(0) algorithm implemented on a network of agents wherein the agents also incorporate the updates received from neighboring agents using a gossip-like mechanism. The combined scheme is shown to converge for both discounted and average cost problems.

Code

Add Remove Mark official

No code implementations yet. Submit your code now

Tasks

Add Remove

reinforcement-learning

Reinforcement Learning (RL)

Datasets

Add Datasets introduced or used in this paper

Results from the Paper

Add Remove

Submit results from this paper to get state-of-the-art GitHub badges and help the community compare results to other papers.

Methods

Add Remove

No methods listed for this paper. Add relevant methods here