1 code implementation • 4 May 2023 • Desik Rengarajan, Nitin Ragothaman, Dileep Kalathil, Srinivas Shakkottai
We consider the problem of federated offline reinforcement learning (RL), a scenario under which distributed learning agents must collaboratively learn a high-quality control policy only using small pre-collected datasets generated according to different unknown behavior policies.