Efficient Off-Policy Meta-Reinforcement Learning via Probabilistic Context Variables

19 Mar 2019 · Kate Rakelly, Aurick Zhou, Deirdre Quillen, Chelsea Finn, Sergey Levine

Deep reinforcement learning algorithms require large amounts of experience to learn an individual task. While in principle meta-reinforcement learning (meta-RL) algorithms enable agents to learn new skills from small amounts of experience, several major challenges preclude their practicality...

