16 Sep 2018 • Ameet Deshpande, Srikanth Sarma, Ashutosh Jha, Balaraman Ravindran
One such approach is Hindsight Experience Replay (HER), which uses an off-policy reinforcement learning algorithm to learn a goal-conditioned policy.
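The relabeling idea behind this approach can be illustrated with a minimal sketch. It assumes a sparse reward (0 when the goal is reached, -1 otherwise) and the common "final" strategy, in which the last state reached in an episode is substituted as the goal. The names `Transition`, `sparse_reward`, and `relabel_with_final_goal` are hypothetical and chosen only for illustration; they are not taken from the paper.

```python
from typing import List, NamedTuple, Tuple

State = Tuple[int, int]  # assumption: states and goals are grid coordinates


class Transition(NamedTuple):
    state: State
    action: int
    next_state: State
    goal: State
    reward: float


def sparse_reward(achieved: State, goal: State) -> float:
    """Assumed reward: 0 when the goal is reached, -1 otherwise."""
    return 0.0 if achieved == goal else -1.0


def relabel_with_final_goal(episode: List[Transition]) -> List[Transition]:
    """Return a copy of the episode with the goal replaced by the final state reached.

    This is the "final" relabeling strategy; other strategies exist.
    """
    if not episode:
        return []
    hindsight_goal = episode[-1].next_state
    return [
        t._replace(
            goal=hindsight_goal,
            reward=sparse_reward(t.next_state, hindsight_goal),
        )
        for t in episode
    ]


# A failed episode: the agent aimed for (3, 3) but only reached (0, 2).
episode = [
    Transition((0, 0), 1, (0, 1), (3, 3), -1.0),
    Transition((0, 1), 1, (0, 2), (3, 3), -1.0),
]
relabeled = relabel_with_final_goal(episode)

# An off-policy learner can train on both the original and relabeled transitions.
replay_buffer = episode + relabeled
```

Because the learner is off-policy, it can reuse the relabeled transitions even though they were not collected while pursuing the substituted goal. In this sketch the last relabeled transition carries a success reward, which the original episode never produced.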