Search Results for author: Girish Raguvir J

Found 1 papers, 0 papers with code

Learning to Mix n-Step Returns: Generalizing lambda-Returns for Deep Reinforcement Learning

no code implementations • ICLR 2018 • Sahil Sharma, Girish Raguvir J, Srivatsan Ramesh, Balaraman Ravindran

Our second major contribution is that we propose a generalization of lambda-returns called Confidence-based Autodidactic Returns (CAR), wherein the RL agent learns the weighting of the n-step returns in an end-to-end manner.

Benchmarking Decision Making +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.