Search Results for author: Amir M. Farahmand

Found 2 papers, 0 papers with code

Value Pursuit Iteration

no code implementations • NeurIPS 2012 • Amir M. Farahmand, Doina Precup

VPI has two main features: First, it is a nonparametric algorithm that finds a good sparse approximation of the optimal value function given a dictionary of features.

Reinforcement Learning (RL)

Paper
Add Code

Regularized Policy Iteration

no code implementations • NeurIPS 2008 • Amir M. Farahmand, Mohammad Ghavamzadeh, Shie Mannor, Csaba Szepesvári

In this paper we consider approximate policy-iteration-based reinforcement learning algorithms.

L2 Regularization reinforcement-learning +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.