Search Results for author: Mahdi M. Fard

Found 2 papers, 0 papers with code

PAC-Bayesian Model Selection for Reinforcement Learning

no code implementations NeurIPS 2010 Mahdi M. Fard, Joelle Pineau

This paper introduces the first set of PAC-Bayesian bounds for the batch reinforcement learning problem in finite state spaces.

Model Selection reinforcement-learning +1

MDPs with Non-Deterministic Policies

no code implementations NeurIPS 2008 Mahdi M. Fard, Joelle Pineau

Markov Decision Processes (MDPs) have been extensively studied and used in the context of planning and decision-making, and many methods exist to find the optimal policy for problems modelled as MDPs.

Decision Making

Cannot find the paper you are looking for? You can Submit a new open access paper.