Search Results for author: Massi Pontil

Modelling transition dynamics in MDPs with RKHS embeddings

For policy optimisation we compare with least-squares policy iteration where a Gaussian process is used for value function estimation.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.