Search Results for author: Romain Deguest

Found 2 papers, 0 papers with code

Sensitivity analysis in HMMs with application to likelihood maximization

no code implementations NeurIPS 2009 Pierre-Arnaud Coquelin, Romain Deguest, Rémi Munos

We derive an IPA estimator for the gradient of the log-likelihood, which may be used in a gradient method for the purpose of likelihood maximization.

Particle Filter-based Policy Gradient in POMDPs

no code implementations NeurIPS 2008 Pierre-Arnaud Coquelin, Romain Deguest, Rémi Munos

Our setting is a Partially Observable Markov Decision Process with continuous state, observation and action spaces.

Cannot find the paper you are looking for? You can Submit a new open access paper.