# Probabilistic inverse reinforcement learning in unknown environments

9 Aug 2014

We consider the problem of learning by demonstration from agents acting in unknown stochastic Markov environments or games. Our aim is to estimate agent preferences in order to construct improved policies for the same task that the agents are trying to solve...

