no code implementations • 28 Jun 2019 • Martin Tappler, Bernhard K. Aichernig, Giovanni Bacci, Maria Eichlseder, Kim G. Larsen
In this work, we study L*-based learning of deterministic Markov decision processes, first assuming an ideal setting with perfect information.