25 Aug 2022 • Alessandro Abate, Yousif Almulla, James Fox, David Hyland, Michael Wooldridge
Second, we propose a novel method for distilling the task automaton (assumed to be a deterministic finite automaton) from the learnt product MDP.
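
To make the relationship between a product MDP and its task automaton concrete, here is a minimal sketch. It is not the authors' distillation method, which the excerpt does not describe. It assumes a simplified setting in which each product state is already factored as a pair (environment state, automaton state) and a labelling function maps the environment state just entered to an input symbol. The names `distil_dfa`, `transitions`, and `label` are hypothetical and chosen only for illustration.

```python
from typing import Callable, Dict, Hashable, Iterable, Tuple

# Assumption: a product state is an explicit pair (environment state, automaton state).
ProductState = Tuple[Hashable, Hashable]
Transition = Tuple[ProductState, Hashable, ProductState]  # (state, action, next state)


def distil_dfa(
    transitions: Iterable[Transition],
    label: Callable[[Hashable], Hashable],
) -> Dict[Tuple[Hashable, Hashable], Hashable]:
    """Read off DFA transitions delta[(q, symbol)] = q_next from product transitions.

    Assumption: the automaton reads the label of the environment state
    that was just entered (conventions differ between papers).
    """
    delta: Dict[Tuple[Hashable, Hashable], Hashable] = {}
    for (_s, q), _action, (s_next, q_next) in transitions:
        key = (q, label(s_next))
        # A deterministic automaton cannot map one (state, symbol) pair to two successors.
        if key in delta and delta[key] != q_next:
            raise ValueError(f"non-deterministic transition observed for {key}")
        delta[key] = q_next
    return delta


# Toy example: reach the key first, then the door.
labels = {"start": "none", "key": "k", "door": "d"}
transitions = [
    (("start", "q0"), "right", ("key", "q1")),
    (("key", "q1"), "right", ("door", "q2")),
    (("start", "q0"), "left", ("start", "q0")),
    (("start", "q0"), "jump", ("door", "q0")),
]
delta = distil_dfa(transitions, labels.__getitem__)
```

In this toy case the recovered transition table sends `q0` to `q1` on the key symbol and `q1` to `q2` on the door symbol, while reaching the door before the key leaves the automaton in `q0`. A learnt product MDP would not generally expose its states in factored form, so recovering that factorisation is the hard part that this sketch assumes away.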