Search Results for author: David Gomez-Ullate Oteiza

Found 1 paper, 1 paper with code

Opponent Aware Reinforcement Learning

1 code implementation · 22 Aug 2019 · Victor Gallego, Roi Naveiro, David Rios Insua, David Gomez-Ullate Oteiza

We introduce Threatened Markov Decision Processes (TMDPs) as an extension of the classical Markov Decision Process framework for Reinforcement Learning (RL).

Reinforcement Learning
