no code implementations • 8 Apr 2022 • Vektor Dewanto, Marcus Gallagher
We therefore propose a system of approximators for the bias (specifically, its relative value) from transient and recurrent states.
no code implementations • 3 Jul 2021 • Vektor Dewanto, Marcus Gallagher
In reinforcement learning (RL), the goal is to obtain an optimal policy, for which the optimality criterion is fundamentally important.
1 code implementation • 28 May 2021 • Vektor Dewanto, Marcus Gallagher
In this work, we develop a policy gradient method that optimizes the gain, then the bias (which indicates the transient performance and is important to capably select from policies with equal gain).
no code implementations • 18 Oct 2020 • Vektor Dewanto, George Dunn, Ali Eshragh, Marcus Gallagher, Fred Roosta
Reinforcement learning is important part of artificial intelligence.