Search Results for author: Vektor Dewanto

Found 4 papers, 1 papers with code

Approximate discounting-free policy evaluation from transient and recurrent states

no code implementations8 Apr 2022 Vektor Dewanto, Marcus Gallagher

We therefore propose a system of approximators for the bias (specifically, its relative value) from transient and recurrent states.

reinforcement-learning Reinforcement Learning (RL)

Examining average and discounted reward optimality criteria in reinforcement learning

no code implementations3 Jul 2021 Vektor Dewanto, Marcus Gallagher

In reinforcement learning (RL), the goal is to obtain an optimal policy, for which the optimality criterion is fundamentally important.

reinforcement-learning Reinforcement Learning (RL)

A nearly Blackwell-optimal policy gradient method

1 code implementation28 May 2021 Vektor Dewanto, Marcus Gallagher

In this work, we develop a policy gradient method that optimizes the gain, then the bias (which indicates the transient performance and is important to capably select from policies with equal gain).

reinforcement-learning Reinforcement Learning (RL)

Cannot find the paper you are looking for? You can Submit a new open access paper.