Search Results for author: Vektor Dewanto

Found 4 papers, 1 papers with code

Approximate discounting-free policy evaluation from transient and recurrent states

no code implementations • 8 Apr 2022 • Vektor Dewanto, Marcus Gallagher

We therefore propose a system of approximators for the bias (specifically, its relative value) from transient and recurrent states.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Examining average and discounted reward optimality criteria in reinforcement learning

no code implementations • 3 Jul 2021 • Vektor Dewanto, Marcus Gallagher

In reinforcement learning (RL), the goal is to obtain an optimal policy, for which the optimality criterion is fundamentally important.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

A nearly Blackwell-optimal policy gradient method

1 code implementation • 28 May 2021 • Vektor Dewanto, Marcus Gallagher

In this work, we develop a policy gradient method that optimizes the gain, then the bias (which indicates the transient performance and is important to capably select from policies with equal gain).

reinforcement-learning Reinforcement Learning (RL)

Paper
Code

Average-reward model-free reinforcement learning: a systematic review and literature mapping

no code implementations • 18 Oct 2020 • Vektor Dewanto, George Dunn, Ali Eshragh, Marcus Gallagher, Fred Roosta

Reinforcement learning is important part of artificial intelligence.

Decision Making reinforcement-learning +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.