Search Results for author: Henry Sowerby

Found 2 papers, 1 paper with code

Designing Rewards for Fast Learning

No code implementations · 30 May 2022 · Henry Sowerby, Zhiyuan Zhou, Michael L. Littman

To solve this optimization problem, we propose a linear-programming-based algorithm that efficiently finds a reward function that maximizes the action gap and minimizes the subjective discount.

Q-Learning · Reinforcement Learning (RL)
