Search Results for author: Hsiao-Ru Pan

Skill or Luck? Return Decomposition via Advantage Functions

Learning from off-policy data is essential for sample-efficient reinforcement learning.

How agents can learn internal models that veridically represent interactions with the real world is a largely open question.

The predominant approach in reinforcement learning is to assign credit to actions based on the expected return.

