Search Results for author: Hsiao-Ru Pan

Found 3 papers, 2 papers with code

Skill or Luck? Return Decomposition via Advantage Functions

no code implementations • 20 Feb 2024 • Hsiao-Ru Pan, Bernhard Schölkopf

Learning from off-policy data is essential for sample-efficient reinforcement learning.

Direct Advantage Estimation

1 code implementation • 13 Sep 2021 • Hsiao-Ru Pan, Nico Gürtler, Alexander Neitz, Bernhard Schölkopf

The predominant approach in reinforcement learning is to assign credit to actions based on the expected return.
