no code implementations • 20 Feb 2024 • Hsiao-Ru Pan, Bernhard Schölkopf
Learning from off-policy data is essential for sample-efficient reinforcement learning.
1 code implementation • 25 Jul 2022 • Hamza Keurti, Hsiao-Ru Pan, Michel Besserve, Benjamin F. Grewe, Bernhard Schölkopf
How can agents learn internal models that veridically represent interactions with the real world is a largely open question.
1 code implementation • 13 Sep 2021 • Hsiao-Ru Pan, Nico Gürtler, Alexander Neitz, Bernhard Schölkopf
The predominant approach in reinforcement learning is to assign credit to actions based on the expected return.