Search Results for author: Katarzyna Kańska

Found 1 papers, 1 papers with code

Q-Value Weighted Regression: Reinforcement Learning with Limited Data

1 code implementation12 Feb 2021 Piotr Kozakowski, Łukasz Kaiser, Henryk Michalewski, Afroz Mohiuddin, Katarzyna Kańska

QWR is an extension of Advantage Weighted Regression (AWR), an off-policy actor-critic algorithm that performs very well on continuous control tasks, also in the offline setting, but has low sample efficiency and struggles with high-dimensional observation spaces.

Atari Games Continuous Control +4

Cannot find the paper you are looking for? You can Submit a new open access paper.