1 code implementation • 25 Sep 2019 • Piotr Miłoś, Łukasz Kuciński, Konrad Czechowski, Piotr Kozakowski, Maciej Klimek
Notably, our method performs well in environments with sparse rewards where standard $TD(1)$ backups fail.
no code implementations • 16 Feb 2018 • Maciej Jaśkowski, Jakub Świątkowski, Michał Zając, Maciej Klimek, Jarek Potiuk, Piotr Rybicki, Piotr Polatowski, Przemysław Walczyk, Kacper Nowicki, Marek Cygan
In this work we improve on one of the most promising approaches, the Grasp Quality Convolutional Neural Network (GQ-CNN) trained on the DexNet 2. 0 dataset.
no code implementations • 19 May 2017 • Robert Adamski, Tomasz Grel, Maciej Klimek, Henryk Michalewski
The asynchronous nature of the state-of-the-art reinforcement learning algorithms such as the Asynchronous Advantage Actor-Critic algorithm, makes them exceptionally suitable for CPU computations.