2 code implementations • 24 May 2022 • Fabian Paischer, Thomas Adler, Vihang Patil, Angela Bitto-Nemling, Markus Holzleitner, Sebastian Lehner, Hamid Eghbal-zadeh, Sepp Hochreiter
We propose to utilize a frozen Pretrained Language Transformer (PLT) for history representation and compression to improve sample efficiency.
2 code implementations • 8 Nov 2021 • Kajetan Schweighofer, Andreas Radler, Marius-Constantin Dinu, Markus Hofmarcher, Vihang Patil, Angela Bitto-Nemling, Hamid Eghbal-zadeh, Sepp Hochreiter
The dataset characteristics are determined by the behavioral policy that samples this dataset.
1 code implementation • 12 Jul 2022 • Christian Steinparz, Thomas Schmied, Fabian Paischer, Marius-Constantin Dinu, Vihang Patil, Angela Bitto-Nemling, Hamid Eghbal-zadeh, Sepp Hochreiter
Therefore, exploration strategies and learning methods are required that can track the steady domain shifts and adapt to them.
no code implementations • 21 Feb 2022 • Youssef Diouane, Aurelien Lucchi, Vihang Patil
Evolutionary strategies have recently been shown to achieve competitive levels of performance on complex optimization problems in reinforcement learning.