1 code implementation • 5 Nov 2023 • Artem Tsypin, Leonid Ugadiarov, Kuzma Khrabrov, Alexander Telepov, Egor Rumiantsev, Alexey Skrynnik, Aleksandr I. Panov, Dmitry Vetrov, Elena Tutubalina, Artur Kadurin
Our results demonstrate that the neural network trained with GOLF performs on par with the oracle on a benchmark of diverse drug-like molecules using $50\times$ less additional data.
no code implementations • 26 Oct 2023 • Leonid Ugadiarov, Aleksandr I. Panov
Our algorithm performs better in a visually complex 3D robotic environment and a 2D environment with compositional structure than the state-of-the-art model-free actor-critic algorithm built upon the transformer architecture and the state-of-the-art monolithic model-based algorithm.
1 code implementation • 21 Sep 2021 • Leonid Ugadiarov, Alexey Skrynnik, Aleksandr I. Panov
Exploration is an essential part of reinforcement learning and limits the quality of the learned policy.