Search Results for author: Dmitry Akimov

Found 3 papers, 3 papers with code

Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size

2 code implementations • 20 Nov 2022 • Alexander Nikulin, Vladislav Kurenkov, Denis Tarasov, Dmitry Akimov, Sergey Kolesnikov

Training large neural networks is known to be time-consuming, with the learning duration taking days or even weeks.

Offline RL

393

Paper
Code

CORL: Research-oriented Deep Offline Reinforcement Learning Library

3 code implementations • NeurIPS 2023 • Denis Tarasov, Alexander Nikulin, Dmitry Akimov, Vladislav Kurenkov, Sergey Kolesnikov

CORL is an open-source library that provides thoroughly benchmarked single-file implementations of both deep offline and offline-to-online reinforcement learning algorithms.

Benchmarking D4RL +1

967

Paper
Code

Distributed Soft Actor-Critic with Multivariate Reward Representation and Knowledge Distillation

1 code implementation • 29 Nov 2019 • Dmitry Akimov

In this paper, we describe NeurIPS 2019 Learning to Move - Walk Around challenge physics-based environment and present our solution to this competition which scored 1303. 727 mean reward points and took 3rd place.

Knowledge Distillation reinforcement-learning +1

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.