About

Benchmarks

No evaluation results yet. Help compare methods by submit evaluation metrics.

Subtasks

Datasets

Greatest papers with code

Offline Reinforcement Learning with Fisher Divergence Critic Regularization

14 Mar 2021google-research/google-research

Many modern approaches to offline Reinforcement Learning (RL) utilize behavior regularization, typically augmenting a model-free actor critic algorithm with a penalty measuring divergence of the policy from the offline data.

OFFLINE RL

Behavior Regularized Offline Reinforcement Learning

26 Nov 2019google-research/google-research

In reinforcement learning (RL) research, it is common to assume access to direct online interactions with the environment.

CONTINUOUS CONTROL OFFLINE RL

RL Unplugged: A Collection of Benchmarks for Offline Reinforcement Learning

NeurIPS 2020 deepmind/deepmind-research

We hope that our suite of benchmarks will increase the reproducibility of experiments and make it possible to study challenging tasks with a limited computational budget, thus making RL research both more systematic and more accessible across the community.

OFFLINE RL

RL Unplugged: A Suite of Benchmarks for Offline Reinforcement Learning

24 Jun 2020deepmind/deepmind-research

We hope that our suite of benchmarks will increase the reproducibility of experiments and make it possible to study challenging tasks with a limited computational budget, thus making RL research both more systematic and more accessible across the community.

ATARI GAMES DQN REPLAY DATASET MUJOCO GAMES

Acme: A Research Framework for Distributed Reinforcement Learning

1 Jun 2020deepmind/acme

Ultimately, we show that the design decisions behind Acme lead to agents that can be scaled both up and down and that, for the most part, greater levels of parallelization result in agents with equivalent performance, just faster.

DQN REPLAY DATASET

DeepAveragers: Offline Reinforcement Learning by Solving Derived Non-Parametric MDPs

ICLR 2021 maximecb/gym-miniworld

We study an approach to offline reinforcement learning (RL) based on optimally solving finitely-represented MDPs derived from a static dataset of experience.

OFFLINE RL

D4RL: Datasets for Deep Data-Driven Reinforcement Learning

15 Apr 2020rail-berkeley/d4rl

In this work, we introduce benchmarks specifically designed for the offline setting, guided by key properties of datasets relevant to real-world applications of offline RL.

OFFLINE RL

An Optimistic Perspective on Offline Reinforcement Learning

10 Jul 2019google-research/batch_rl

The DQN replay dataset can serve as an offline RL benchmark and is open-sourced.

ATARI GAMES DQN REPLAY DATASET Q-LEARNING

Human-centric Dialog Training via Offline Reinforcement Learning

EMNLP 2020 natashamjaques/neural_chat

We start by hosting models online, and gather human feedback from real-time, open-ended conversations, which we then use to train and improve the models using offline reinforcement learning (RL).

LANGUAGE MODELLING OFFLINE RL