D4RL
69 papers with code • 1 benchmark • 1 dataset
Libraries
Use these libraries to find D4RL models and implementations.
Most implemented papers
Mildly Conservative Q-Learning for Offline Reinforcement Learning
The distribution shift between the learned policy and the behavior policy makes it necessary for the value function to stay conservative, so that out-of-distribution (OOD) actions are not severely overestimated.
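The conservatism idea can be illustrated with a toy penalty term (a generic sketch in the spirit of conservative Q-learning methods, not this paper's exact objective; the linear `q` function and the sampling scheme are stand-ins):

```python
import numpy as np

rng = np.random.default_rng(0)

def q(s, a, w):
    # toy linear Q-function, purely for illustration
    return s * w[0] + a * w[1]

s = rng.normal(size=32)                      # states from the dataset
a_data = rng.normal(size=32)                 # behavior-policy actions
a_cand = rng.uniform(-2, 2, size=(10, 32))   # sampled candidate (possibly OOD) actions

w = np.array([0.5, 1.0])

# conservative penalty: push Q down on the most overestimated sampled
# actions and up on dataset actions, so that OOD values stay pessimistic
penalty = np.mean(q(s[None, :], a_cand, w).max(axis=0) - q(s, a_data, w))
```

Adding such a penalty (scaled by a trade-off weight) to the usual TD loss biases the critic toward lower values outside the data's support.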
Diffusion Policies as an Expressive Policy Class for Offline Reinforcement Learning
In our approach, we learn an action-value function and we add a term maximizing action-values into the training loss of the conditional diffusion model, which results in a loss that seeks optimal actions that are near the behavior policy.
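The combined loss described above can be sketched as a denoising (behavior-cloning) term minus a Q-maximization term (a minimal sketch; the `q_value` critic, shapes, and the value of `alpha` are illustrative stand-ins, not the paper's implementation):

```python
import numpy as np

rng = np.random.default_rng(0)

def diffusion_bc_loss(noise_pred, noise):
    # denoising term: the diffusion model is trained to predict injected noise,
    # which behavior-clones the dataset's action distribution
    return np.mean((noise_pred - noise) ** 2)

def q_value(s, a):
    # stand-in critic; in practice this is a learned Q-network
    return -np.mean((a - 0.3) ** 2, axis=-1)

batch = 16
noise = rng.normal(size=(batch, 4))
noise_pred = noise + 0.1 * rng.normal(size=(batch, 4))   # imperfect denoiser
s = rng.normal(size=(batch, 8))
a_sampled = rng.normal(size=(batch, 4))  # actions sampled from the diffusion policy

alpha = 2.5  # trade-off weight between behavior cloning and value maximization
loss = diffusion_bc_loss(noise_pred, noise) - alpha * np.mean(q_value(s, a_sampled))
```

Minimizing `loss` pulls sampled actions toward high Q-values while the denoising term keeps them near the behavior distribution.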
Model-Based Offline Reinforcement Learning with Pessimism-Modulated Dynamics Belief
To make the approach practical, we further devise an offline RL algorithm that approximately finds the solution.
CORL: Research-oriented Deep Offline Reinforcement Learning Library
CORL is an open-source library that provides thoroughly benchmarked single-file implementations of both deep offline and offline-to-online reinforcement learning algorithms.
Extreme Q-Learning: MaxEnt RL without Entropy
Using EVT, we derive our Extreme Q-Learning framework and, consequently, online and (for the first time) offline MaxEnt Q-learning algorithms that do not explicitly require access to a policy or its entropy.
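The key tool here is Gumbel (LINEX-style) regression, whose minimizer is a log-sum-exp soft maximum, so a soft value can be fit without ever evaluating a policy's entropy. A minimal numeric sketch (the grid search is for illustration only):

```python
import numpy as np

def gumbel_loss(v, q, beta):
    # LINEX-style loss; its minimizer is v = beta * log E[exp(q / beta)],
    # i.e. a soft maximum of the Q-values, computed without a policy
    z = (q - v) / beta
    return np.mean(np.exp(z) - z - 1.0)

q = np.array([0.0, 1.0, 2.0])
beta = 1.0

# brute-force minimization over a grid, to check the closed-form minimizer
vs = np.linspace(-1.0, 4.0, 501)
v_star = vs[int(np.argmin([gumbel_loss(v, q, beta) for v in vs]))]
soft_max = beta * np.log(np.mean(np.exp(q / beta)))
```

Setting the gradient of the loss to zero gives `mean(exp((q - v)/beta)) = 1`, which rearranges to the log-sum-exp form above.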
Anti-Exploration by Random Network Distillation
Despite the success of Random Network Distillation (RND) in various domains, it was shown as not discriminative enough to be used as an uncertainty estimator for penalizing out-of-distribution actions in offline reinforcement learning.
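For context, RND scores novelty as the error of a predictor trained to match a frozen random network; in the anti-exploration setting that error is used as a penalty on OOD actions. A toy sketch (the linear predictor, `tanh` target, and least-squares "training" are simplifications, not the paper's architecture):

```python
import numpy as np

rng = np.random.default_rng(0)

# frozen random "target network" (here: a fixed nonlinear feature map)
W_target = rng.normal(size=(8, 4))
def target(x):
    return np.tanh(x @ W_target)

# train a linear predictor to match the target on in-dataset inputs only
X_data = rng.normal(size=(256, 8))
W_pred, *_ = np.linalg.lstsq(X_data, target(X_data), rcond=None)

def rnd_bonus(x):
    # prediction error of the frozen target: low near the data,
    # high on novel inputs -> usable as an anti-exploration penalty
    return np.sum((x @ W_pred - target(x)) ** 2, axis=-1)

x_in = X_data[:16]                        # in-distribution inputs
x_out = 10.0 * rng.normal(size=(16, 8))   # far outside the dataset
```

Here `rnd_bonus(x_out)` is much larger than `rnd_bonus(x_in)`; the paper's point is that with naive conditioning this gap is not sharp enough to separate OOD *actions*, which motivates their modifications.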
Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization
This gives a deeper understanding of why the in-sample learning paradigm works: it applies implicit value regularization to the policy.
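One concrete in-sample objective in this family is expectile regression (used by IQL, which the implicit-value-regularization view generalizes): the value function is fit only on dataset actions, with an asymmetric loss that biases it toward an upper expectile of in-sample Q-values. A minimal sketch (the grid search is illustrative):

```python
import numpy as np

def expectile_loss(u, tau):
    # asymmetric squared loss; tau > 0.5 weights positive errors more,
    # pushing v toward an upper expectile of the in-sample Q-values
    w = np.where(u > 0, tau, 1.0 - tau)
    return np.mean(w * u ** 2)

# Q-values of actions that actually appear in the dataset -- no OOD queries
q = np.array([0.0, 1.0, 2.0, 3.0])

vs = np.linspace(0.0, 3.0, 301)
losses = [expectile_loss(q - v, tau=0.9) for v in vs]
v_star = vs[int(np.argmin(losses))]   # the tau=0.9 expectile, here 2.5
```

With `tau = 0.9` the fitted value (2.5) sits well above the mean (1.5) but below the maximum, approximating a max over in-sample actions without ever evaluating OOD ones.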
Contrastive Energy Prediction for Exact Energy-Guided Diffusion Sampling in Offline Reinforcement Learning
The main challenge for this setting is that the intermediate guidance during the diffusion sampling procedure, which is jointly defined by the sampling distribution and the energy function, is unknown and is hard to estimate.
Datasets and Benchmarks for Offline Safe Reinforcement Learning
This paper presents a comprehensive benchmarking suite tailored to offline safe reinforcement learning (RL) challenges, aiming to foster progress in the development and evaluation of safe learning algorithms in both the training and deployment phases.
d3rlpy: An Offline Deep Reinforcement Learning Library
In this paper, we introduce d3rlpy, an open-source offline deep reinforcement learning (RL) library for Python.