Offline RL

225 papers with code • 2 benchmarks • 6 datasets

This task has no description! Would you like to contribute one?

Libraries

Use these libraries to find Offline RL models and implementations
14 papers
35
7 papers
387
4 papers
2,523
See all 10 libraries.

Compositional Conservatism: A Transductive Approach in Offline Reinforcement Learning

faceonlive/ai-research 6 Apr 2024

Our COCOA seeks both in-distribution anchors and differences by utilizing the learned reverse dynamics model, encouraging conservatism in the compositional input space for the policy or value function.

140
06 Apr 2024

Unsupervised Zero-Shot Reinforcement Learning via Functional Reward Encodings

kvfrans/fre 27 Feb 2024

Can we pre-train a generalist agent from a large amount of unlabeled offline trajectories such that it can be immediately adapted to any new downstream tasks in a zero-shot manner?

41
27 Feb 2024

Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future Directions

lucascjysdl/dgms-for-offline-policy-learning 21 Feb 2024

This work offers a hands-on reference for the research progress in deep generative models for offline policy learning, and aims to inspire improved DGM-based offline RL or IL algorithms.

12
21 Feb 2024

MORE-3S:Multimodal-based Offline Reinforcement Learning with Shared Semantic Spaces

zheng0428/more_ 20 Feb 2024

Drawing upon the intuition that aligning different modalities to the same semantic embedding space would allow models to understand states and actions more easily, we propose a new perspective to the offline reinforcement learning (RL) challenge.

2
20 Feb 2024

Stitching Sub-Trajectories with Conditional Diffusion Model for Goal-Conditioned Offline RL

rlatjddbs/ssd 11 Feb 2024

In this paper, we propose SSD (Sub-trajectory Stitching with Diffusion), a model-based offline GCRL method that leverages the conditional diffusion model to address these limitations.

0
11 Feb 2024

SEABO: A Simple Search-Based Method for Offline Imitation Learning

dmksjfl/seabo 6 Feb 2024

Offline reinforcement learning (RL) has attracted much attention due to its ability in learning from static offline datasets and eliminating the need of interacting with the environment.

6
06 Feb 2024

Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning

ruoqizzz/entropy-regularized-diffusion-policy-with-qensemble 6 Feb 2024

We show that such an SDE has a solution that we can use to calculate the log probability of the policy, yielding an entropy regularizer that improves the exploration of offline datasets.

0
06 Feb 2024

ODICE: Revealing the Mystery of Distribution Correction Estimation via Orthogonal-gradient Update

maoliyuan/odice-pytorch 1 Feb 2024

To resolve this issue, we propose a simple yet effective modification that projects the backward gradient onto the normal plane of the forward gradient, resulting in an orthogonal-gradient update, a new learning rule for DICE-based methods.

13
01 Feb 2024

Differentiable Tree Search in Latent State Space

openai/procgen 22 Jan 2024

In this work, we introduce Differentiable Tree Search (DTS), a novel neural network architecture that significantly strengthens the inductive bias by embedding the algorithmic structure of a best-first online search algorithm.

972
22 Jan 2024

Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model

zhengyinan-air/fisor 19 Jan 2024

Interestingly, we discover that via reachability analysis of safe-control theory, the hard safety constraint can be equivalently translated to identifying the largest feasible region given the offline dataset.

42
19 Jan 2024