Partially Observable Reinforcement Learning

7 papers with code • 1 benchmark • 1 dataset


Most implemented papers

Stabilizing Transformers for Reinforcement Learning

opendilab/DI-engine ICML 2020

Harnessing the transformer's ability to process long time horizons of information could provide a similar performance boost in partially observable reinforcement learning (RL) domains, but the large-scale transformers used in NLP have yet to be successfully applied to the RL setting.

POPGym: Benchmarking Partially Observable Reinforcement Learning

proroklab/popgym 3 Mar 2023

Real-world applications of Reinforcement Learning (RL) are often partially observable, thus requiring memory.
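The memory requirement the POPGym excerpt mentions can be seen in a toy task. The sketch below is a hypothetical minimal environment (not POPGym's API): a cue is visible only on the first step, so any agent without memory cannot reliably choose the rewarded final action.

```python
import random

class TMaze:
    """Hypothetical minimal partially observable task (not the POPGym API):
    a cue (0 or 1) is shown only on the first step; after `length` blank
    steps the agent must repeat the cue to earn reward."""

    def __init__(self, length=4, seed=0):
        self.length = length
        self.rng = random.Random(seed)

    def reset(self):
        self.cue = self.rng.randint(0, 1)
        self.t = 0
        return self.cue          # the cue is observable only now

    def step(self, action):
        self.t += 1
        if self.t < self.length:
            return -1, 0.0, False        # blank observation, no reward yet
        reward = 1.0 if action == self.cue else 0.0
        return -1, reward, True

# An agent that stores the first observation (its "memory") solves the task;
# a memoryless policy sees only blank observations and succeeds at chance.
env = TMaze()
memory = env.reset()
done = False
while not done:
    obs, reward, done = env.step(memory)
print(reward)  # 1.0
```

A recurrent or transformer policy learns this memory implicitly; here it is hard-coded to isolate why partial observability forces it.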

Learning Reward Machines for Partially Observable Reinforcement Learning

RToroIcarte/lrm NeurIPS 2019

Reward Machines (RMs), originally proposed for specifying problems in Reinforcement Learning (RL), provide a structured, automata-based representation of a reward function that allows an agent to decompose problems into subproblems that can be efficiently learned using off-policy learning.
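The automata-based structure described in the excerpt can be sketched as a small finite-state machine over high-level events, where each transition emits a reward. This is a hedged illustration of the idea, not the `RToroIcarte/lrm` implementation; the task, state names, and event labels are invented for the example.

```python
class RewardMachine:
    """Sketch of a Reward Machine: a finite automaton over high-level
    events whose transitions emit reward (illustrative, not the lrm API)."""

    def __init__(self, transitions, initial, terminal):
        # transitions: (state, event) -> (next_state, reward)
        self.transitions = transitions
        self.state = initial
        self.terminal = terminal

    def step(self, event):
        # No matching edge: stay in the current state with zero reward.
        next_state, reward = self.transitions.get(
            (self.state, event), (self.state, 0.0))
        self.state = next_state
        return reward, self.state in self.terminal

# Hypothetical task "fetch coffee, then deliver it to the office",
# decomposed into two subproblems by the machine's states.
rm = RewardMachine(
    transitions={
        ("u0", "coffee"): ("u1", 0.0),    # subtask 1: pick up coffee
        ("u1", "office"): ("u_acc", 1.0), # subtask 2: deliver it
    },
    initial="u0",
    terminal={"u_acc"},
)
rewards = [rm.step(e)[0] for e in ["office", "coffee", "office"]]
print(rewards)  # [0.0, 0.0, 1.0]
```

Each RM state marks a subproblem ("not yet holding coffee" vs. "holding coffee"), which is what lets off-policy learners reuse experience across subtasks.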

Adaptive Transformers in RL

jerrodparker20/adaptive-transformers-in-rl 8 Apr 2020

In this work we first partially replicate the results shown in Stabilizing Transformers in RL on both reactive and memory-based environments.

Blockwise Sequential Model Learning for Partially Observable Reinforcement Learning

giseung-park/blockseq 10 Dec 2021

This paper proposes a new sequential model learning architecture to solve partially observable Markov decision problems.

Deep Transformer Q-Networks for Partially Observable Reinforcement Learning

kevslinger/dtqn 2 Jun 2022

Such tasks typically require some form of memory, where the agent has access to multiple past observations, in order to perform well.
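The simplest form of the memory the DTQN excerpt describes is giving the Q-network a fixed-length window of past observations. The helper below is a hypothetical sketch (not code from the `kevslinger/dtqn` repo): it keeps the last k observations and left-pads so the network always receives a fixed-length sequence.

```python
from collections import deque

class ObservationHistory:
    """Hypothetical helper (not the dtqn repo's code): keep the last k
    observations and expose the stacked window a Q-network would consume."""

    def __init__(self, k, pad):
        self.k = k
        self.pad = pad
        self.buffer = deque(maxlen=k)  # oldest entries drop automatically

    def reset(self, obs):
        self.buffer.clear()
        self.buffer.append(obs)

    def append(self, obs):
        self.buffer.append(obs)

    def window(self):
        # Left-pad early in the episode so the sequence length is fixed.
        padding = [self.pad] * (self.k - len(self.buffer))
        return padding + list(self.buffer)

history = ObservationHistory(k=4, pad=0)
history.reset(3)
for obs in (5, 7):
    history.append(obs)
print(history.window())  # [0, 3, 5, 7]
```

A transformer Q-network then attends over this window, letting it weight distant observations instead of compressing them into a single recurrent state.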

Leveraging Fully Observable Policies for Learning under Partial Observability

hai-h-nguyen/cosil-corl22 3 Nov 2022

Reinforcement learning in partially observable domains is challenging due to the lack of observable state information.