Reinforcement Learning (RL)

3929 papers with code • 1 benchmarks • 15 datasets

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Benchmarks

Add a Result

These leaderboards are used to track progress in Reinforcement Learning (RL)

Trend	Dataset	Best Model	Paper	Code	Compare
	ProcGen	PPG			See all

Libraries

Use these libraries to find Reinforcement Learning (RL) models and implementations

opendilab/DI-engine

28 papers

2,593

zzmtsvv/rl_task

15 papers

chainer/chainerrl

14 papers

1,155

hill-a/stable-baselines

13 papers

4,050

See all 35 libraries.

Datasets

Subtasks

Most implemented papers

Most implemented Social Latest No code

Simple random search provides a competitive approach to reinforcement learning

modestyachts/ARS • 19 Mar 2018

A common belief in model-free reinforcement learning is that methods based on random search in the parameter space of policies exhibit significantly worse sample complexity than those that explore the space of actions.

Paper
Code

SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient

LantaoYu/SeqGAN • • 18 Sep 2016

As a new way of training generative models, Generative Adversarial Nets (GAN) that uses a discriminative model to guide the training of the generative model has enjoyed considerable success in generating real-valued data.

Paper
Code

Evolution Strategies as a Scalable Alternative to Reinforcement Learning

ray-project/ray • 10 Mar 2017

We explore the use of Evolution Strategies (ES), a class of black box optimization algorithms, as an alternative to popular MDP-based RL techniques such as Q-learning and Policy Gradients.

Paper
Code

IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures

deepmind/scalable_agent • • ICML 2018

In this work we aim to solve a large collection of tasks using a single reinforcement learning agent with a single set of parameters.

Paper
Code

ParlAI: A Dialog Research Software Platform

facebookresearch/ParlAI • • EMNLP 2017

We introduce ParlAI (pronounced "par-lay"), an open-source software platform for dialog research implemented in Python, available at http://parl. ai.

Paper
Code

A Distributional Perspective on Reinforcement Learning

facebookresearch/ReAgent • • ICML 2017

We obtain both state-of-the-art results and anecdotal evidence demonstrating the importance of the value distribution in approximate reinforcement learning.

Paper
Code

World Models

hardmaru/WorldModelsExperiments • • 27 Mar 2018

We explore building generative neural network models of popular reinforcement learning environments.

Paper
Code

Exploration by Random Network Distillation

openai/random-network-distillation • • ICLR 2019

In particular we establish state of the art performance on Montezuma's Revenge, a game famously difficult for deep reinforcement learning methods.

Paper
Code

Implicit Quantile Networks for Distributional Reinforcement Learning

google/dopamine • • ICML 2018

In this work, we build on recent advances in distributional reinforcement learning to give a generally applicable, flexible, and state-of-the-art distributional variant of DQN.

Paper
Code

The StarCraft Multi-Agent Challenge

oxwhirl/pymarl • • 11 Feb 2019

In this paper, we propose the StarCraft Multi-Agent Challenge (SMAC) as a benchmark problem to fill this gap.

Paper
Code

Reinforcement Learning (RL)

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Most implemented papers

Content

Benchmarks

Add a Result