Reinforcement Learning (RL)

3912 papers with code • 1 benchmarks • 15 datasets

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Benchmarks

Add a Result

These leaderboards are used to track progress in Reinforcement Learning (RL)

Trend	Dataset	Best Model	Paper	Code	Compare
	ProcGen	PPG			See all

Libraries

Use these libraries to find Reinforcement Learning (RL) models and implementations

opendilab/DI-engine

28 papers

2,548

zzmtsvv/rl_task

15 papers

chainer/chainerrl

14 papers

1,154

hill-a/stable-baselines

13 papers

4,042

See all 35 libraries.

Datasets

Subtasks

Most implemented papers

Most implemented Social Latest No code

Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm

ray-project/ray • 5 Dec 2017

The game of chess is the most widely-studied domain in the history of artificial intelligence.

Paper
Code

DARTS: Differentiable Architecture Search

quark0/darts • • ICLR 2019

This paper addresses the scalability challenge of architecture search by formulating the task in a differentiable manner.

Paper
Code

Soft Actor-Critic Algorithms and Applications

rail-berkeley/softlearning • • 13 Dec 2018

A fork of OpenAI Baselines, implementations of reinforcement learning algorithms

Paper
Code

OpenAI Gym

openai/gym • • 5 Jun 2016

OpenAI Gym is a toolkit for reinforcement learning research.

Paper
Code

Weight Uncertainty in Neural Networks

tensorflow/models • • 20 May 2015

We introduce a new, efficient, principled and backpropagation-compatible algorithm for learning a probability distribution on the weights of a neural network, called Bayes by Backprop.

Paper
Code

Rainbow: Combining Improvements in Deep Reinforcement Learning

thu-ml/tianshou • • 6 Oct 2017

The deep reinforcement learning community has made several independent improvements to the DQN algorithm.

Paper
Code

Self-critical Sequence Training for Image Captioning

ruotianluo/ImageCaptioning.pytorch • • CVPR 2017

In this paper we consider the problem of optimizing image captioning systems using reinforcement learning, and show that by carefully optimizing our systems using the test metrics of the MSCOCO task, significant gains in performance can be realized.

Paper
Code

Multi-Goal Reinforcement Learning: Challenging Robotics Environments and Request for Research

DartEnv/dart-env • • 26 Feb 2018

The purpose of this technical report is two-fold.

Paper
Code

A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem

ZhengyaoJiang/PGPortfolio • • 30 Jun 2017

They are, along with a number of recently reviewed or published portfolio-selection strategies, examined in three back-test experiments with a trading period of 30 minutes in a cryptocurrency market.

Paper
Code

Hindsight Experience Replay

DLR-RM/stable-baselines3 • • NeurIPS 2017

Dealing with sparse rewards is one of the biggest challenges in Reinforcement Learning (RL).

Paper
Code

Reinforcement Learning (RL)

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Most implemented papers

Content

Benchmarks

Add a Result