Reinforcement Learning (RL)

3912 papers with code • 1 benchmarks • 15 datasets

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Libraries

Use these libraries to find Reinforcement Learning (RL) models and implementations
28 papers
2,548
15 papers
35
14 papers
1,154
See all 35 libraries.

Most implemented papers

Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm

ray-project/ray 5 Dec 2017

The game of chess is the most widely-studied domain in the history of artificial intelligence.

DARTS: Differentiable Architecture Search

quark0/darts ICLR 2019

This paper addresses the scalability challenge of architecture search by formulating the task in a differentiable manner.

Soft Actor-Critic Algorithms and Applications

rail-berkeley/softlearning 13 Dec 2018

A fork of OpenAI Baselines, implementations of reinforcement learning algorithms

OpenAI Gym

openai/gym 5 Jun 2016

OpenAI Gym is a toolkit for reinforcement learning research.

Weight Uncertainty in Neural Networks

tensorflow/models 20 May 2015

We introduce a new, efficient, principled and backpropagation-compatible algorithm for learning a probability distribution on the weights of a neural network, called Bayes by Backprop.

Rainbow: Combining Improvements in Deep Reinforcement Learning

thu-ml/tianshou 6 Oct 2017

The deep reinforcement learning community has made several independent improvements to the DQN algorithm.

Self-critical Sequence Training for Image Captioning

ruotianluo/ImageCaptioning.pytorch CVPR 2017

In this paper we consider the problem of optimizing image captioning systems using reinforcement learning, and show that by carefully optimizing our systems using the test metrics of the MSCOCO task, significant gains in performance can be realized.

A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem

ZhengyaoJiang/PGPortfolio 30 Jun 2017

They are, along with a number of recently reviewed or published portfolio-selection strategies, examined in three back-test experiments with a trading period of 30 minutes in a cryptocurrency market.

Hindsight Experience Replay

DLR-RM/stable-baselines3 NeurIPS 2017

Dealing with sparse rewards is one of the biggest challenges in Reinforcement Learning (RL).