reinforcement-learning

3,461 papers with code • 1 benchmark • 1 dataset

Most implemented papers

OpenAI Gym

openai/gym 5 Jun 2016

OpenAI Gym is a toolkit for reinforcement learning research.

Weight Uncertainty in Neural Networks

tensorflow/models 20 May 2015

We introduce a new, efficient, principled and backpropagation-compatible algorithm for learning a probability distribution on the weights of a neural network, called Bayes by Backprop.

Rainbow: Combining Improvements in Deep Reinforcement Learning

thu-ml/tianshou 6 Oct 2017

The deep reinforcement learning community has made several independent improvements to the DQN algorithm.

Self-critical Sequence Training for Image Captioning

ruotianluo/ImageCaptioning.pytorch CVPR 2017

In this paper we consider the problem of optimizing image captioning systems using reinforcement learning, and show that by carefully optimizing our systems using the test metrics of the MSCOCO task, significant gains in performance can be realized.

A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem

ZhengyaoJiang/PGPortfolio 30 Jun 2017

Three instantiations of the framework (a CNN, a basic RNN, and an LSTM) are, along with a number of recently reviewed or published portfolio-selection strategies, examined in three back-test experiments with a trading period of 30 minutes in a cryptocurrency market.

Simple random search provides a competitive approach to reinforcement learning

modestyachts/ARS 19 Mar 2018

A common belief in model-free reinforcement learning is that methods based on random search in the parameter space of policies exhibit significantly worse sample complexity than those that explore the space of actions.

Evolution Strategies as a Scalable Alternative to Reinforcement Learning

ray-project/ray 10 Mar 2017

We explore the use of Evolution Strategies (ES), a class of black box optimization algorithms, as an alternative to popular MDP-based RL techniques such as Q-learning and Policy Gradients.
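
To make "black box optimization" concrete, here is a minimal sketch of a Gaussian-perturbation evolution strategy in plain Python. It is not the paper's or Ray's implementation: the function name `evolution_strategy`, the toy quadratic objective, the mean-reward baseline, and all hyperparameter values are assumptions chosen for illustration. The optimizer only ever calls the objective; it never needs gradients or any MDP structure.

```python
import random

# Hypothetical toy problem: the "reward" is highest when params equal TARGET.
TARGET = [0.5, -0.3, 1.0]

def toy_objective(params):
    """Negative squared distance to TARGET (higher is better)."""
    return -sum((p - t) ** 2 for p, t in zip(params, TARGET))

def evolution_strategy(objective, theta, sigma=0.1, alpha=0.05,
                       population=50, iterations=300, seed=0):
    """Maximize `objective` using only function evaluations.

    Each iteration perturbs `theta` with Gaussian noise, scores every
    perturbed candidate, and moves `theta` along the reward-weighted
    average of the noise directions.
    """
    rng = random.Random(seed)
    theta = list(theta)
    dim = len(theta)
    for _ in range(iterations):
        # One standard-normal noise vector per population member.
        noises = [[rng.gauss(0.0, 1.0) for _ in range(dim)]
                  for _ in range(population)]
        rewards = [objective([t + sigma * e for t, e in zip(theta, eps)])
                   for eps in noises]
        # Subtracting the mean reward is a variance-reduction choice
        # made for this sketch (an assumption, not taken from the source).
        baseline = sum(rewards) / population
        for j in range(dim):
            grad_j = sum((r - baseline) * eps[j]
                         for r, eps in zip(rewards, noises))
            theta[j] += alpha * grad_j / (population * sigma)
    return theta

if __name__ == "__main__":
    print(evolution_strategy(toy_objective, [0.0, 0.0, 0.0]))
```

In an RL setting, `objective` would be replaced by the episode return of a policy with the given parameters. Because each candidate is evaluated independently, the evaluations can be spread across workers.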

IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures

deepmind/scalable_agent ICML 2018

In this work we aim to solve a large collection of tasks using a single reinforcement learning agent with a single set of parameters.

ParlAI: A Dialog Research Software Platform

facebookresearch/ParlAI EMNLP 2017

We introduce ParlAI (pronounced "par-lay"), an open-source software platform for dialog research implemented in Python, available at http://parl.ai.