reinforcement-learning

We introduce a new, efficient, principled and backpropagation-compatible algorithm for learning a probability distribution on the weights of a neural network, called Bayes by Backprop.

Paper
Code

Rainbow: Combining Improvements in Deep Reinforcement Learning

thu-ml/tianshou • • 6 Oct 2017

The deep reinforcement learning community has made several independent improvements to the DQN algorithm.

Paper
Code

Self-critical Sequence Training for Image Captioning

ruotianluo/ImageCaptioning.pytorch • • CVPR 2017

In this paper we consider the problem of optimizing image captioning systems using reinforcement learning, and show that by carefully optimizing our systems using the test metrics of the MSCOCO task, significant gains in performance can be realized.

Paper
Code

Multi-Goal Reinforcement Learning: Challenging Robotics Environments and Request for Research

DartEnv/dart-env • • 26 Feb 2018

The purpose of this technical report is two-fold.

Paper
Code

A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem

ZhengyaoJiang/PGPortfolio • • 30 Jun 2017

They are, along with a number of recently reviewed or published portfolio-selection strategies, examined in three back-test experiments with a trading period of 30 minutes in a cryptocurrency market.

Paper
Code

Simple random search provides a competitive approach to reinforcement learning

modestyachts/ARS • 19 Mar 2018

A common belief in model-free reinforcement learning is that methods based on random search in the parameter space of policies exhibit significantly worse sample complexity than those that explore the space of actions.

Paper
Code

Evolution Strategies as a Scalable Alternative to Reinforcement Learning

ray-project/ray • 10 Mar 2017

We explore the use of Evolution Strategies (ES), a class of black box optimization algorithms, as an alternative to popular MDP-based RL techniques such as Q-learning and Policy Gradients.

Paper
Code