OpenAI Gym
160 papers with code • 9 benchmarks • 3 datasets
An open-source toolkit from OpenAI that implements several Reinforcement Learning benchmarks including: classic control, Atari, Robotics and MuJoCo tasks.
(Description by Evolutionary learning of interpretable decision trees)
(Image Credit: OpenAI Gym)
Libraries
Use these libraries to find OpenAI Gym models and implementationsMost implemented papers
A Benchmark Environment Motivated by Industrial Control Problems
On one hand, these benchmarks are designed to provide interpretable RL training scenarios and detailed insight into the learning process of the method on hand.
Recurrent Predictive State Policy Networks
Predictive state policy networks consist of a recursive filter, which keeps track of a belief about the state of the environment, and a reactive policy that directly maps beliefs to actions, to maximize the cumulative reward.
Monte Carlo Tree Search for Asymmetric Trees
Asymmetric termination of search trees introduces a type of uncertainty for which the standard upper confidence bound (UCB) formula does not account.
Deep Reinforcement Learning for General Video Game AI
In this paper, we describe how we interface GVGAI to the OpenAI Gym environment, a widely used way of connecting agents to reinforcement learning problems.
Deep Reinforcement Learning with Feedback-based Exploration
We employ binary corrective feedback as a general and intuitive manner to incorporate human intuition and domain knowledge in model-free machine learning.
Towards Interactive Training of Non-Player Characters in Video Games
We propose to create such NPC behaviors interactively by training an agent in the target environment using imitation learning with a human in the loop.
MoËT: Mixture of Expert Trees and its Application to Verifiable Reinforcement Learning
By training Mo\"ET models using an imitation learning procedure on deep RL agents we outperform the previous state-of-the-art technique based on decision trees while preserving the verifiability of the models.
QFlip: An Adaptive Reinforcement Learning Strategy for the FlipIt Security Game
FlipIt is a security game that models attacker-defender interactions in advanced scenarios such as APTs.
Arena: a toolkit for Multi-Agent Reinforcement Learning
We introduce Arena, a toolkit for multi-agent reinforcement learning (MARL) research.
Towards a Reinforcement Learning Environment Toolbox for Intelligent Electric Motor Control
An intelligent controller example based on the deep deterministic policy gradient algorithm which controls a series DC motor is presented and compared to a cascaded PI-controller as a baseline for future research.