Methodology

Q-Learning

386 papers with code • 0 benchmarks • 2 datasets

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Benchmarks

Add a Result

These leaderboards are used to track progress in Q-Learning

No evaluation results yet. Help compare methods by submitting evaluation metrics.

Libraries

Use these libraries to find Q-Learning models and implementations

opendilab/DI-engine

6 papers

2,513

zzmtsvv/rl_task

6 papers

hill-a/stable-baselines

5 papers

4,038

toni-sm/skrl

5 papers

396

See all 29 libraries.

Datasets

Most implemented papers

Most implemented Social Latest No code

Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning

uber-research/deep-neuroevolution • • 18 Dec 2017

Here we demonstrate they can: we evolve the weights of a DNN with a simple, gradient-free, population-based genetic algorithm (GA) and it performs well on hard deep RL problems, including Atari and humanoid locomotion.

Paper
Code

ViZDoom: A Doom-based AI Research Platform for Visual Reinforcement Learning

mwydmuch/ViZDoom • • 6 May 2016

Here, we propose a novel test-bed platform for reinforcement learning research from raw visual information which employs the first-person perspective in a semi-realistic 3D world.

Paper
Code

rlpyt: A Research Code Base for Deep Reinforcement Learning in PyTorch

astooke/rlpyt • • 3 Sep 2019

rlpyt is designed as a high-throughput code base for small- to medium-scale research in deep RL.

Paper
Code

Continuous Deep Q-Learning with Model-based Acceleration

jakegrigsby/deep_control • • 2 Mar 2016

In this paper, we explore algorithms and representations to reduce the sample complexity of deep reinforcement learning for continuous control tasks.

Paper
Code

Playing FPS Games with Deep Reinforcement Learning

glample/Arnold • • 18 Sep 2016

Advances in deep reinforcement learning have allowed autonomous agents to perform well on Atari games, often outperforming humans, using only raw pixels to make their decisions.

Paper
Code

Optimization of Molecules via Deep Reinforcement Learning

google-research/google-research • • 19 Oct 2018

We present a framework, which we call Molecule Deep $Q$-Networks (MolDQN), for molecule optimization by combining domain knowledge of chemistry and state-of-the-art reinforcement learning techniques (double $Q$-learning and randomized value functions).

Paper
Code

DeepTraffic: Crowdsourced Hyperparameter Tuning of Deep Reinforcement Learning Systems for Multi-Agent Dense Traffic Navigation

lexfridman/deeptraffic • 9 Jan 2018

We present a traffic simulation named DeepTraffic where the planning systems for a subset of the vehicles are handled by a neural network as part of a model-free, off-policy reinforcement learning process.

Paper
Code

Randomized Ensembled Double Q-Learning: Learning Fast Without a Model

watchernyu/REDQ • • ICLR 2021

Using a high Update-To-Data (UTD) ratio, model-based methods have recently achieved much higher sample efficiency than previous model-free methods for continuous-action DRL benchmarks.

Paper
Code

Hierarchical Reinforcement Learning with the MAXQ Value Function Decomposition

borea17/efficient_rl • 21 May 1999

The paper presents an online model-free learning algorithm, MAXQ-Q, and proves that it converges wih probability 1 to a kind of locally-optimal policy known as a recursively optimal policy, even in the presence of the five kinds of state abstraction.

Paper
Code

Deep Recurrent Q-Learning for Partially Observable MDPs

marload/DeepRL-TensorFlow2 • • 23 Jul 2015

Deep Reinforcement Learning has yielded proficient controllers for complex tasks.

Paper
Code

Q-Learning

Benchmarks Add a Result

Libraries

Datasets

Most implemented papers

Content

Benchmarks

Add a Result