Methodology

Q-Learning

390 papers with code • 0 benchmarks • 2 datasets

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Benchmarks

Add a Result

These leaderboards are used to track progress in Q-Learning

No evaluation results yet. Help compare methods by submitting evaluation metrics.

Libraries

Use these libraries to find Q-Learning models and implementations

opendilab/DI-engine

6 papers

2,622

zzmtsvv/rl_task

6 papers

hill-a/stable-baselines

5 papers

4,056

toni-sm/skrl

5 papers

413

See all 29 libraries.

Datasets

Most implemented papers

Most implemented Social Latest No code

Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble

corl-team/CORL • • NeurIPS 2021

However, prior methods typically require accurate estimation of the behavior policy or sampling from OOD data points, which themselves can be a non-trivial problem.

Paper
Code

Deep Reinforcement Learning with a Natural Language Action Space

jvking/text-games • ACL 2016

This paper introduces a novel architecture for reinforcement learning with deep neural networks designed to handle state and action spaces characterized by natural language, as found in text-based games.

Paper
Code

Taming the Noise in Reinforcement Learning via Soft Updates

jollyraven100/Quant_algorithmic-trading_and_More • 28 Dec 2015

We propose G-learning, a new off-policy learning algorithm that regularizes the value estimates by penalizing deterministic policies in the beginning of the learning process.

Paper
Code

Learning to Communicate with Deep Multi-Agent Reinforcement Learning

iassael/learning-to-communicate • • NeurIPS 2016

We consider the problem of multiple agents sensing and acting in environments with the goal of maximising their shared utility.

Paper
Code

Reinforcement Learning with Deep Energy-Based Policies

haarnoja/softqlearning • ICML 2017

We propose a method for learning expressive energy-based policies for continuous states and actions, which has been feasible only in tabular domains before.

Paper
Code

Mean Field Multi-Agent Reinforcement Learning

mlii/mfrl • ICML 2018

Existing multi-agent reinforcement learning methods are limited typically to a small number of agents.

Paper
Code

Deep Reinforcement Learning for Traffic Light Control in Vehicular Networks

Ring367/A-Deep-Reinforcement-Learning-Network-for-Traffic-Light-Cycle-Control • • 29 Mar 2018

In terms of how to dynamically adjust traffic signals' duration, existing works either split the traffic signal into equal duration or extract limited traffic information from the real data.

Paper
Code

Deep Quality-Value (DQV) Learning

paintception/Deep-Quality-Value-DQV-Learning- • • 30 Sep 2018

We introduce a novel Deep Reinforcement Learning (DRL) algorithm called Deep Quality-Value (DQV) Learning.

Paper
Code

Adversarial Learning of a Sampler Based on an Unnormalized Distribution

ChunyuanLI/RAS • • 3 Jan 2019

We investigate adversarial learning in the case when only an unnormalized form of the density can be accessed, rather than samples.

Paper
Code

Deep Reinforcement Learning for Imbalanced Classification

linenus/DRL-For-imbalanced-Classification • • 5 Jan 2019

The agent finally finds an optimal classification policy in imbalanced data under the guidance of specific reward function and beneficial learning environment.

Paper
Code

Q-Learning

Benchmarks Add a Result

Libraries

Datasets

Most implemented papers

Content

Benchmarks

Add a Result