Distributional Reinforcement Learning

31 papers with code • 0 benchmarks • 0 datasets

Value distribution is the distribution of the random return received by a reinforcement learning agent. it been used for a specific purpose such as implementing risk-aware behaviour.

We have random return Z whose expectation is the value Q. This random return is also described by a recursive equation, but one of a distributional nature

Benchmarks

Add a Result

These leaderboards are used to track progress in Distributional Reinforcement Learning

No evaluation results yet. Help compare methods by submitting evaluation metrics.

Most implemented papers

Most implemented Social Latest No code

Unifying Cardiovascular Modelling with Deep Reinforcement Learning for Uncertainty Aware Control of Sepsis Treatment

thxsxth/POMDP_RLSepsis • • 21 Jan 2021

Sepsis is a potentially life threatening inflammatory response to infection or severe tissue damage.

Paper
Code

Distributional Reinforcement Learning with Unconstrained Monotonic Neural Networks

ThibautTheate/Unconstrained-Monotonic-Deep-Q-Network-algorithm • • 6 Jun 2021

The results highlight the main strengths and weaknesses associated with each probability metric together with an important limitation of the Wasserstein distance.

Paper
Code

Conservative Offline Distributional Reinforcement Learning

JasonMa2016/CODAC • • NeurIPS 2021

We prove that CODAC learns a conservative return distribution -- in particular, for finite MDPs, CODAC converges to an uniform lower bound on the quantiles of the return distribution; our proof relies on a novel analysis of the distributional Bellman operator.

Paper
Code

Exploring the Training Robustness of Distributional Reinforcement Learning against Noisy State Observations

datake/robustdistrl • • 17 Sep 2021

In real scenarios, state observations that an agent observes may contain measurement errors or adversarial noises, misleading the agent to take suboptimal actions or even collapse while training.

Paper
Code

A Cramér Distance perspective on Quantile Regression based Distributional Reinforcement Learning

alherit/cr-dqn • • NeurIPS 2021

Distributional reinforcement learning (DRL) extends the value-based approach by approximating the full distribution over future returns instead of the mean only, providing a richer signal that leads to improved performances.

Paper
Code

Distributional Reinforcement Learning for Multi-Dimensional Reward Functions

zpschang/MD3QN • • NeurIPS 2021

To fully inherit the benefits of distributional RL and hybrid reward architectures, we introduce Multi-Dimensional Distributional DQN (MD3QN), which extends distributional RL to model the joint return distribution from multiple reward sources.

Paper
Code

Two steps to risk sensitivity

crgagne/twosteps_neurips2021 • NeurIPS 2021

Distributional reinforcement learning (RL) -- in which agents learn about all the possible long-term consequences of their actions, and not just the expected value -- is of great recent interest.

Paper
Code

Conjugated Discrete Distributions for Distributional Reinforcement Learning

bjliaa/c2d • • 14 Dec 2021

In this work we continue to build upon recent advances in reinforcement learning for finite Markov processes.

Paper
Code

Adaptive Risk-Tendency: Nano Drone Navigation in Cluttered Environments with Distributional Reinforcement Learning

tudelft/risk-sensitive-rl • • 28 Mar 2022

Enabling the capability of assessing risk and making risk-aware decisions is essential to applying reinforcement learning to safety-critical robots like drones.

Paper
Code

Gamma and Vega Hedging Using Deep Distributional Reinforcement Learning

rotmanfinhub/gamma-vega-rl-hedging • • 10 May 2022

We show how D4PG can be used in conjunction with quantile regression to develop a hedging strategy for a trader responsible for derivatives that arrive stochastically and depend on a single underlying asset.

Paper
Code

Distributional Reinforcement Learning

Benchmarks Add a Result

Most implemented papers

Content

Benchmarks

Add a Result