Policy Gradient Methods

89 papers with code • 0 benchmarks • 2 datasets

This task has no description! Would you like to contribute one?

Libraries

Use these libraries to find Policy Gradient Methods models and implementations
2 papers
1,152
2 papers
614
See all 7 libraries.

Online Portfolio Management via Deep Reinforcement Learning with High-Frequency Data

jiahaoli57/LSRE-CAAN Information Processing & Management 2023

In addition, while the vast majority of SOTA strategies maintain a poor turnover rate of approximately greater than 50% on average, our framework enjoys a relatively low turnover rate on all datasets, efficiency analysis illustrates that our framework no longer has the quadratic dependency limitation.

13
01 May 2023

Distributional constrained reinforcement learning for supply chain optimization

jaimesabalimperial/jaisalab 3 Feb 2023

We introduce Distributional Constrained Policy Optimization (DCPO), a novel approach for reliable constraint satisfaction in RL.

8
03 Feb 2023

Partial advantage estimator for proximal policy optimization

ubiquition/drl 26 Jan 2023

Estimation of value in policy gradient methods is a fundamental problem.

21
26 Jan 2023

Policy Gradient in Robust MDPs with Global Convergence Guarantee

jerrisonwang/icml-drpg 20 Dec 2022

In contrast with prior robust policy gradient algorithms, DRPG monotonically reduces approximation errors to guarantee convergence to a globally optimal policy in tabular RMDPs.

4
20 Dec 2022

Is Reinforcement Learning (Not) for Natural Language Processing: Benchmarks, Baselines, and Building Blocks for Natural Language Policy Optimization

allenai/rl4lms 3 Oct 2022

To help answer this, we first introduce an open-source modular library, RL4LMs (Reinforcement Learning for Language Models), for optimizing language generators with RL.

2,077
03 Oct 2022

Continuous MDP Homomorphisms and Homomorphic Policy Gradient

sahandrez/homomorphic_policy_gradient 15 Sep 2022

Abstraction has been widely studied as a way to improve the efficiency and generalization of reinforcement learning algorithms.

18
15 Sep 2022

The Performance Impact of Combining Agent Factorization with Different Learning Algorithms for Multiagent Coordination

stavrosgreece/MultiAgentLearning SETN 2022

In this work, we explore if the performance impact of agent factorization is different when using different learning algorithms in multiagent coordination set- tings.

3
09 Sep 2022

Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning

ml-jku/reactive-exploration 12 Jul 2022

Therefore, exploration strategies and learning methods are required that are capable of tracking the steady domain shifts, and adapting to them.

14
12 Jul 2022

The Sufficiency of Off-Policyness and Soft Clipping: PPO is still Insufficient according to an Off-Policy Measure

raincchio/p3o 20 May 2022

The popular Proximal Policy Optimization (PPO) algorithm approximates the solution in a clipped policy space.

10
20 May 2022

Synthesis of Stabilizing Recurrent Equilibrium Network Controllers

neelayjunnarkar/stabilizing-ren 31 Mar 2022

We propose a parameterization of a nonlinear dynamic controller based on the recurrent equilibrium network, a generalization of the recurrent neural network.

0
31 Mar 2022