Search Results for author: Matias Alvo

Found 1 papers, 1 papers with code

Neural Inventory Control in Networks via Hindsight Differentiable Policy Optimization

1 code implementation • 20 Jun 2023 • Matias Alvo, Daniel Russo, Yash Kanoria

The first is Hindsight Differentiable Policy Optimization (HDPO), which performs stochastic gradient descent to optimize policy performance while avoiding the need to repeatedly deploy randomized policies in the environment-as is common with generic policy gradient methods.

Management Policy Gradient Methods +2

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.