Search Results for author: Matias Alvo

Found 1 papers, 1 papers with code

Neural Inventory Control in Networks via Hindsight Differentiable Policy Optimization

1 code implementation20 Jun 2023 Matias Alvo, Daniel Russo, Yash Kanoria

The first is Hindsight Differentiable Policy Optimization (HDPO), which performs stochastic gradient descent to optimize policy performance while avoiding the need to repeatedly deploy randomized policies in the environment-as is common with generic policy gradient methods.

Management Policy Gradient Methods +2

Cannot find the paper you are looking for? You can Submit a new open access paper.