Search Results for author: Andrew Grimshaw

Launchpad: Learning to Schedule Using Offline and Online RL Methods

We utilize Offline RL as a launchpad to learn effective scheduling policies from prior experience collected using Oracle or heuristic policies.

Finally, we demonstrate that the DRL scheduler can learn from and improve upon existing heuristic policies using Offline Learning.
