5 Jan 2024 • Harvey Merton, Thomas Delamore, Karl Stol, Henry Williams
Two state-of-the-art algorithms not previously tested in this context, soft actor-critic (SAC) and adversarial inverse reinforcement learning (AIRL), are used to train models in a representative simulation.