1 code implementation • 22 May 2024 • Maximilian Nägele, Jan Olle, Thomas Fösel, Remmy Zen, Florian Marquardt
Their optimal policy typically maximizes the expected sum of rewards given at each step of the decision process.
Portfolio Optimization reinforcement-learning