no code implementations • 10 Oct 2020 • Beiran Chen, Yi Zhang, George Iosifidis, Mingming Liu
This paper models the dynamic computational resource allocation problem as a Markov Decision Process (MDP) and designs a model-based reinforcement-learning agent to optimise the dynamic allocation of CPU resources.
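
To make the idea of a model-based agent on an MDP concrete, here is a minimal Python sketch. It is not the authors' implementation, and every specific in it is an assumption made for illustration: the state is a discrete CPU demand level, the action is the number of CPU units allocated, and `cpu_reward`, `estimate_model`, and `value_iteration` are hypothetical names. The agent estimates transition probabilities and rewards by counting observed samples, then plans on the learned model with value iteration.

```python
def cpu_reward(demand, allocated):
    """Hypothetical reward: under-provisioning costs 2 per unit, over-provisioning 1."""
    return -(2.0 * max(demand - allocated, 0) + 1.0 * max(allocated - demand, 0))


def estimate_model(samples, n_states, n_actions):
    """Estimate P[s][a][s'] and R[s][a] by counting (s, a, r, s_next) samples."""
    counts = [[[0] * n_states for _ in range(n_actions)] for _ in range(n_states)]
    reward_sum = [[0.0] * n_actions for _ in range(n_states)]
    for s, a, r, s_next in samples:
        counts[s][a][s_next] += 1
        reward_sum[s][a] += r

    P = [[[0.0] * n_states for _ in range(n_actions)] for _ in range(n_states)]
    R = [[0.0] * n_actions for _ in range(n_states)]
    for s in range(n_states):
        for a in range(n_actions):
            n = sum(counts[s][a])
            if n == 0:
                # Unvisited pair: assume a zero-reward self-loop (an arbitrary default).
                P[s][a][s] = 1.0
                continue
            R[s][a] = reward_sum[s][a] / n
            for s2 in range(n_states):
                P[s][a][s2] = counts[s][a][s2] / n
    return P, R


def value_iteration(P, R, gamma=0.9, tol=1e-8, max_iters=10_000):
    """Plan on the learned model; return state values and a greedy policy."""
    n_states, n_actions = len(P), len(P[0])
    V = [0.0] * n_states
    Q = [[0.0] * n_actions for _ in range(n_states)]
    for _ in range(max_iters):
        Q = [
            [
                R[s][a] + gamma * sum(P[s][a][s2] * V[s2] for s2 in range(n_states))
                for a in range(n_actions)
            ]
            for s in range(n_states)
        ]
        V_new = [max(q) for q in Q]
        delta = max(abs(x - y) for x, y in zip(V_new, V))
        V = V_new
        if delta < tol:
            break
    policy = [max(range(n_actions), key=lambda a: Q[s][a]) for s in range(n_states)]
    return V, policy


# Toy experience: demand cycles 0 -> 1 -> 2 -> 0 regardless of the allocation,
# and every (demand, allocation) pair is observed once.
samples = [(d, a, cpu_reward(d, a), (d + 1) % 3) for d in range(3) for a in range(3)]
P, R = estimate_model(samples, n_states=3, n_actions=3)
V, policy = value_iteration(P, R)
print(policy)
```

In this toy setting the demand does not depend on the allocation, so the planned policy simply matches the allocation to the current demand. A realistic formulation would need a richer state, stochastic transitions, and an exploration strategy, none of which are specified in the text above.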