27 Sep 2017 • Einar Cesar Santos
It is important that an optimal policy account for the resources available to each traffic class while also considering spectral efficiency and other channel-related issues.
Q-Learning reinforcement-learning