1 code implementation • 29 Jun 2021 • Alexandre Variengien, Stefano Nichele, Tom Glover, Sidney Pontes-Filho
The observations of the environment are transmitted in input cells, while the values of output cells are used as a readout of the system.
Q-Learning