1 code implementation • 25 Jul 2020 • Tiago Costa, Andres Laan, Francisco J. H. Heras, Gonzalo G. de Polavieja
We obtained this result by using recent advances in reinforcement learning to systematically solve the inverse modeling problem: given an observed collective behavior, we automatically find a policy generating it.