3 code implementations • 9 Nov 2019 • J. Gomez Robles, J. Vanschoren
We empirically investigate the agent's behavior during training when challenged to design chain-structured neural architectures for three datasets with increasing levels of hardness, to later fix the policy and evaluate it on two unseen datasets of different difficulty.