no code implementations • 1 Nov 2022 • Remco Coppens, Robbert Reijnen, Yingqian Zhang, Laurens Bliek, Berend Steenhuisen
The DRL policy is trained to adaptively set the values that dictate the intensity and probability of mutation for solutions during optimization.