no code implementations • 1 Oct 2019 • Murugeswari Issakkimuthu, Alan Fern, Prasad Tadepalli
There are notable examples of online search improving over hand-coded or learned policies (e. g. AlphaZero) for sequential decision making.
Decision Making