no code implementations • 15 Aug 2018 • MinZhong Luo, Li Liu
First, the formula is abstractly expressed as a multiway tree model, and then each step of the formula derivation transformation is abstracted as a mapping of multiway trees.
Q-Learning