2 code implementations • 12 Nov 2019 • Qiang Ma, Suwen Ge, Danyang He, Darshan Thaker, Iddo Drori
Furthermore, to approximate solutions to constrained combinatorial optimization problems such as the TSP with time windows, we train hierarchical GPNs (HGPNs) using RL, which learns a hierarchical policy to find an optimal city permutation under constraints.
Ranked #2 on Traveling Salesman Problem on TSPLIB