Learning to Escape: Multi-mode Policy Learning for the Traveling Salesmen Problem | IEEE Conference Publication | IEEE Xplore