Abstract:
Recently, there is an emerging trend to apply deep reinforcement learning to solve the vehicle routing problem (VRP), where a learnt policy governs the selection of next ...Show MoreMetadata
Abstract:
Recently, there is an emerging trend to apply deep reinforcement learning to solve the vehicle routing problem (VRP), where a learnt policy governs the selection of next node for visiting. However, existing methods could not handle well the pairing and precedence relationships in the pickup and delivery problem (PDP), which is a representative variant of VRP. To address this challenging issue, we leverage a novel neural network integrated with a heterogeneous attention mechanism to empower the policy in deep reinforcement learning to automatically select the nodes. In particular, the heterogeneous attention mechanism specifically prescribes attentions for each role of the nodes while taking into account the precedence constraint, i.e., the pickup node must precede the pairing delivery node. Further integrated with a masking scheme, the learnt policy is expected to find higher-quality solutions for solving PDP. Extensive experimental results show that our method outperforms the state-of-the-art heuristic and deep learning model, respectively, and generalizes well to different distributions and problem sizes.
Published in: IEEE Transactions on Intelligent Transportation Systems ( Volume: 23, Issue: 3, March 2022)
Funding Agency:
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Deep Reinforcement Learning ,
- Deep Learning ,
- Pairing ,
- Deep Learning Models ,
- Attention Mechanism ,
- Problem Size ,
- Routing Problem ,
- Node Selection ,
- High-quality Solutions ,
- Role Of Nodes ,
- Precedence Constraints ,
- Precedence Relations ,
- Masking Strategy ,
- Computation Time ,
- Objective Value ,
- Pair Of Nodes ,
- Actor Network ,
- Exact Method ,
- Heuristic Method ,
- Markov Decision Process ,
- Node Embeddings ,
- Types Of Attention ,
- Deep Reinforcement Learning Method ,
- Traveling Salesman Problem ,
- Policy Network ,
- Policy Gradient Method ,
- Deep Reinforcement Learning Model ,
- Delivery Points ,
- Variable Neighbourhood Search ,
- Encoder-decoder Structure
- Author Keywords
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Deep Reinforcement Learning ,
- Deep Learning ,
- Pairing ,
- Deep Learning Models ,
- Attention Mechanism ,
- Problem Size ,
- Routing Problem ,
- Node Selection ,
- High-quality Solutions ,
- Role Of Nodes ,
- Precedence Constraints ,
- Precedence Relations ,
- Masking Strategy ,
- Computation Time ,
- Objective Value ,
- Pair Of Nodes ,
- Actor Network ,
- Exact Method ,
- Heuristic Method ,
- Markov Decision Process ,
- Node Embeddings ,
- Types Of Attention ,
- Deep Reinforcement Learning Method ,
- Traveling Salesman Problem ,
- Policy Network ,
- Policy Gradient Method ,
- Deep Reinforcement Learning Model ,
- Delivery Points ,
- Variable Neighbourhood Search ,
- Encoder-decoder Structure
- Author Keywords