Skip to main navigation Skip to search Skip to main content

Synergetic attention-driven transformer: A deep reinforcement learning approach for vehicle routing problems

  • Qingshu Guan
  • , Hui Cao
  • , Lixin Jia
  • , Dapeng Yan
  • , Badong Chen
  • Xi'an Jiaotong University

Research output: Contribution to journalArticlepeer-review

18 Scopus citations

Abstract

Recently, deep reinforcement learning (DRL) has emerged as a promising avenue for tackling vehicle routing problems (VRPs). However, prevailing methods predominantly concentrate on learning construction policies, which often yield suboptimal outcomes in terms of generalization. In this study, we build upon an encoder–decoder framework and propose a synergetic attention-driven transformer (SAT) to acquire improvement policies, which strive to refine the current complete solution iteratively. During the encoding phase, we devise a multi-head synergetic attention mechanism to simultaneously learn node feature tokens and access index tokens in a dual pathway, thereby mitigating potential noises and incompatible correlations. Furthermore, we employ a cyclic mapping strategy to embed access sequences, enabling the capture of the inherent circularity and symmetry attributes within VRP solutions. In the decoding phase, we incorporate multiple decoders and formulate two regularization losses from perspectives of external policies and internal parameters to enhance the diversity of alternative solutions, thus broadening the search space efficiently. In the training phase, we utilize proximal policy optimization to ensure rapid and stable convergence of our model. Finally, we evaluate the efficacy of our method on two representative VRPs, namely the traveling salesman problem (TSP) and the capacitated VRP (CVRP). Comprehensive experiments on both synthetic and real-world benchmarks demonstrate that our SAT outperforms heuristic and DRL methods by a significant margin of up to 8.56% in precision, and also exhibits much better generalization performance on problems across varying sizes and distributions.

Original languageEnglish
Article number126961
JournalExpert Systems with Applications
Volume274
DOIs
StatePublished - 15 May 2025

Keywords

  • Deep reinforcement learning
  • Markov decision process
  • Multi-head attention
  • Vehicle routing problem

Fingerprint

Dive into the research topics of 'Synergetic attention-driven transformer: A deep reinforcement learning approach for vehicle routing problems'. Together they form a unique fingerprint.

Cite this