Dynamic embedding-based deep reinforcement learning for heterogeneous capacitated VRPs with unloading time constraints

  • Qingshu Guan
  • , Shuangsi Xue
  • , Junkai Tan
  • , Lixin Jia
  • , Hui Cao
  • , Badong Chen

Research output: Contribution to journalArticlepeer-review

3 Scopus citations

Abstract

Capacitated vehicle routing problems (CVRPs) have garnered growing attention due to their extensive applications across various fields. However, existing deep reinforcement learning (DRL) approaches often cope with homogeneous vehicle fleets, failing to account for differences in vehicle capacities and speeds. Moreover, these methods typically overlook the real-life constraint of unloading time, where vehicles cannot depart until all goods are delivered. These limitations intrinsically restrict their practical applications. To address these issues, we introduce a heterogeneous CVRP with unloading time constraints (HCVRP-UTC) and propose a dynamic embedding-based DRL (DE-DRL) for tackling it. Our approach leverages an innovative encoder-updater-decoder (EUD) framework. Specifically, the encoder generates feature embeddings for both customer nodes and heterogeneous vehicles, while the updater iteratively refines these embeddings, incorporating both static customer data and dynamic vehicle information, to capture the real-time state variation and provide sufficient clues for decision-making. Subsequently, the decoder decouples the complicated problem into a series of recursive vehicle-selection and vehicle-specific node-selection tasks, enhancing the precision and efficiency of route planning. Finally, we evaluate the proposed approach on both synthetic and real-world datasets of varying scales and distributions. Experimental results demonstrate that our DE-DRL consistently outperforms heuristic and state-of-the-art DRL-based methods, reducing optimality gaps by up to 13.53 %. Notably, DE-DRL also exhibits superior generalization performance, extending its applicability to broader real-world scenarios.

Original languageEnglish
Article number128660
JournalExpert Systems with Applications
Volume293
DOIs
StatePublished - 1 Dec 2025

Keywords

  • Attention mechanism
  • Capacitated vehicle routing problem
  • Decision-making
  • Deep reinforcement learning
  • Task scheduling

Fingerprint

Dive into the research topics of 'Dynamic embedding-based deep reinforcement learning for heterogeneous capacitated VRPs with unloading time constraints'. Together they form a unique fingerprint.

Cite this