TY - JOUR
T1 - Dynamic embedding-based deep reinforcement learning for heterogeneous capacitated VRPs with unloading time constraints
AU - Guan, Qingshu
AU - Xue, Shuangsi
AU - Tan, Junkai
AU - Jia, Lixin
AU - Cao, Hui
AU - Chen, Badong
N1 - Publisher Copyright:
© 2025 Elsevier Ltd
PY - 2025/12/1
Y1 - 2025/12/1
N2 - Capacitated vehicle routing problems (CVRPs) have garnered growing attention due to their extensive applications across various fields. However, existing deep reinforcement learning (DRL) approaches often cope with homogeneous vehicle fleets, failing to account for differences in vehicle capacities and speeds. Moreover, these methods typically overlook the real-life constraint of unloading time, where vehicles cannot depart until all goods are delivered. These limitations intrinsically restrict their practical applications. To address these issues, we introduce a heterogeneous CVRP with unloading time constraints (HCVRP-UTC) and propose a dynamic embedding-based DRL (DE-DRL) for tackling it. Our approach leverages an innovative encoder-updater-decoder (EUD) framework. Specifically, the encoder generates feature embeddings for both customer nodes and heterogeneous vehicles, while the updater iteratively refines these embeddings, incorporating both static customer data and dynamic vehicle information, to capture the real-time state variation and provide sufficient clues for decision-making. Subsequently, the decoder decouples the complicated problem into a series of recursive vehicle-selection and vehicle-specific node-selection tasks, enhancing the precision and efficiency of route planning. Finally, we evaluate the proposed approach on both synthetic and real-world datasets of varying scales and distributions. Experimental results demonstrate that our DE-DRL consistently outperforms heuristic and state-of-the-art DRL-based methods, reducing optimality gaps by up to 13.53 %. Notably, DE-DRL also exhibits superior generalization performance, extending its applicability to broader real-world scenarios.
AB - Capacitated vehicle routing problems (CVRPs) have garnered growing attention due to their extensive applications across various fields. However, existing deep reinforcement learning (DRL) approaches often cope with homogeneous vehicle fleets, failing to account for differences in vehicle capacities and speeds. Moreover, these methods typically overlook the real-life constraint of unloading time, where vehicles cannot depart until all goods are delivered. These limitations intrinsically restrict their practical applications. To address these issues, we introduce a heterogeneous CVRP with unloading time constraints (HCVRP-UTC) and propose a dynamic embedding-based DRL (DE-DRL) for tackling it. Our approach leverages an innovative encoder-updater-decoder (EUD) framework. Specifically, the encoder generates feature embeddings for both customer nodes and heterogeneous vehicles, while the updater iteratively refines these embeddings, incorporating both static customer data and dynamic vehicle information, to capture the real-time state variation and provide sufficient clues for decision-making. Subsequently, the decoder decouples the complicated problem into a series of recursive vehicle-selection and vehicle-specific node-selection tasks, enhancing the precision and efficiency of route planning. Finally, we evaluate the proposed approach on both synthetic and real-world datasets of varying scales and distributions. Experimental results demonstrate that our DE-DRL consistently outperforms heuristic and state-of-the-art DRL-based methods, reducing optimality gaps by up to 13.53 %. Notably, DE-DRL also exhibits superior generalization performance, extending its applicability to broader real-world scenarios.
KW - Attention mechanism
KW - Capacitated vehicle routing problem
KW - Decision-making
KW - Deep reinforcement learning
KW - Task scheduling
UR - https://www.scopus.com/pages/publications/105008645000
U2 - 10.1016/j.eswa.2025.128660
DO - 10.1016/j.eswa.2025.128660
M3 - 文章
AN - SCOPUS:105008645000
SN - 0957-4174
VL - 293
JO - Expert Systems with Applications
JF - Expert Systems with Applications
M1 - 128660
ER -