跳到主要导航 跳到搜索 跳到主要内容

Token-based deep reinforcement learning for Heterogeneous VRP with Service Time Constraints

  • Yujun Wang
  • , Xiaopeng Hong
  • , Yabin Wang
  • , Junzhou Zhao
  • , Guanghui Sun
  • , Baoxing Qin
  • Xi'an Jiaotong University
  • Harbin Institute of Technology
  • Gaussian Robotics

科研成果: 期刊稿件文章同行评审

17 引用 (Scopus)

摘要

Heterogeneous Vehicle Routing aims to construct routes for various vehicles while optimizing an objective with a series of constraints. However, existing deep reinforcement learning-based methods often ignore the service time constraints, which prohibits vehicles from leaving current nodes until the service time is met. This limitation restricts their practical application. To address these concerns, we introduce the Heterogeneous Vehicle Routing Problem with Service Time Constraints (HVRP-STC) and formulate it as a Markov Decision Process with Service Time Constraints. We propose a novel deep reinforcement learning-based model, Token-based Deep Reinforcement Learning (TDRL), to solve this problem. To provide sufficient and timely information for decision making, we design a State Token Coding (STC) mechanism that encodes and updates individual and overall vehicle and node states as tokens of different types. To determine the pairs of vehicles and nodes and generate actions, we propose a Heterogeneous Decoder (HD) with a vehicle-selector and multiple vehicle-specific node-selectors. This decouples the vehicle-node selection tasks and customizes the task of choosing nodes to visit for individual vehicles, better catering to the heterogeneous nature of HVRP-STC. We evaluate the proposed method on four types of datasets with instances of different sizes, large spatial coverage, and varied mathematical model. Our results show that TDRL consistently outperforms state-of-the-art DRL methods. We will release the datasets and the source code of this benchmark with the paper via https://github.com/Vision-Intelligence-and-Robots-Group/ToDRL.

源语言英语
文章编号112173
期刊Knowledge-Based Systems
300
DOI
出版状态已出版 - 27 9月 2024

学术指纹

探究 'Token-based deep reinforcement learning for Heterogeneous VRP with Service Time Constraints' 的科研主题。它们共同构成独一无二的指纹。

引用此