A two-stage framework based on RL for Truck-Drone Collaborative Delivery Problem

  • Yuanbo Li
  • , Chengwei Zhang
  • , Wanting Liu
  • , Chao Li
  • , Dou An
  • , Qi Wang

Research output: Contribution to journalArticlepeer-review

Abstract

With the explosive growth of e-commerce, efficient last-mile delivery has emerged as a critical challenge. Truck-drone collaborative delivery has garnered significant attention as a promising solution to this problem. In this work, we formulate the truck-drone collaborative delivery problem as a collaborative optimization problem, aiming to minimize the total completion time for delivering packages to customers by leveraging the complementary strengths of the drone’s speed and the truck’s endurance. We propose a two-stage framework to address this challenge. In the first stage, the Lin-Kernighan Helsgaun (LKH) algorithm is employed to generate a high-quality initial Traveling Salesman Problem (TSP) solution, serving as a robust starting point. In the second stage, a Sequence Allocate Policy (SAPPO), based on Proximal Policy Optimization, refines the TSP solution by optimizing the truck-drone collaborative path using a specially designed action space. Extensive experiments conducted on both random dataset and TSPLIB benchmarks demonstrate that our method significantly outperforms existing algorithms regarding delivery time, while exhibiting improved scalability and less training time.

Original languageEnglish
JournalIEEE Internet of Things Journal
DOIs
StateAccepted/In press - 2025

Keywords

  • Drones
  • Reinforcement learning
  • Traveling salesman problem
  • Vehicle routing

Fingerprint

Dive into the research topics of 'A two-stage framework based on RL for Truck-Drone Collaborative Delivery Problem'. Together they form a unique fingerprint.

Cite this