H2L: High-Performance Multi-agent Path Finding in High-Obstacle-Density and Large-Size Maps

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

Multi-Agent Path Finding (MAPF) is vital for large-scale Multi-Agent Systems (MAS), where agents must plan collision-free paths to reach their goals. While Reinforcement Learning (RL) methods aim to enhance real-time performance and scalability over search-based approaches, their success on complex maps is limited. This is due to the use of independent RL algorithms, which fail to address the non-stationarity of the environment, and inappropriate reward functions that cause the agent’s policy to worsen with greater distance to the goal. To tackle these issues, we propose a MAPF algorithm based on a new variant of the Value Decomposition Network (VDN), a multi-agent RL algorithm, and introduce a novel reward function. This VDN variant trains the network using agents within a specific agent’s field of view, addressing non-stationarity and training challenges in large-scale MAS, unlike naive VDN, which considers all agents. We introduce a novel reward function using potential-based reward shaping, rendering the agent’s policy independent of the map size. Additionally, we enhance the reward to alleviate congestion by preventing agents from stopping next to each other and by penalizing the following conflicts. Experiments show our planner has a notably higher success rate than other RL-based planners and slightly lower than the latest state-of-the-art search-based planner, LaCAM*, on complex maps. For instance, on a 160 × 160 map with 30% obstacle density and 1024 agents, our planner achieves an 88% success rate, while other RL-based planners achieve virtually 0%.

Original languageEnglish
Title of host publicationArtificial Intelligence and Robotics - 9th International Symposium, ISAIR 2024, Revised Selected Papers
EditorsHuimin Lu
PublisherSpringer Science and Business Media Deutschland GmbH
Pages124-137
Number of pages14
ISBN (Print)9789819629107
DOIs
StatePublished - 2025
Event9th International Symposium on Artificial Intelligence and Robotics, ISAIR 2024 - Guilin, China
Duration: 27 Sep 202430 Sep 2024

Publication series

NameCommunications in Computer and Information Science
Volume2402 CCIS
ISSN (Print)1865-0929
ISSN (Electronic)1865-0937

Conference

Conference9th International Symposium on Artificial Intelligence and Robotics, ISAIR 2024
Country/TerritoryChina
CityGuilin
Period27/09/2430/09/24

Keywords

  • Coordination and cooperation
  • Multi-agent deep reinforcement learning
  • Multi-agent path finding
  • Multi-agent path planning

Fingerprint

Dive into the research topics of 'H2L: High-Performance Multi-agent Path Finding in High-Obstacle-Density and Large-Size Maps'. Together they form a unique fingerprint.

Cite this