TY - JOUR
T1 - Cooperative UAV Trajectory Design for Disaster Area Emergency Communications
T2 - A Multiagent PPO Method
AU - Guan, Yue
AU - Zou, Sai
AU - Peng, Haixia
AU - Ni, Wei
AU - Sun, Yanglong
AU - Gao, Hongfeng
N1 - Publisher Copyright:
© 2014 IEEE.
PY - 2024/3/1
Y1 - 2024/3/1
N2 - This article investigates the issue of cooperative real-time trajectory design for multiple unmanned aerial vehicles (UAVs) to support emergency communication in disaster areas. To restore communication links rapidly between mobile users (MUs) and the ground base stations, UAVs equipped with both radio frequency (RF) modules and free space optics (FSO) modules are utilized as relay nodes. Given the challenges of setting up a central controller for the UAVs and the urgency of emergency communication, the trajectory design problem for these UAVs is formulated as a distributed cooperative optimization problem. Based on the enhanced K-mean algorithm and multiagent PPO (MAPPO) algorithm, a cooperative trajectory design method, abbreviated as KMAPPO, is proposed for the UAVs to minimize interaction overhead and optimize deployment efficiency. Compared to the state-of-the-art deep reinforcement learning (DRL) methods, simulations reveal KMAPPO's superior performance. It converges 32% faster, boosts RF allocation efficiency, and augments FSO communication backhaul capacity.
AB - This article investigates the issue of cooperative real-time trajectory design for multiple unmanned aerial vehicles (UAVs) to support emergency communication in disaster areas. To restore communication links rapidly between mobile users (MUs) and the ground base stations, UAVs equipped with both radio frequency (RF) modules and free space optics (FSO) modules are utilized as relay nodes. Given the challenges of setting up a central controller for the UAVs and the urgency of emergency communication, the trajectory design problem for these UAVs is formulated as a distributed cooperative optimization problem. Based on the enhanced K-mean algorithm and multiagent PPO (MAPPO) algorithm, a cooperative trajectory design method, abbreviated as KMAPPO, is proposed for the UAVs to minimize interaction overhead and optimize deployment efficiency. Compared to the state-of-the-art deep reinforcement learning (DRL) methods, simulations reveal KMAPPO's superior performance. It converges 32% faster, boosts RF allocation efficiency, and augments FSO communication backhaul capacity.
KW - Free space optical (FSO)
KW - K-means
KW - multiagent proximal policy optimization (MAPPO)
KW - radio frequency (RF)
KW - trajectory optimization
KW - unmanned aerial vehicle (UAV)
UR - https://www.scopus.com/pages/publications/85173036236
U2 - 10.1109/JIOT.2023.3320796
DO - 10.1109/JIOT.2023.3320796
M3 - 文章
AN - SCOPUS:85173036236
SN - 2327-4662
VL - 11
SP - 8848
EP - 8859
JO - IEEE Internet of Things Journal
JF - IEEE Internet of Things Journal
IS - 5
ER -