TY - GEN
T1 - Underwater Multi-agent Cooperative Formation Hunting Based on Deep Reinforcement Learning
AU - Shi, Xiaobo
AU - Liu, Meiqin
AU - Dong, Shanling
AU - Zheng, Ronghao
AU - Wei, Ping
N1 - Publisher Copyright:
© 2024 Technical Committee on Control Theory, Chinese Association of Automation.
PY - 2024
Y1 - 2024
N2 - In addressing the issue of formation hunting and trajectory planning for multi-autonomous underwater vehicles (AUVs) in complex underwater environments, traditional virtual structure algorithms, and leader-follower models exhibit shortcomings in environmental adaptability and vulnerability to single-point failures. To solve this problem, this article establishes a multi-agent reinforcement learning model with continuous state and action spaces, aiming to optimize the success rate and completion time of the formation hunting task. Furthermore, in establishing the simulation environment for underwater multi-AUVs, a reward function module for the formation hunting task is meticulously designed, taking into account various factors including navigation, formation, efficiency, boundary, and collision avoidance. The efficacy of the proposed methodology was substantiated through a comparative analysis involving the artificial potential field method and the proposed deep reinforcement learning algorithm within the simulation environment. Besides, the efficiency of task execution has improved by approximately 10%, with a success rate approaching 100%.
AB - In addressing the issue of formation hunting and trajectory planning for multi-autonomous underwater vehicles (AUVs) in complex underwater environments, traditional virtual structure algorithms, and leader-follower models exhibit shortcomings in environmental adaptability and vulnerability to single-point failures. To solve this problem, this article establishes a multi-agent reinforcement learning model with continuous state and action spaces, aiming to optimize the success rate and completion time of the formation hunting task. Furthermore, in establishing the simulation environment for underwater multi-AUVs, a reward function module for the formation hunting task is meticulously designed, taking into account various factors including navigation, formation, efficiency, boundary, and collision avoidance. The efficacy of the proposed methodology was substantiated through a comparative analysis involving the artificial potential field method and the proposed deep reinforcement learning algorithm within the simulation environment. Besides, the efficiency of task execution has improved by approximately 10%, with a success rate approaching 100%.
KW - Multi-AUVs
KW - Multi-agent reinforcement learning
KW - collision avoidance
KW - formation hunting
KW - trajectory planning
UR - https://www.scopus.com/pages/publications/85205450668
U2 - 10.23919/CCC63176.2024.10662795
DO - 10.23919/CCC63176.2024.10662795
M3 - 会议稿件
AN - SCOPUS:85205450668
T3 - Chinese Control Conference, CCC
SP - 5722
EP - 5727
BT - Proceedings of the 43rd Chinese Control Conference, CCC 2024
A2 - Na, Jing
A2 - Sun, Jian
PB - IEEE Computer Society
T2 - 43rd Chinese Control Conference, CCC 2024
Y2 - 28 July 2024 through 31 July 2024
ER -