A New Deep Reinforcement Learning Algorithm for UAV Swarm Confrontation Game

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

UAV swarm confrontation game is a type of intelligent game problem. Multi-agent reinforcement learning theory provides an effective solution for this game. However, when using common multi-agent deep reinforcement learning algorithms, such as the multi-agent deep deterministic policy gradient (MADDPG) algorithm, to train the strategy of UAV swarm, there are issues such as slow convergence speed and weak generalization ability on similar tasks. To address these issues, this paper combines the model-agnostic meta-learning (MAML) algorithm in few-shot learning with the original MADDPG algorithm, and proposes an improved MB-MADDPG algorithm, which is applied to the strategy optimization of a UAV swarm confrontation task. Experimental results show that compared with the original algorithm, the improved algorithm can accelerate the convergence while maintaining the training effect, and the success rate of defense after training with both algorithms exceeds 50%.

Original languageEnglish
Title of host publicationData Mining and Big Data - 8th International Conference, DMBD 2023, Proceedings
EditorsYing Tan, Yuhui Shi
PublisherSpringer Science and Business Media Deutschland GmbH
Pages199-210
Number of pages12
ISBN (Print)9789819708369
DOIs
StatePublished - 2024
Event8th International Conference on Data Mining and Big Data, DMBD 2023 - Sanya, China
Duration: 9 Dec 202312 Dec 2023

Publication series

NameCommunications in Computer and Information Science
Volume2017 CCIS
ISSN (Print)1865-0929
ISSN (Electronic)1865-0937

Conference

Conference8th International Conference on Data Mining and Big Data, DMBD 2023
Country/TerritoryChina
CitySanya
Period9/12/2312/12/23

Keywords

  • Few-shot Learning
  • MADDPG
  • MAML
  • Multi-agent Reinforcement Learning
  • UAV Swarm Confrontation

Fingerprint

Dive into the research topics of 'A New Deep Reinforcement Learning Algorithm for UAV Swarm Confrontation Game'. Together they form a unique fingerprint.

Cite this