Attention Based Large Scale Multi-agent Reinforcement Learning

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

7 Scopus citations

Abstract

Learning in large-scale Multi-Agent Reinforcement Learning (MARL) is fundamentally difficult due to the curse of dimensionality. In homogeneous multi-agent settings, mean field theory provides an effective way of scaling MARL to environments with many agents by abstracting the other agents into a single virtual mean agent, which assumes that the impact of each player on the outcome is equal and infinitesimal. However, in some real scenarios, only several neighboring agents affect an agent's decision-making, not all other agents. In addition, different neighboring agents may influence an agent's decision-making to different degrees. In this paper, without restricting ourselves to the homogeneous setting, we propose Adaptive Mean Field Multi-Agent Reinforcement Learning (AMF-MARL), which is based on the attention mechanism and can handle many-agent scenarios in which agents may have different influence relationships with one another. Specifically, we first derive the mean field approximation with adaptive weights. Then, we propose the Adaptive Mean Field Q-learning (AMF-Q) approach and describe how to obtain the adaptive weights. Finally, we conduct experiments to study the learning effectiveness of the proposed approach.
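The contrast between a uniform mean agent and an adaptively weighted one can be sketched in a few lines. The abstract does not give the paper's formulas, so everything below is an illustrative assumption rather than the authors' AMF-Q: the weights come from scaled dot-product attention over hypothetical query and key vectors, and the names `softmax`, `uniform_mean_action`, and `adaptive_mean_action` are invented for this sketch.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def uniform_mean_action(neighbor_actions):
    """Standard mean field view: every neighbor counts equally."""
    n = len(neighbor_actions)
    dim = len(neighbor_actions[0])
    return [sum(a[k] for a in neighbor_actions) / n for k in range(dim)]


def adaptive_mean_action(query, neighbor_keys, neighbor_actions):
    """Attention-weighted mean action (assumed form, not taken from the paper).

    query            -- feature vector of the focal agent (hypothetical)
    neighbor_keys    -- one feature vector per neighbor (hypothetical)
    neighbor_actions -- one action vector (e.g. one-hot) per neighbor
    """
    scale = math.sqrt(len(query))
    # Scaled dot-product score between the focal agent and each neighbor.
    scores = [
        sum(q * k for q, k in zip(query, key)) / scale
        for key in neighbor_keys
    ]
    weights = softmax(scores)
    dim = len(neighbor_actions[0])
    mean = [
        sum(w * a[k] for w, a in zip(weights, neighbor_actions))
        for k in range(dim)
    ]
    return weights, mean


if __name__ == "__main__":
    actions = [[1.0, 0.0], [0.0, 1.0]]
    print(uniform_mean_action(actions))
    print(adaptive_mean_action([2.0, 0.0], [[3.0, 0.0], [0.0, 0.0]], actions))
```

When all neighbors receive the same attention score, the adaptive mean reduces to the uniform mean, which is one way to see the uniform mean field as a special case of the weighted one under these assumptions.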

Original language: English
Title of host publication: 2022 IEEE 5th International Conference on Artificial Intelligence and Big Data, ICAIBD 2022
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 112-117
Number of pages: 6
ISBN (Electronic): 9781665499132
State: Published - 2022
Event: 5th IEEE International Conference on Artificial Intelligence and Big Data, ICAIBD 2022 - Virtual, Chengdu, China
Duration: 27 May 2022 to 30 May 2022

Publication series

Name: 2022 IEEE 5th International Conference on Artificial Intelligence and Big Data, ICAIBD 2022

Conference

Conference: 5th IEEE International Conference on Artificial Intelligence and Big Data, ICAIBD 2022
Country/Territory: China
City: Virtual, Chengdu
Period: 27/05/22 to 30/05/22

Keywords

  • adaptive mean field approximation
  • large scale
  • multi-agent reinforcement learning
