A Multitask-Based Transfer Framework for Cooperative Multi-Agent Reinforcement Learning

  • Cheng Hu
  • , Chenxu Wang
  • , Weijun Luo
  • , Chaowen Yang
  • , Liuyu Xiang
  • , Zhaofeng He

Research output: Contribution to journalArticlepeer-review

3 Scopus citations

Abstract

Multi-agent reinforcement learning (MARL) has proven to be effective and promising in team collaboration tasks. Knowledge transfer in MARL has also received increasing attention. Compared to knowledge transfer in single-agent tasks, knowledge transfer in multi-agent tasks is more complex due to the need to account for coordination among agents. However, existing knowledge transfer-based methods only focus on strategies or agent-level knowledge in a single task, and the transfer of knowledge in such a specific task to new and different types of tasks is likely to fail. In this paper, we propose a multitask-based training framework termed MTT in cooperative MARL, which aims to learn shared collaborative knowledge across multiple tasks simultaneously and then apply it to solve other related tasks. However, models obtained from multitask learning may fail on other tasks because the gradients from different tasks may conflict with each other. To obtain a model with shared knowledge, we provide conflict-free updates by ensuring a positive dot product between the final update and the gradient of each specific task. It also maintains a consistent optimization rate for all tasks. Experiments conducted in two popular environments, StarCraft II Multi-Agent Challenge and Google Research Football, demonstrate that our method outperforms the baselines, significantly improving the efficiency of team collaboration.

Original languageEnglish
Article number2216
JournalApplied Sciences (Switzerland)
Volume15
Issue number4
DOIs
StatePublished - Feb 2025
Externally publishedYes

Keywords

  • cooperative multi-agent reinforcement learning
  • multitask learning
  • transfer learning

Fingerprint

Dive into the research topics of 'A Multitask-Based Transfer Framework for Cooperative Multi-Agent Reinforcement Learning'. Together they form a unique fingerprint.

Cite this