Federated multi-objective reinforcement learning

  • Xi'an Jiaotong University

Research output: Contribution to journal › Article › peer-review

7 Citations (Scopus)

Abstract

Multi-objective reinforcement learning (MORL) has significant potential for solving complex decision problems with conflicting objectives. Because MORL needs large numbers of training samples, federated MORL in large-scale distributed settings is a promising direction. However, it still suffers from poor efficiency and high privacy risks. To mitigate the inefficiency, we first propose a novel probabilistic algorithm, PMORL, which seeks an optimal policy efficiently via the expectation maximization (EM) algorithm. To extend PMORL to distributed settings with privacy protection, we then present the first federated MORL algorithm, Fed-PMORL, with client-level differential privacy (DP). In Fed-PMORL, personalized actors are trained and maintained at local clients, whereas critics are aggregated and sanitized at the central server. Extensive experimental results in benchmark MORL environments demonstrate that Fed-PMORL under DP guarantees can achieve superior performance with high efficiency. In particular, compared with the state-of-the-art methods, PMORL and Fed-PMORL can save up to 50% of the training episodes needed to reach the same model utility. With a sufficient number of clients (e.g., 1000 clients), Fed-PMORL with a formal DP guarantee shows utility comparable to that of the non-private algorithm.
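The abstract does not spell out how critics are "aggregated and sanitized", so the following is only a minimal sketch of one common recipe for client-level DP (clip each client's update, average, add Gaussian noise), not the paper's actual mechanism. The function names, the flat-list representation of a critic update, and the parameters `clip_norm` and `noise_multiplier` are all assumptions made for illustration.

```python
import math
import random


def clip_update(update, clip_norm):
    """Scale an update down so its L2 norm is at most clip_norm."""
    norm = math.sqrt(sum(x * x for x in update))
    if norm <= clip_norm or norm == 0.0:
        return list(update)
    scale = clip_norm / norm
    return [x * scale for x in update]


def sanitize_and_aggregate(client_updates, clip_norm, noise_multiplier, rng=None):
    """Average clipped client updates and add Gaussian noise to the mean.

    Hypothetical server-side step: each element of client_updates is one
    client's critic update, flattened to a list of floats of equal length.
    """
    if not client_updates:
        raise ValueError("need at least one client update")
    rng = rng or random.Random()
    n = len(client_updates)
    dim = len(client_updates[0])
    # Clipping bounds how much any single client can move the average.
    clipped = [clip_update(u, clip_norm) for u in client_updates]
    mean = [sum(u[i] for u in clipped) / n for i in range(dim)]
    # Noise std on the mean is noise_multiplier * clip_norm / n,
    # so it shrinks as the number of clients grows.
    std = noise_multiplier * clip_norm / n
    return [m + rng.gauss(0.0, std) for m in mean]
```

In this sketch the noise added to the averaged update scales as 1/n for n clients, which gives some intuition for why a private variant could approach non-private utility when many clients participate (the abstract mentions 1000). Translating `noise_multiplier` into a formal (ε, δ) guarantee requires a privacy accountant, which is omitted here.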

Original language: English
Pages (from-to): 811-832
Number of pages: 22
Journal: Information Sciences
Volume: 624
DOI
Publication status: Published - May 2023
