跳到主要导航 跳到搜索 跳到主要内容

Constrained Intrinsic Motivation for Reinforcement Learning

  • Xiang Zheng
  • , Xingjun Ma
  • , Chao Shen
  • , Cong Wang
  • City University of Hong Kong
  • Fudan University

科研成果: 书/报告/会议事项章节会议稿件同行评审

1 引用 (Scopus)

摘要

This paper investigates two fundamental problems that arise when utilizing Intrinsic Motivation (IM) for reinforcement learning in Reward-Free Pre-Training (RFPT) tasks and Exploration with Intrinsic Motivation (EIM) tasks: 1) how to design an effective intrinsic objective in RFPT tasks, and 2) how to reduce the bias introduced by the intrinsic objective in EIM tasks. Existing IM methods suffer from static skills, limited state coverage, sample inefficiency in RFPT tasks, and suboptimality in EIM tasks. To tackle these problems, we propose Constrained Intrinsic Motivation (CIM) for RFPT and EIM tasks, respectively: 1) CIM for RFPT maximizes the lower bound of the conditional state entropy subject to an alignment constraint on the state encoder network for efficient dynamic and diverse skill discovery and state coverage maximization; 2) CIM for EIM leverages constrained policy optimization to adaptively adjust the coefficient of the intrinsic objective to mitigate the distraction from the intrinsic objective. In various MuJoCo robotics environments, we empirically show that CIM for RFPT greatly surpasses fifteen IM methods for unsupervised skill discovery in terms of skill diversity, state coverage, and fine-tuning performance. Additionally, we showcase the effectiveness of CIM for EIM in redeeming intrinsic rewards when task rewards are exposed from the beginning. Our code is available at https://github.com/x-zheng16/CIM.

源语言英语
主期刊名Proceedings of the 33rd International Joint Conference on Artificial Intelligence, IJCAI 2024
编辑Kate Larson
出版商International Joint Conferences on Artificial Intelligence
5608-5616
页数9
ISBN(电子版)9781956792041
出版状态已出版 - 2024
活动33rd International Joint Conference on Artificial Intelligence, IJCAI 2024 - Jeju, 韩国
期限: 3 8月 20249 8月 2024

出版系列

姓名IJCAI International Joint Conference on Artificial Intelligence
ISSN(印刷版)1045-0823

会议

会议33rd International Joint Conference on Artificial Intelligence, IJCAI 2024
国家/地区韩国
Jeju
时期3/08/249/08/24

学术指纹

探究 'Constrained Intrinsic Motivation for Reinforcement Learning' 的科研主题。它们共同构成独一无二的指纹。

引用此