跳到主要导航 跳到搜索 跳到主要内容

Trajectory planning of mobile robot: A Lyapunov-based reinforcement learning approach with implicit policy

  • Jialun Lai
  • , Zongze Wu
  • , Zhigang Ren
  • , Qi Tan
  • , Shengli Xie
  • Guangzhou Maritime University
  • Guangdong University of Technology
  • Shenzhen University
  • Guangdong Laboratory of Artificial Intelligence and Digital Economy (SZ)
  • Key Lab of the Ministry of Education for Process Control and Efficiency Egineering

科研成果: 期刊稿件文章同行评审

4 引用 (Scopus)

摘要

Trajectory planning for mobile robots is a crucial aspect of achieving intelligence in many industrial applications. Learning-based approaches are extremely useful for problems involving complex and difficult-to-define rule designs. However, these approaches frequently require a large amount of training data and lack convergence or interpretability. This work proposes a reinforcement learning paradigm that combines implicit policy with Lyapunov theory to solve the problem of mobile robot trajectory planning. Firstly, we develop a weighted asymmetric Lyapunov reward function and provide an analytical solution with modest dynamics as the implicit policy. Then, we propose event-triggered multi-objective policy optimization, an approach that dynamically adjusts optimization objectives based on event-triggered conditions, which organically fuse it into the modified soft Actor-Critic algorithm, thus shrinking the exploration space and enabling iterative improvement of RL policy. We demonstrate that in disturbed and random scenarios, the proposed fusion policy can achieve specialized policy learning and that its convergence, efficiency, and generalization are verifiable. This clearly demonstrates that our approach can be utilized as a foundational paradigm for the design of reinforcement learning reward and motion control in trajectory planning using an end-to-end approach, which has significant advantages in terms of convergence speed and interpretability.

源语言英语
文章编号113870
期刊Knowledge-Based Systems
325
DOI
出版状态已出版 - 5 9月 2025
已对外发布

学术指纹

探究 'Trajectory planning of mobile robot: A Lyapunov-based reinforcement learning approach with implicit policy' 的科研主题。它们共同构成独一无二的指纹。

引用此