Trajectory planning of mobile robot: A Lyapunov-based reinforcement learning approach with implicit policy

  • Jialun Lai
  • , Zongze Wu
  • , Zhigang Ren
  • , Qi Tan
  • , Shengli Xie

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

Trajectory planning for mobile robots is a crucial aspect of achieving intelligence in many industrial applications. Learning-based approaches are extremely useful for problems involving complex and difficult-to-define rule designs. However, these approaches frequently require a large amount of training data and lack convergence or interpretability. This work proposes a reinforcement learning paradigm that combines implicit policy with Lyapunov theory to solve the problem of mobile robot trajectory planning. Firstly, we develop a weighted asymmetric Lyapunov reward function and provide an analytical solution with modest dynamics as the implicit policy. Then, we propose event-triggered multi-objective policy optimization, an approach that dynamically adjusts optimization objectives based on event-triggered conditions, which organically fuse it into the modified soft Actor-Critic algorithm, thus shrinking the exploration space and enabling iterative improvement of RL policy. We demonstrate that in disturbed and random scenarios, the proposed fusion policy can achieve specialized policy learning and that its convergence, efficiency, and generalization are verifiable. This clearly demonstrates that our approach can be utilized as a foundational paradigm for the design of reinforcement learning reward and motion control in trajectory planning using an end-to-end approach, which has significant advantages in terms of convergence speed and interpretability.

Original languageEnglish
Article number113870
JournalKnowledge-Based Systems
Volume325
DOIs
StatePublished - 5 Sep 2025
Externally publishedYes

Keywords

  • Dynamical system movement
  • Intelligent control
  • Lyapunov theory
  • Mobile robots
  • Reinforcement learning

Fingerprint

Dive into the research topics of 'Trajectory planning of mobile robot: A Lyapunov-based reinforcement learning approach with implicit policy'. Together they form a unique fingerprint.

Cite this