Adaptive composite learning control of a flexible two-link manipulator with unknown spatiotemporally varying disturbance

Research output: Contribution to journalArticlepeer-review

5 Scopus citations

Abstract

This article presents a novel adaptive composite learning (ACL) control strategy combining reinforcement learning and a disturbance observer (DOB) to address vibration issues in a flexible two-link manipulator (FTLM) system affected by unknown spatiotemporally varying disturbances. Based on the assumed mode method, the FTLM system is initially transformed into an ordinary differential equation model, while effectively capturing the elastic deformation and vibration characteristics of the flexible link. A composite learning controller, based on the actor-critic algorithm and DOB, is then developed to achieve trajectory tracking and vibration suppression in the FTLM system. The DOB in the controller compensates for unknown disturbances resulting in reduced system error. It is noting that the proposed optimal control strategy is continuously gathering system experience and evaluating the current policy's effectiveness. The stability and robustness of the closed-loop system incorporating the composite controller are analyzed using Lyapunov's direct method, and the semi-global uniform ultimate boundedness of the tracking and vibration errors are also demonstrated. To validate the effectiveness and superiority of the proposed ACL controller, comparative simulations and experiments are conducted on the Quanser experimental platform.

Original languageEnglish
Pages (from-to)7764-7785
Number of pages22
JournalInternational Journal of Robust and Nonlinear Control
Volume34
Issue number12
DOIs
StatePublished - Aug 2024
Externally publishedYes

Keywords

  • adaptive control
  • disturbance observer
  • flexible manipulator
  • reinforcement learning
  • vibration control

Fingerprint

Dive into the research topics of 'Adaptive composite learning control of a flexible two-link manipulator with unknown spatiotemporally varying disturbance'. Together they form a unique fingerprint.

Cite this