Abstract
In this paper, our main goal is to solve optimal control problem by using reinforcement learning (RL) algorithm for marine surface vessel system with known dynamic. And this algorithm is an optimal control algorithm based on policy iteration (PI), and it can obtain the suitable approximations of cost function and the optimized control policy. There are two neural networks (NNs), where critic NN aims to estimate the cost-to-go and actor NN is utilized to design suitable input controller and minimize the tracking error. A novel tuning method is given for critic NN and actor NN. The stability and convergence are proven by Lyapunov's direct method. Finally, the numerical simulations are conducted to demonstrate the feasibility and superiority of presented algorithm.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 37th Chinese Control Conference, CCC 2018 |
| Editors | Xin Chen, Qianchuan Zhao |
| Publisher | IEEE Computer Society |
| Pages | 2735-2740 |
| Number of pages | 6 |
| ISBN (Electronic) | 9789881563941 |
| DOIs | |
| State | Published - 5 Oct 2018 |
| Externally published | Yes |
| Event | 37th Chinese Control Conference, CCC 2018 - Wuhan, China Duration: 25 Jul 2018 → 27 Jul 2018 |
Publication series
| Name | Chinese Control Conference, CCC |
|---|---|
| Volume | 2018-July |
| ISSN (Print) | 1934-1768 |
| ISSN (Electronic) | 2161-2927 |
Conference
| Conference | 37th Chinese Control Conference, CCC 2018 |
|---|---|
| Country/Territory | China |
| City | Wuhan |
| Period | 25/07/18 → 27/07/18 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 14 Life Below Water
Keywords
- Actor Neural Networks
- Critic Neural Networks
- Lyapunov Method
- Marine Vessel
- Reinforcement Learning
Fingerprint
Dive into the research topics of 'Adaptive control of a marine vessel based on reinforcement learning'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver