摘要
In this paper, our main goal is to solve optimal control problem by using reinforcement learning (RL) algorithm for marine surface vessel system with known dynamic. And this algorithm is an optimal control algorithm based on policy iteration (PI), and it can obtain the suitable approximations of cost function and the optimized control policy. There are two neural networks (NNs), where critic NN aims to estimate the cost-to-go and actor NN is utilized to design suitable input controller and minimize the tracking error. A novel tuning method is given for critic NN and actor NN. The stability and convergence are proven by Lyapunov's direct method. Finally, the numerical simulations are conducted to demonstrate the feasibility and superiority of presented algorithm.
| 源语言 | 英语 |
|---|---|
| 主期刊名 | Proceedings of the 37th Chinese Control Conference, CCC 2018 |
| 编辑 | Xin Chen, Qianchuan Zhao |
| 出版商 | IEEE Computer Society |
| 页 | 2735-2740 |
| 页数 | 6 |
| ISBN(电子版) | 9789881563941 |
| DOI | |
| 出版状态 | 已出版 - 5 10月 2018 |
| 已对外发布 | 是 |
| 活动 | 37th Chinese Control Conference, CCC 2018 - Wuhan, 中国 期限: 25 7月 2018 → 27 7月 2018 |
出版系列
| 姓名 | Chinese Control Conference, CCC |
|---|---|
| 卷 | 2018-July |
| ISSN(印刷版) | 1934-1768 |
| ISSN(电子版) | 2161-2927 |
会议
| 会议 | 37th Chinese Control Conference, CCC 2018 |
|---|---|
| 国家/地区 | 中国 |
| 市 | Wuhan |
| 时期 | 25/07/18 → 27/07/18 |
联合国可持续发展目标
此成果有助于实现下列可持续发展目标:
-
可持续发展目标 14 水下生物
学术指纹
探究 'Adaptive control of a marine vessel based on reinforcement learning' 的科研主题。它们共同构成独一无二的指纹。引用此
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver