Sliding mode heading control for AUV based on continuous hybrid model-free and model-based reinforcement learning

被引:23
作者
Wang, Dianrui [1 ]
Shen, Yue [1 ]
Wan, Junhe [1 ]
Sha, Qixin [1 ]
Li, Guangliang [1 ]
Chen, Guanzhong [1 ]
He, Bo [1 ]
机构
[1] Ocean Univ China, Sch Informat Sci & Engn, Qingdao 266000, Shandong, Peoples R China
基金
中国国家自然科学基金;
关键词
Autonomous underwater vehicle (AUV); Model-based reinforcement learning; Model-free reinforcement learning; Deterministic policy gradient (DPG); Sliding mode control (SMC); NONLINEAR-SYSTEMS; ADAPTIVE-CONTROL; PID CONTROL; DESIGN;
D O I
10.1016/j.apor.2021.102960
中图分类号
P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
For autonomous underwater vehicles (AUVs), control over AUV heading is of key importance to enable highperformance locomotion control. In this study, the heading control is achieved by using the robust sliding mode control (SMC) method. The performance of the controller can be seriously affected by its parameters. However, it is time-consuming and labor-intensive to manually adjust the parameters. Most of the existing methods rely on the accurate AUV model or prior knowledge, which are difficult to obtain. Therefore, this study is concerned with the problem of automatically tuning the SMC parameters through reinforcement learning (RL). First, an AUV dynamic model with and without current influence was successfully established. Second, a continuous hybrid Model-based Model-free (MbMf) RL method based on the deterministic policy gradient was introduced and explained. Then, the framework for tuning the parameters of SMC by the RL method was described. Finally, to demonstrate the robustness and effectiveness of our approach, extensive numerical simulations were conducted on the established AUV model. The results show that our method can automatically tune the SMC parameters. The performance is more effective than SMC with fixed parameters or SMC with a purely model-free learner.
引用
收藏
页数:14
相关论文
共 36 条
  • [1] Proximate time optimal for the heading control of underactuated autonomous underwater vehicle with input nonlinearities
    An, Li
    Li, Ye
    Cao, Jian
    Jiang, Yanqing
    He, Jiayu
    Wu, Haowei
    [J]. APPLIED OCEAN RESEARCH, 2020, 95
  • [2] PID control system analysis, design, and technology
    Ang, KH
    Chong, G
    Li, Y
    [J]. IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2005, 13 (04) : 559 - 576
  • [3] Bansal Somil., 2017, MBMF: Model-Based Priors for Model-Free Reinforcement Learning
  • [4] Indirect Adaptive Control for Higher Order Sliding Mode
    Barth, Alexander
    Reger, Johann
    Moreno, Jaime A.
    [J]. IFAC PAPERSONLINE, 2018, 51 (13): : 591 - 596
  • [5] Bejar E, 2018, IEEE INT SYMP SIGNAL, P202, DOI 10.1109/ISSPIT.2018.8642777
  • [6] Incremental Q-learning strategy for adaptive PID control of mobile robots
    Carlucho, Ignacio
    De Paula, Mariano
    Villar, Sebastian A.
    Acosta, Gerardo G.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2017, 80 : 183 - 199
  • [7] Fuzzy Categorical Deep Reinforcement Learning of a Defensive Game for an Unmanned Surface Vessel
    Cheng, Yin
    Sun, Zhijian
    Huang, Yuexin
    Zhang, Weidong
    [J]. INTERNATIONAL JOURNAL OF FUZZY SYSTEMS, 2019, 21 (02) : 592 - 606
  • [8] Neural network fuzzy sliding mode control of pneumatic muscle actuators
    Chiang, Chia-Jui
    Chen, Ying-Chen
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2017, 65 : 68 - 86
  • [9] Event-Triggered Adaptive Integral Higher-Order Sliding Mode Control for Load Frequency Problems in Multi-area Power Systems
    Dev, Ark
    Sarkar, Mrinal Kanti
    Asthana, Pankhuri
    Narzary, Daijiry
    [J]. IRANIAN JOURNAL OF SCIENCE AND TECHNOLOGY-TRANSACTIONS OF ELECTRICAL ENGINEERING, 2019, 43 (01) : 137 - 152
  • [10] Erez T., 2017, CONTINUOUS CONTROL D