Game-based distributed optimal formation tracking control of underactuated AUVs based on reinforcement learning

被引:13
作者
Wang, Zhengkun [1 ]
Zhang, Lijun [1 ,2 ]
Zhu, Zeyu [2 ]
机构
[1] Harbin Engn Univ, Coll Intelligent Syst Sci & Engn, Harbin 150001, Peoples R China
[2] Northwestern Polytech Univ, Sch Marine & Technol, Xian 710072, Peoples R China
关键词
Distributed optimal; Game; Backstepping; Reinforcement learning; Single critic neural network; UNDERWATER VEHICLES;
D O I
10.1016/j.oceaneng.2023.115879
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
This paper investigates game-based distributed optimal time-varying formation tracking control problem based on backstepping technique and reinforcement learning (RL) for autonomous underwater vehicles (AUVs) with time-varying disturbances and input saturation. The proposed method enables the multi-AUV to track a desired trajectory while holding the time-varying formation. To accomplish this, we adopt a game between leader and followers to enhance robustness of the system to time-varying disturbances. The approximate optimal solution of the established Hamilton-Jacobi-Isaacs (HJI) equation is obtained by RL which performs online via the single critic Neural Network (SCNN). Hence, the optimal solution corresponds to the saddle point equilibrium of the tracking game. In addition, the command filter is used to construct the unknown hydrodynamic parameter and time-varying disturbances. Furthermore, we introduce a quasi-norm to confront input saturation. The effectiveness of the proposed approach is verified by the simulation results provided.
引用
收藏
页数:13
相关论文
共 29 条
[1]   Energy optimal depth control for long range underwater vehicles with applications to a hybrid underwater glider [J].
Claus, Brian ;
Bachmayer, Ralf .
AUTONOMOUS ROBOTS, 2016, 40 (07) :1307-1320
[2]  
Fossen T. I., 2011, HDB MARINE CRAFT HYD, DOI DOI 10.1002/9781119994138
[3]   Spatial Path-Following Control of Underactuated AUV With Multiple Uncertainties and Input Saturation [J].
Guo, Chen ;
Han, Yanan ;
Yu, Haomiao ;
Qin, Jinmeng .
IEEE ACCESS, 2019, 7 :98014-98022
[4]   An Architecture for Using Autonomous Underwater Vehicles in Wireless Sensor Networks for Underwater Pipeline Monitoring [J].
Jawhar, Imad ;
Mohamed, Nader ;
Al-Jaroodi, Jameela ;
Zhang, Sheng .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2019, 15 (03) :1329-1340
[5]   Formation control and obstacle avoidance algorithm of multiple autonomous underwater vehicles(AUVs) based on potential function and behavior rules [J].
Jia, Qiuling ;
Li, Guangwen .
2007 IEEE INTERNATIONAL CONFERENCE ON AUTOMATION AND LOGISTICS, VOLS 1-6, 2007, :569-573
[6]  
Lewis FL, 2012, OPTIMAL CONTROL, 3RD EDITION, P438
[7]   Optimal Tracking Control Based on Integral Reinforcement Learning for An Underactuated Drone [J].
Li, Shaobao ;
Durdevic, Petar ;
Yang, Zhenyu .
IFAC PAPERSONLINE, 2019, 52 (08) :55-60
[8]   An Adaptive SOM Neural Network Method for Distributed Formation Control of a Group of AUVs [J].
Li, Xin ;
Zhu, Daqi .
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2018, 65 (10) :8260-8270
[9]   Cooperative source seeking via networked multi-vehicle systems [J].
Li, Zhuo ;
You, Keyou ;
Song, Shiji .
AUTOMATICA, 2020, 115
[10]   Online Synchronous Approximate Optimal Learning Algorithm for Multiplayer Nonzero-Sum Games With Unknown Dynamics [J].
Liu, Derong ;
Li, Hongliang ;
Wang, Ding .
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2014, 44 (08) :1015-1027