Two-player zero-sum game based neural critic tracking control for UAUV with unknown disturbance via backstepping method

被引：3

作者：

Che, Gaofeng ^{[1
,2
,3
]}

机构：

[1] Henan Normal Univ, Coll Comp & Informat Engn, Xinxiang 453007, Peoples R China

[2] Key Lab Artificial Intelligence & Personalized Lea, Xinxiang 453007, Peoples R China

[3] Henan Normal Univ, Coll Math & Informat Sci, Xinxiang 453007, Peoples R China

来源：

OCEAN ENGINEERING | 2023年 / 287卷

关键词：

Adaptive dynamic programming (ADP); Zero-sum game; Tracking control; Underactuated autonomous underwater vehicle (UAUV); Unknown disturbance; ADP;

D O I：

10.1016/j.oceaneng.2023.115878

中图分类号：

U6 [水路运输]; P75 [海洋工程];

学科分类号：

0814 ; 081505 ; 0824 ; 082401 ;

摘要：

In this work, a new tracking control scheme is developed for underactuated autonomous underwater vehicle(UAUV) with unknown disturbance. First, an error tracking system is constructed using backstepping method, which transforms the tracking control problem into the two-player zero-sum game problem. One player is the optimal controller which is designed to make the UAUV track the desired trajectory. The other player is the unknown disturbance that represent the overall damping effect of the UAUV. Then, the single critic network based online policy iteration algorithm is designed to get the optimal control law and the worst-case disturbance, which can achieve the near-optimal control performance and relax the requirements for initial admissible control conditions. In order to improve the converge velocity of tracking error, the weight update law are designed. Furthermore, a discount coefficient is introduced into the performance index due to the nonlinearity and the complexity of UAUV. In addition, the stability of the UAUV is analyzed based on the Lyapunov stability theory the system is guaranteed to be uniformly ultimately bounded (UUB). Finally, the effectiveness of the proposed method is demonstrated through the real-time simulations on the UAUV model.

引用

页数：14

共 27 条

[1] Neurodynamic programming and zero-sum games for constrained control systems [J].

Abu-Khalaf, Murad ;

Lewis, Frank L. ;

Huang, Jie .

IEEE TRANSACTIONS ON NEURAL NETWORKS, 2008, 19 (07) :1243-1252

[2]

Che Gaofeng, 2023, Journal of Ambient Intelligence and Humanized Computing, P7265, DOI [10.1007/s12652-022-04435-2, 10.1007/s12652-022-04435-2]

[3] ADP based output-feedback fault-tolerant tracking control for underactuated AUV with actuators faults [J].

Che, Gaofeng ;

Yu, Zhen .

JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (04) :5871-5883

[4] Single critic network based fault-tolerant tracking control for underactuated AUV with actuator fault [J].

Che, Gaofeng .

OCEAN ENGINEERING, 2022, 254

[5] Neural-network estimators based fault-tolerant tracking control for AUV via ADP with rudders faults and ocean current disturbance [J].

Che, Gaofeng ;

Yu, Zhen .

NEUROCOMPUTING, 2020, 411 :442-454

[6] Zero-sum game-based neuro-optimal control of modular robot manipulators with uncertain disturbance using critic only policy iteration [J].

Dong, Bo ;

An, Tianjiao ;

Zhu, Xinye ;

Li, Yuanchun ;

Liu, Keping .

NEUROCOMPUTING, 2021, 450 :183-196

[7] Zero-Sum Game Based Cooperative Control for Onboard Pulsed Power Load Accommodation [J].

Duan, Jiajun ;

Xu, Hao ;

Liu, Wenxin ;

Peng, Jian-Chun ;

Jiang, Hui .

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (01) :238-247

[8] Neural adaptive output feedback tracking control of underactuated AUVs [J].

Fang, Kai ;

Fang, Haolin ;

Zhang, Jiawen ;

Yao, Jiaqi ;

Li, Jiawang .

OCEAN ENGINEERING, 2021, 234

[9] Online Solution of Two-Player Zero-Sum Games for Continuous-Time Nonlinear Systems With Completely Unknown Dynamics [J].

Fu, Yue ;

Chai, Tianyou .

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, 27 (12) :2577-2587

[10]

Heshmati-Alamdari S., 2021, IEEE Trans. Autom. Sci. Eng., V18, P3524

← 1 2 3 →