Adaptive optimal trajectory tracking control of AUVs based on reinforcement learning

被引:24
作者
Li, Zhifu [1 ]
Wang, Ming [1 ]
Ma, Ge [1 ]
机构
[1] Guangzhou Univ, Sch Mech & Elect Engn, Guangzhou 510006, Peoples R China
基金
中国国家自然科学基金;
关键词
Reinforcement learning (RL); Optimal control; Neural networks (NNs); Autonomous underwater vehicle (AUV); Input saturation; SYSTEMS; VEHICLES;
D O I
10.1016/j.isatra.2022.12.003
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, an adaptive model-free optimal reinforcement learning (RL) neural network (NN) control scheme based on filter error is proposed for the trajectory tracking control problem of an autonomous underwater vehicle (AUV) with input saturation. Generally, the optimal control is realized by solving the Hamilton-Jacobi-Bellman (HJB) equation. However, due to its inherent nonlinearity and complexity, the HJB equation of AUV dynamics is challenging to solve. To deal with this problem, an RL strategy based on an actor-critic framework is proposed to approximate the solution of the HJB equation, where actor and critic NNs are used to perform control behavior and evaluate control performance, respectively. In addition, for the AUV system with the second-order strict-feedback dynamic model, the optimal controller design method based on filtering errors is proposed for the first time to simplify the controller design and accelerate the response speed of the system. Then, to solve the model-dependent problem, an extended state observer (ESO) is designed to estimate the unknown nonlinear dynamics, and an adaptive law is designed to estimate the unknown model parameters. To deal with the input saturation, an auxiliary variable system is utilized in the control law. The strict Lyapunov analysis guarantees that all signals of the system are semi-global uniformly ultimately bounded (SGUUB). Finally, the superiority of the proposed method is verified by comparative experiments.& COPY; 2022 ISA. Published by Elsevier Ltd. All rights reserved.
引用
收藏
页码:122 / 132
页数:11
相关论文
共 40 条
[1]   A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems [J].
Bhasin, S. ;
Kamalapurkar, R. ;
Johnson, M. ;
Vamvoudakis, K. G. ;
Lewis, F. L. ;
Dixon, W. E. .
AUTOMATICA, 2013, 49 (01) :82-92
[2]   Adaptive Neural Network Control of AUVs With Control Input Nonlinearities Using Reinforcement Learning [J].
Cui, Rongxin ;
Yang, Chenguang ;
Li, Yang ;
Sharma, Sanjay .
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2017, 47 (06) :1019-1029
[3]   Event-Triggered Adaptive Dynamic Programming for Continuous-Time Systems With Control Constraints [J].
Dong, Lu ;
Zhong, Xiangnan ;
Sun, Changyin ;
He, Haibo .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (08) :1941-1952
[4]   Adaptive neural control of uncertain MIMO nonlinear systems [J].
Ge, SS ;
Wang, C .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2004, 15 (03) :674-692
[5]   Performance assessment of control loops involving unstable systems for set point tracking and disturbance rejection [J].
Ghousiya, Begum K. ;
Seshagiri, Rao A. ;
Radhakrishnan, T. K. .
JOURNAL OF THE TAIWAN INSTITUTE OF CHEMICAL ENGINEERS, 2018, 85 :1-17
[6]   Integral Reinforcement Learning-Based Adaptive NN Control for Continuous-Time Nonlinear MIMO Systems With Unknown Control Directions [J].
Guo, Xinxin ;
Yan, Weisheng ;
Cui, Rongxin .
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 50 (11) :4068-4077
[7]   Adaptive-Critic Design for Decentralized Event-Triggered Control of Constrained Nonlinear Interconnected Systems Within an Identifier-Critic Framework [J].
Huo, Xin ;
Karimi, Hamid Reza ;
Zhao, Xudong ;
Wang, Bohui ;
Zong, Guangdeng .
IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (08) :7478-7491
[8]  
Lewis F., 2012, OPTIMAL CONTROL, V3rd, DOI [10.1002/9781118122631, DOI 10.1002/9781118122631]
[9]   Compensated model-free adaptive tracking control scheme for autonomous underwater vehicles via extended state observer [J].
Li, Xiaohan ;
Ren, Chao ;
Ma, Shugen ;
Zhu, Xinshan .
OCEAN ENGINEERING, 2020, 217
[10]   AUV Based Source Seeking with Estimated Gradients [J].
Li, Zhuo ;
You, Keyou ;
Song, Shiji .
JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY, 2018, 31 (01) :262-275