Reinforcement Learning-Based Constrained Optimal Control of Strict-feedback Nonlinear Systems: Application to Autonomous Underwater Vehicles

被引:0
|
作者
Farzanegan, Behzad [1 ]
Jagannathan, S. [1 ]
机构
[1] Missouri Univ Sci & Technol, Dept Elec & Comp Engn, Rolla, MO 65409 USA
来源
2024 IEEE CONFERENCE ON CONTROL TECHNOLOGY AND APPLICATIONS, CCTA 2024 | 2024年
关键词
Autonomous vehicles; Lifelong learning; Optimal control; Control barrier function; Reinforcement learning;
D O I
10.1109/CCTA60707.2024.10666630
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper addresses a constrained neural network (NN)-based optimal tracking scheme for a class of uncertain nonlinear discrete-time systems in strict-feedback form by using a control barrier function (CBF). First, a modified barriertype cost function is introduced for each subsystem, guiding the actual system trajectory toward the safe set or desired trajectory while avoiding unwanted sets. To address the tracking problem, an augmented system is employed to convert the time-varying optimal tracking to a time-invariant optimal regulation. Then, an actor-critic framework is employed with the backstepping technique to obtain both virtual and actual optimal control policies for each subsystem to avoid the noncausality problem. Additionally, a novel online regularizer method is introduced to reduce catastrophic forgetting in multitasking scenarios by maintaining the significance of weight connections in the critic NN without directly computing the Fisher information matrix (FIM). Further, to guarantee safety during online learning, the actor update law incorporates the safety condition through the utilization of the CBF. Simulation results using underwater vehicles are carried out to verify the effectiveness of the proposed approach.
引用
收藏
页码:651 / 656
页数:6
相关论文
共 50 条
  • [1] Learning-Based Adaptive Optimal Tracking Control of Strict-Feedback Nonlinear Systems
    Gao, Weinan
    Jiang, Zhong-Ping
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (06) : 2614 - 2624
  • [2] Reinforcement learning-based adaptive predefined-time optimal tracking control for strict-feedback nonlinear systems
    Chen, Yilin
    Pan, Yingnan
    Lu, Qing
    INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2024, 38 (02) : 492 - 512
  • [3] Reinforcement learning-based optimized backstepping control for strict-feedback nonlinear systems subject to external disturbances
    Qin, Yan
    Cao, Liang
    Lu, Qing
    Pan, Yingnan
    OPTIMAL CONTROL APPLICATIONS & METHODS, 2023, 44 (05) : 2724 - 2743
  • [4] Reinforcement learning-based optimized output feedback control of nonlinear strict-feedback systems with event sampled states
    Xin, Chun
    Li, Yuan-Xin
    INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2023, 37 (01) : 38 - 58
  • [5] Adaptive reinforcement learning optimal tracking control for strict-feedback nonlinear systems with prescribed performance
    Huang, Zongsheng
    Bai, Weiwei
    Li, Tieshan
    Long, Yue
    Chen, C. L. Philip
    Liang, Hongjing
    Yang, Hanqing
    INFORMATION SCIENCES, 2023, 621 : 407 - 423
  • [6] Distributed Fuzzy Optimal Consensus Control of State-Constrained Nonlinear Strict-Feedback Systems
    Wang, Wei
    Li, Yongming
    IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (05) : 2914 - 2929
  • [7] Reinforcement learning-based optimal control of uncertain nonlinear systems
    Garcia, Miguel
    Dong, Wenjie
    INTERNATIONAL JOURNAL OF CONTROL, 2024, 97 (12) : 2839 - 2850
  • [8] Reinforcement learning-based adaptive optimal output feedback control for nonlinear systems with output quantization
    Jin, Yitong
    Wang, Fang
    Lai, Guanyu
    Zhang, Xueyi
    NONLINEAR DYNAMICS, 2024, : 7029 - 7045
  • [9] Reinforcement Learning-Based Formation Control of Autonomous Underwater Vehicles with Model Interferences
    Cao, Wenqiang
    Yan, Jing
    Yang, Xian
    Luo, Xiaoyuan
    2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 4020 - 4025
  • [10] Lifelong reinforcement learning tracking control of nonlinear strict-feedback systems using multilayer neural networks with constraints
    Ganie, Irfan
    Jagannathan, S.
    NEUROCOMPUTING, 2024, 600