Fuzzy Control Based on Reinforcement Learning and Subsystem Error Derivatives for Strict-Feedback Systems With an Observer

被引:37
作者
Li, Dongdong [1 ,2 ,3 ]
Dong, Jiuxiang [1 ,2 ,3 ]
机构
[1] Northeastern Univ, Coll Informat Sci & Engn, Shenyang 110819, Peoples R China
[2] Northeastern Univ, Key Lab Vibrat & Control Aeroprop Syst, Minist Educ China, Shenyang 110819, Peoples R China
[3] Northeastern Univ, Key Lab Synthet Automation Proc Ind, Shenyang 110819, Peoples R China
基金
中国国家自然科学基金;
关键词
Adaptive dynamic programming (ADP); fuzzy adaptive control; fuzzy logic systems (FLSs); fuzzy state observer; optimized backstepping control (OBC); reinforcement learning (RL); TRACKING CONTROL;
D O I
10.1109/TFUZZ.2022.3227993
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this article, a novel optimized fuzzy adaptive control method based on tracking error derivatives of subsystems is proposed for strict-feedback systems with unmeasurable states. A cost function based on the tracking error derivative is used. It not only solves the problem that the traditional input quadratic cost function at the infinite time is unbounded, but also solves the problem that the optimal control input derived from the cost function with exponential discount factor cannot make the error asymptotically stable. Considering the case where the states are unmeasurable, a fuzzy state observer is designed that removes the restriction of the Hurwitz equation for the gain parameters. Based on reinforcement learning, the observer, and error derivative cost function, an improved optimized backstepping control method is given. Using observed information and actor-critic structure to train fuzzy logic systems online, the control inputs are obtained to achieve approximate optimal control. Finally, all closed-loop signals are proved to be bounded by the Lyapunov method, and the effectiveness and advantages of the proposed algorithm are verified through two examples.
引用
收藏
页码:2509 / 2521
页数:13
相关论文
共 44 条
  • [21] Event-Triggered Optimal Control With Performance Guarantees Using Adaptive Dynamic Programming
    Luo, Biao
    Yang, Yin
    Liu, Derong
    Wu, Huai-Ning
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (01) : 76 - 88
  • [22] H∞ Tracking Control of Completely Unknown Continuous-Time Systems via Off-Policy Reinforcement Learning
    Modares, Hamidreza
    Lewis, Frank L.
    Jiang, Zhong-Ping
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (10) : 2550 - 2562
  • [23] Optimal tracking control of nonlinear partially-unknown constrained-input systems using integral reinforcement learning
    Modares, Hamidreza
    Lewis, Frank L.
    [J]. AUTOMATICA, 2014, 50 (07) : 1780 - 1792
  • [24] Optimal Tracking Control of Nonlinear Multiagent Systems Using Internal Reinforce Q-Learning
    Peng, Zhinan
    Luo, Rui
    Hu, Jiangping
    Shi, Kaibo
    Nguang, Sing Kiong
    Ghosh, Bijoy Kumar
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (08) : 4043 - 4055
  • [25] Adaptive Fuzzy Output-Feedback Control for Nonaffine MIMO Nonlinear Systems With Prescribed Performance
    Shi, Wuxi
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2021, 29 (05) : 1107 - 1120
  • [26] Adaptive Fuzzy Control of Spacecraft Proximity Operations Using Hierarchical Fuzzy Systems
    Sun, Liang
    Huo, Wei
    [J]. IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2016, 21 (03) : 1629 - 1640
  • [27] Observer-Based Adaptive Fuzzy Backstepping Dynamic Surface Control for a Class of MIMO Nonlinear Systems
    Tong, Shao-Cheng
    Li, Yong-Ming
    Feng, Gang
    Li, Tie-Shan
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2011, 41 (04): : 1124 - 1135
  • [28] Observer-Based Adaptive Fuzzy Tracking Control for Strict-Feedback Nonlinear Systems With Unknown Control Gain Functions
    Tong, Shaocheng
    Min, Xiao
    Li, Yuanxin
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (09) : 3903 - 3913
  • [29] An Approximate Neuro-Optimal Solution of Discounted Guaranteed Cost Control Design
    Wang, Ding
    Qiao, Junfei
    Cheng, Long
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (01) : 77 - 86
  • [30] Observer-Based Fuzzy Adaptive Output-Feedback Control of Stochastic Nonlinear Multiple Time-Delay Systems
    Wang, Huanqing
    Liu, Peter Xiaoping
    Shi, Peng
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (09) : 2568 - 2578