Fuzzy Control Based on Reinforcement Learning and Subsystem Error Derivatives for Strict-Feedback Systems With an Observer

被引:37
作者
Li, Dongdong [1 ,2 ,3 ]
Dong, Jiuxiang [1 ,2 ,3 ]
机构
[1] Northeastern Univ, Coll Informat Sci & Engn, Shenyang 110819, Peoples R China
[2] Northeastern Univ, Key Lab Vibrat & Control Aeroprop Syst, Minist Educ China, Shenyang 110819, Peoples R China
[3] Northeastern Univ, Key Lab Synthet Automation Proc Ind, Shenyang 110819, Peoples R China
基金
中国国家自然科学基金;
关键词
Adaptive dynamic programming (ADP); fuzzy adaptive control; fuzzy logic systems (FLSs); fuzzy state observer; optimized backstepping control (OBC); reinforcement learning (RL); TRACKING CONTROL;
D O I
10.1109/TFUZZ.2022.3227993
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this article, a novel optimized fuzzy adaptive control method based on tracking error derivatives of subsystems is proposed for strict-feedback systems with unmeasurable states. A cost function based on the tracking error derivative is used. It not only solves the problem that the traditional input quadratic cost function at the infinite time is unbounded, but also solves the problem that the optimal control input derived from the cost function with exponential discount factor cannot make the error asymptotically stable. Considering the case where the states are unmeasurable, a fuzzy state observer is designed that removes the restriction of the Hurwitz equation for the gain parameters. Based on reinforcement learning, the observer, and error derivative cost function, an improved optimized backstepping control method is given. Using observed information and actor-critic structure to train fuzzy logic systems online, the control inputs are obtained to achieve approximate optimal control. Finally, all closed-loop signals are proved to be bounded by the Lyapunov method, and the effectiveness and advantages of the proposed algorithm are verified through two examples.
引用
收藏
页码:2509 / 2521
页数:13
相关论文
共 44 条
  • [1] Optimal Transmission Power Scheduling of Networked Control Systems Via Fuzzy Adaptive Dynamic Programming
    An, Liwei
    Yang, Guang-Hong
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2021, 29 (06) : 1629 - 1639
  • [2] DYNAMIC PROGRAMMING
    BELLMAN, R
    [J]. SCIENCE, 1966, 153 (3731) : 34 - &
  • [3] Global Event-Triggered Output Feedback Stabilization of a Class of Nonlinear Systems
    Cheng, Yingying
    Zhang, Jun
    Du, Haibo
    Wen, Guanghui
    Lin, Xiangze
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2021, 51 (07): : 4040 - 4047
  • [4] Adaptive Neural Network-Based Finite-Time Online Optimal Tracking Control of the Nonlinear System With Dead Zone
    Ding, Liang
    Li, Shu
    Gao, Haibo
    Liu, Yan-Jun
    Huang, Lan
    Deng, Zongquan
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (01) : 382 - 392
  • [5] A survey on analysis and design of model-based fuzzy control systems
    Feng, Gang
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2006, 14 (05) : 676 - 697
  • [6] Approximate Optimal Tracking Control of Nondifferentiable Signals for a Class of Continuous-Time Nonlinear Systems
    Fu, Yue
    Hong, Chengwen
    Fu, Jun
    Chai, Tianyou
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (06) : 4441 - 4450
  • [7] Output-feedback adaptive optimal control of interconnected systems based on robust adaptive dynamic programming
    Gao, Weinan
    Jiang, Yu
    Jiang, Zhong-Ping
    Chai, Tianyou
    [J]. AUTOMATICA, 2016, 72 : 37 - 45
  • [8] Robust Neurooptimal Control for a Robot via Adaptive Dynamic Programming
    Kong, Linghuan
    He, Wei
    Yang, Chenguang
    Sun, Changyin
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (06) : 2584 - 2594
  • [9] Lewis F. L., 2012, OPTIMAL CONTROL, DOI 10.1002/9781118122631
  • [10] A novel adaptive dynamic programming based on tracking error for nonlinear discrete-time systems
    Li, Chun
    Ding, Jinliang
    Lewis, Frank L.
    Chai, Tianyou
    [J]. AUTOMATICA, 2021, 129