Reinforcement learning-based adaptive optimal output feedback control for nonlinear systems with output quantization

被引:0
作者
Jin, Yitong [1 ]
Wang, Fang [1 ]
Lai, Guanyu [2 ]
Zhang, Xueyi [3 ]
机构
[1] Shandong Univ Sci & Technol, Coll Math & Syst Sci, Qingdao 266590, Shandong, Peoples R China
[2] Guangdong Univ Technol, Sch Automat, Guangzhou 510006, Guangdong, Peoples R China
[3] Shandong Univ Sci & Technol, Coll Foreign Languages, Qingdao 266590, Shandong, Peoples R China
基金
中国国家自然科学基金;
关键词
Reinforcement learning; Optimal control; Output quantization; Identifier-critic-actor architecture; Adaptive fuzzy control; TRACKING CONTROL; UNCERTAIN SYSTEMS;
D O I
10.1007/s11071-024-10504-2
中图分类号
TH [机械、仪表工业];
学科分类号
0802 ;
摘要
In this research, a novel adaptive optimal control approach is proposed for nonlinear systems under output quantization. In order to achieve the optimized control, the reinforcement learning algorithm of the identifier-actor-critic architecture is implemented based on fuzzy logic systems. The identifier, critic, and actor are used for estimating unknown dynamics, assessing system performance, and carrying out control actions, respectively. Firstly, the updating laws of critics and actors are derived by using the negative gradient of a simple positive function generated by the partial derivatives of the Hamilton Jacobi Bellman equation. At the same time, the design has the ability to eliminate the persistence excitation that is necessary for the majority of current optimal controls. Secondly, the command filtering technique is employed to avoid direct differentiation of virtual control signals. This is necessary because the virtual control signals become discontinuous and non-differentiable under output quantization. Thirdly, the boundedness of the quantization errors is illustrated in Lemma 3 by establishing the relationships between the quantized signals and the unquantized signals. Based on this lemma, it is ensured that all signals in the closed-loop system are semi-globally uniformly ultimately bounded (SGUUB). Finally, the proposed method's effectiveness is validated through two simulations.
引用
收藏
页码:7029 / 7045
页数:17
相关论文
共 33 条
  • [1] Improving the performance of stabilizing controls for nonlinear systems
    Bear, R
    Saridis, G
    Wen, J
    [J]. IEEE CONTROL SYSTEMS MAGAZINE, 1996, 16 (05): : 27 - 35
  • [2] A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems
    Bhasin, S.
    Kamalapurkar, R.
    Johnson, M.
    Vamvoudakis, K. G.
    Lewis, F. L.
    Dixon, W. E.
    [J]. AUTOMATICA, 2013, 49 (01) : 82 - 92
  • [3] Input and Output Quantized Feedback Linear Systems
    Coutinho, Daniel F.
    Fu, Minyue
    de Souza, Carlos E.
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2010, 55 (03) : 761 - 766
  • [4] The weighted logarithmic matrix norm and bounds of the matrix exponential
    Hu, GD
    Liu, MZ
    [J]. LINEAR ALGEBRA AND ITS APPLICATIONS, 2004, 390 : 145 - 154
  • [5] Time-Varying Optimal Formation Control for Second-Order Multiagent Systems Based on Neural Network Observer and Reinforcement Learning
    Lan, Jie
    Liu, Yan-Jun
    Yu, Dengxiu
    Wen, Guoxing
    Tong, Shaocheng
    Liu, Lei
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (03) : 3144 - 3155
  • [6] Fuzzy Adaptive Optimized Leader-Following Formation Control for Second-Order Stochastic Multiagent Systems
    Li, Yongming
    Zhang, Jiaxin
    Tong, Shaocheng
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2022, 18 (09) : 6026 - 6037
  • [7] Supervisory control of uncertain systems with quantized information
    Linh Vu
    Liberzon, Daniel
    [J]. INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2012, 26 (08) : 739 - 756
  • [8] Predefined-time backstepping control for a nonlinear strict-feedback system
    Liu, Bojun
    Hou, Mingshan
    Wu, Cihang
    Wang, Wencong
    Wu, Zhonghua
    Huang, Bing
    [J]. INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2021, 31 (08) : 3354 - 3372
  • [9] Dynamic quantization of uncertain linear networked control systems
    Liu, Kun
    Fridman, Emilia
    Johansson, Karl Henrik
    [J]. AUTOMATICA, 2015, 59 : 248 - 255
  • [10] A sector bound approach to feedback control of nonlinear systems with state quantization
    Liu, Tengfei
    Jiang, Zhong-Ping
    Hill, David J.
    [J]. AUTOMATICA, 2012, 48 (01) : 145 - 152