Reinforcement learning-based adaptive optimal output feedback control for nonlinear systems with output quantization

被引：0

作者：

Jin, Yitong ^{[1
]}

Wang, Fang ^{[1
]}

Lai, Guanyu ^{[2
]}

Zhang, Xueyi ^{[3
]}

机构：

[1] Shandong Univ Sci & Technol, Coll Math & Syst Sci, Qingdao 266590, Shandong, Peoples R China

[2] Guangdong Univ Technol, Sch Automat, Guangzhou 510006, Guangdong, Peoples R China

[3] Shandong Univ Sci & Technol, Coll Foreign Languages, Qingdao 266590, Shandong, Peoples R China

来源：

NONLINEAR DYNAMICS | 2024年

基金：

中国国家自然科学基金;

关键词：

Reinforcement learning; Optimal control; Output quantization; Identifier-critic-actor architecture; Adaptive fuzzy control; TRACKING CONTROL; UNCERTAIN SYSTEMS;

D O I：

10.1007/s11071-024-10504-2

中图分类号：

TH [机械、仪表工业];

学科分类号：

0802 ;

摘要：

In this research, a novel adaptive optimal control approach is proposed for nonlinear systems under output quantization. In order to achieve the optimized control, the reinforcement learning algorithm of the identifier-actor-critic architecture is implemented based on fuzzy logic systems. The identifier, critic, and actor are used for estimating unknown dynamics, assessing system performance, and carrying out control actions, respectively. Firstly, the updating laws of critics and actors are derived by using the negative gradient of a simple positive function generated by the partial derivatives of the Hamilton Jacobi Bellman equation. At the same time, the design has the ability to eliminate the persistence excitation that is necessary for the majority of current optimal controls. Secondly, the command filtering technique is employed to avoid direct differentiation of virtual control signals. This is necessary because the virtual control signals become discontinuous and non-differentiable under output quantization. Thirdly, the boundedness of the quantization errors is illustrated in Lemma 3 by establishing the relationships between the quantized signals and the unquantized signals. Based on this lemma, it is ensured that all signals in the closed-loop system are semi-globally uniformly ultimately bounded (SGUUB). Finally, the proposed method's effectiveness is validated through two simulations.

引用

页码：7029 / 7045

页数：17

共 33 条

[1] Improving the performance of stabilizing controls for nonlinear systems
Bear, R
Saridis, G
Wen, J
[J]. IEEE CONTROL SYSTEMS MAGAZINE, 1996, 16 (05): : 27 - 35
[2] A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems
Bhasin, S.
Kamalapurkar, R.
Johnson, M.
Vamvoudakis, K. G.
Lewis, F. L.
Dixon, W. E.
[J]. AUTOMATICA, 2013, 49 (01) : 82 - 92
[3] Input and Output Quantized Feedback Linear Systems
Coutinho, Daniel F.
Fu, Minyue
de Souza, Carlos E.
[J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2010, 55 (03) : 761 - 766
[4] The weighted logarithmic matrix norm and bounds of the matrix exponential
Hu, GD
Liu, MZ
[J]. LINEAR ALGEBRA AND ITS APPLICATIONS, 2004, 390 : 145 - 154
[5] Time-Varying Optimal Formation Control for Second-Order Multiagent Systems Based on Neural Network Observer and Reinforcement Learning
Lan, Jie
Liu, Yan-Jun
Yu, Dengxiu
Wen, Guoxing
Tong, Shaocheng
Liu, Lei
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (03) : 3144 - 3155
[6] Fuzzy Adaptive Optimized Leader-Following Formation Control for Second-Order Stochastic Multiagent Systems
Li, Yongming
Zhang, Jiaxin
Tong, Shaocheng
[J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2022, 18 (09) : 6026 - 6037
[7] Supervisory control of uncertain systems with quantized information
Linh Vu
Liberzon, Daniel
[J]. INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2012, 26 (08) : 739 - 756
[8] Predefined-time backstepping control for a nonlinear strict-feedback system
Liu, Bojun
Hou, Mingshan
Wu, Cihang
Wang, Wencong
Wu, Zhonghua
Huang, Bing
[J]. INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2021, 31 (08) : 3354 - 3372
[9] Dynamic quantization of uncertain linear networked control systems
Liu, Kun
Fridman, Emilia
Johansson, Karl Henrik
[J]. AUTOMATICA, 2015, 59 : 248 - 255
[10] A sector bound approach to feedback control of nonlinear systems with state quantization
Liu, Tengfei
Jiang, Zhong-Ping
Hill, David J.
[J]. AUTOMATICA, 2012, 48 (01) : 145 - 152

← 1 2 3 4 →