A Zero-Sum Game-Based Hybrid Iteration Reinforcement Learning Scheme to Optimal Control for Fuzzy Singularly Perturbed Systems

被引:0
作者
Dong, Jie [1 ]
Wang, Yun [1 ]
Su, Lei [1 ]
Shen, Hao [1 ]
机构
[1] Anhui Univ Technol, Sch Elect & Informat Engn, Maanshan 243032, Peoples R China
基金
中国国家自然科学基金;
关键词
Reinforcement learning; Hybrid iteration; Takagi-Sugeno fuzzy model; Singularly perturbed systems; Optimal control; ADAPTIVE OPTIMAL-CONTROL; ALGORITHM; DESIGN;
D O I
10.1007/s40815-024-01901-1
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes a model-free hybrid iteration algorithm to solve the zero-sum game for nonlinear singularly perturbed systems based on the Takagi-Sugeno fuzzy model. Firstly, the original nonlinear systems are approximated using fuzzy modeling, while the zero-sum game framework is employed to solve the optimal control problem of singularly perturbed systems. Therefore, the nonlinear optimal control problem is reduced to designing fuzzy control policies by solving game algebraic Riccati equations. Subsequently, a model-free hybrid iteration approach is developed that removes the requirement for system dynamics. Furthermore, the hybrid iteration method does not rely on initial stabilizing control policies and has a faster convergence rate than value iteration. Meanwhile, the convergence of the proposed algorithm is demonstrated through Lyapunov stability analysis. Finally, a numerical example is employed to illustrate the effectiveness of the designed algorithm.
引用
收藏
页数:11
相关论文
共 41 条
[1]   A Robust H∞ Non-PDC Design Scheme for Singularly Perturbed T-S Fuzzy Systems With Immeasurable State Variables [J].
Asemani, Mohammad Hassan ;
Majd, Vahid Johari .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2015, 23 (03) :525-541
[2]   A Residual Gradient Fuzzy Reinforcement Learning Algorithm for Differential Games [J].
Awheda, Mostafa D. ;
Schwartz, Howard M. .
INTERNATIONAL JOURNAL OF FUZZY SYSTEMS, 2017, 19 (04) :1058-1076
[3]   Control design with guaranteed cost for synchronization in networks of linear singularly perturbed systems [J].
Ben Rejeb, Jihene ;
Morarescu, Irinel-Constantin ;
Daafouz, Jamal .
AUTOMATICA, 2018, 91 :89-97
[4]   Value iteration and adaptive dynamic programming for data-driven adaptive optimal control design [J].
Bian, Tao ;
Jiang, Zhong-Ping .
AUTOMATICA, 2016, 71 :348-360
[5]   A NEW ANALYSIS OF ITERATIVE REFINEMENT AND ITS APPLICATION TO ACCURATE SOLUTION OF ILL-CONDITIONED SPARSE LINEAR SYSTEMS [J].
Carson, Erin ;
Higham, Nicholas J. .
SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2017, 39 (06) :A2834-A2856
[6]   Multi-constrained Fuzzy Control for Perturbed T-S Fuzzy Singular Systems by Proportional-Plus-Derivative State Feedback Method [J].
Chang, Wen-Jer ;
Lian, Kuang-Yow ;
Su, Che-Lun ;
Tsai, Ming-Hsuan .
INTERNATIONAL JOURNAL OF FUZZY SYSTEMS, 2021, 23 (07) :1972-1985
[7]   A Fuzzy Curiosity-Driven Mechanism for Multi-Agent Reinforcement Learning [J].
Chen, Wenbai ;
Shi, Haobin ;
Li, Jingchen ;
Hwang, Kao-Shing .
INTERNATIONAL JOURNAL OF FUZZY SYSTEMS, 2021, 23 (05) :1222-1233
[8]   Multi-rate sampled-data composite control of linear singularly perturbed systems [J].
Chen, Wu-Hua ;
He, Haohua ;
Lu, Xiaomei .
JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2020, 357 (04) :2028-2048
[9]   Output-Feedback Control for Fuzzy Singularly Perturbed Systems: A Nonhomogeneous Stochastic Communication Protocol Approach [J].
Cheng, Jun ;
Yan, Huaicheng ;
Park, Ju H. ;
Zong, Guangdeng .
IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (01) :76-87
[10]   Nonlinear Two-Time-Scale Stochastic Approximation: Convergence and Finite-Time Performance [J].
Doan, Thinh T. T. .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (08) :4695-4705