Suboptimal reduced control of unknown nonlinear singularly perturbed systems via reinforcement learning

被引：6

作者：

Liu, Xiaomin ^{[1
,2
]}

Yang, Chunyu ^{[1
,2
]}

Zhou, Linna ^{[1
,2
]}

Fu, Jun ^{[3
]}

Dai, Wei ^{[1
,2
]}

机构：

[1] China Univ Min & Technol, Engn Res Ctr Intelligent Control Underground Spac, Minist Educ, Xuzhou 221116, Jiangsu, Peoples R China

[2] China Univ Min & Technol, Sch Informat & Control Engn, Xuzhou 221116, Jiangsu, Peoples R China

[3] Northeastern Univ, State Key Lab Synthet Automat Proc Ind, Shenyang, Peoples R China

来源：

INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL | 2021年 / 31卷 / 14期

基金：

中国国家自然科学基金;

关键词：

neural networks; nonlinear singularly perturbed systems; reinforcement learning; suboptimal reduced control; ADAPTIVE OPTIMAL-CONTROL; ITERATION; EQUATION;

D O I：

10.1002/rnc.5624

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, a suboptimal reduced control method is proposed for a class of nonlinear singularly perturbed systems (SPSs) with unknown dynamics. By using singular perturbation theory, the original system is reduced to a reduced system, by which a policy iterative method is proposed to solve the corresponding reduced Hamilton-Jacobi-Bellman (HJB) equation with convergence guaranteed. A reinforcement learning (RL) algorithm is proposed to implement the policy iterative method without using any knowledge of the system dynamics. In the RL algorithm, the unmeasurable state of the virtual reduced system is reconstructed by the slow state measurements of the original system, the controller and cost function are approximated by actor-critic neural networks (NNs) and the method of weighted residuals is utilized to update the NN weights. The influence introduced by state reconstruction error and NN function approximation on the convergence, suboptimality of the reduced controller and stability of the closed-loop SPSs are rigorously analyzed. Finally, the effectiveness of our proposed method is illustrated by examples.

引用

页码：6626 / 6645

页数：20

共 50 条

[1] Reinforcement learning-based composite suboptimal control for Markov jump singularly perturbed systems with unknown dynamics
Li, Wenqian
Jia, Guolong
Wang, Yun
Su, Lei
Shen, Hao
MATHEMATICAL METHODS IN THE APPLIED SCIENCES, 2024, 47 (14) : 11551 - 11564
[2] Adaptive composite suboptimal control for linear singularly perturbed systems with unknown slow dynamics
Yang, Chunyu
Zhong, Shanshan
Liu, Xiaomin
Dai, Wei
Zhou, Linna
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2020, 30 (07) : 2625 - 2643
[3] SUBOPTIMAL CONTROL OF SINGULARLY PERTURBED SYSTEMS AND PERIODIC OPTIMIZATION
GAITSGORY, V
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1993, 38 (06) : 888 - 903
[4] On Model-Free Reinforcement Learning of Reduced-order Optimal Control for Singularly Perturbed Systems
Mukherjee, Sayak
Bai, He
Chakrabortty, Aranya
2018 IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2018, : 5288 - 5293
[5] On Robust Model-Free Reduced-Dimensional Reinforcement Learning Control for Singularly Perturbed Systems
Mukherjee, Sayak
Bai, He
Chakrabortty, Aranya
2020 AMERICAN CONTROL CONFERENCE (ACC), 2020, : 3914 - 3919
[6] Reinforcement Learning Based Optimal Control of Linear Singularly Perturbed Systems
Zhao, Jianguo
Yang, Chunyu
Gao, Weinan
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2022, 69 (03) : 1362 - 1366
[7] Time-suboptimal control design of singularly perturbed systems by reduced order feedback design
Mikhailov, SA
Muller, PC
MATHEMATICS AND COMPUTERS IN SIMULATION, 1998, 46 (5-6) : 593 - 600
[8] Time-suboptimal control design of singularly perturbed systems by reduced order feedback design
University of Wuppertal, Gauss-Str. 20, D-42097 Wuppertal, Germany
Math Comput Simul, 5-6 (593-600):
[9] Composite control of nonlinear singularly perturbed systems via approximate feedback linearization
Aleksey Kabanov
Vasiliy Alchakov
International Journal of Automation and Computing, 2020, 17 : 610 - 620
[10] Composite Control of Nonlinear Singularly Perturbed Systems via Approximate Feedback Linearization
Aleksey Kabanov
Vasiliy Alchakov
International Journal of Automation and Computing, 2020, 17 (04) : 610 - 620

← 1 2 3 4 5 →