Suboptimal reduced control of unknown nonlinear singularly perturbed systems via reinforcement learning

被引:6
|
作者
Liu, Xiaomin [1 ,2 ]
Yang, Chunyu [1 ,2 ]
Zhou, Linna [1 ,2 ]
Fu, Jun [3 ]
Dai, Wei [1 ,2 ]
机构
[1] China Univ Min & Technol, Engn Res Ctr Intelligent Control Underground Spac, Minist Educ, Xuzhou 221116, Jiangsu, Peoples R China
[2] China Univ Min & Technol, Sch Informat & Control Engn, Xuzhou 221116, Jiangsu, Peoples R China
[3] Northeastern Univ, State Key Lab Synthet Automat Proc Ind, Shenyang, Peoples R China
基金
中国国家自然科学基金;
关键词
neural networks; nonlinear singularly perturbed systems; reinforcement learning; suboptimal reduced control; ADAPTIVE OPTIMAL-CONTROL; ITERATION; EQUATION;
D O I
10.1002/rnc.5624
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, a suboptimal reduced control method is proposed for a class of nonlinear singularly perturbed systems (SPSs) with unknown dynamics. By using singular perturbation theory, the original system is reduced to a reduced system, by which a policy iterative method is proposed to solve the corresponding reduced Hamilton-Jacobi-Bellman (HJB) equation with convergence guaranteed. A reinforcement learning (RL) algorithm is proposed to implement the policy iterative method without using any knowledge of the system dynamics. In the RL algorithm, the unmeasurable state of the virtual reduced system is reconstructed by the slow state measurements of the original system, the controller and cost function are approximated by actor-critic neural networks (NNs) and the method of weighted residuals is utilized to update the NN weights. The influence introduced by state reconstruction error and NN function approximation on the convergence, suboptimality of the reduced controller and stability of the closed-loop SPSs are rigorously analyzed. Finally, the effectiveness of our proposed method is illustrated by examples.
引用
收藏
页码:6626 / 6645
页数:20
相关论文
共 50 条
  • [1] Reinforcement learning-based composite suboptimal control for Markov jump singularly perturbed systems with unknown dynamics
    Li, Wenqian
    Jia, Guolong
    Wang, Yun
    Su, Lei
    Shen, Hao
    MATHEMATICAL METHODS IN THE APPLIED SCIENCES, 2024, 47 (14) : 11551 - 11564
  • [2] Adaptive composite suboptimal control for linear singularly perturbed systems with unknown slow dynamics
    Yang, Chunyu
    Zhong, Shanshan
    Liu, Xiaomin
    Dai, Wei
    Zhou, Linna
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2020, 30 (07) : 2625 - 2643
  • [3] SUBOPTIMAL CONTROL OF SINGULARLY PERTURBED SYSTEMS AND PERIODIC OPTIMIZATION
    GAITSGORY, V
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1993, 38 (06) : 888 - 903
  • [4] On Model-Free Reinforcement Learning of Reduced-order Optimal Control for Singularly Perturbed Systems
    Mukherjee, Sayak
    Bai, He
    Chakrabortty, Aranya
    2018 IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2018, : 5288 - 5293
  • [5] On Robust Model-Free Reduced-Dimensional Reinforcement Learning Control for Singularly Perturbed Systems
    Mukherjee, Sayak
    Bai, He
    Chakrabortty, Aranya
    2020 AMERICAN CONTROL CONFERENCE (ACC), 2020, : 3914 - 3919
  • [6] Reinforcement Learning Based Optimal Control of Linear Singularly Perturbed Systems
    Zhao, Jianguo
    Yang, Chunyu
    Gao, Weinan
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2022, 69 (03) : 1362 - 1366
  • [7] Time-suboptimal control design of singularly perturbed systems by reduced order feedback design
    Mikhailov, SA
    Muller, PC
    MATHEMATICS AND COMPUTERS IN SIMULATION, 1998, 46 (5-6) : 593 - 600
  • [8] Time-suboptimal control design of singularly perturbed systems by reduced order feedback design
    University of Wuppertal, Gauss-Str. 20, D-42097 Wuppertal, Germany
    Math Comput Simul, 5-6 (593-600):
  • [9] Composite control of nonlinear singularly perturbed systems via approximate feedback linearization
    Aleksey Kabanov
    Vasiliy Alchakov
    International Journal of Automation and Computing, 2020, 17 : 610 - 620
  • [10] Composite Control of Nonlinear Singularly Perturbed Systems via Approximate Feedback Linearization
    Aleksey Kabanov
    Vasiliy Alchakov
    International Journal of Automation and Computing, 2020, 17 (04) : 610 - 620