Reinforcement Learning-Based Near Optimization for Continuous-Time Markov Jump Singularly Perturbed Systems

被引:12
作者
Wang, Jing [1 ,2 ]
Peng, Chuanjun [1 ,2 ]
Park, Ju H. [3 ]
Shen, Hao [1 ,2 ]
Shi, Kaibo [4 ]
机构
[1] Anhui Univ Technol, Anhui Prov Key Lab Special Heavy Load Robot, Maanshan 243032, Peoples R China
[2] Anhui Univ Technol, Sch Elect & Informat Engn, Maanshan 243032, Peoples R China
[3] Yeungnam Univ, Dept Elect Engn, Gyongsan 38541, South Korea
[4] Chengdu Univ, Sch Informat Sci & Engn, Chengdu 610106, Peoples R China
基金
新加坡国家研究基金会; 中国国家自然科学基金;
关键词
Markov processes; Switches; Performance analysis; Optimal control; Integrated circuit modeling; Riccati equations; Reinforcement learning; Fast and slow decomposition technique; markov jump systems; reinforcement learning; singularly perturbed systems; ADAPTIVE OPTIMAL-CONTROL; LINEAR-SYSTEMS; STABILIZATION;
D O I
10.1109/TCSII.2022.3233790
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The design of a suboptimal controller for continuous-time Markov jump singularly perturbed systems with partially unknown dynamics is studied in this brief. With fast and slow decomposition technique, the original Markov jump singularly perturbed systems are decomposed into fast and slow subsystems as a new attempt. On this basis, an offline parallel Kleinman algorithm and an online parallel integral reinforcement learning algorithm are presented to cope with the different subsystems, respectively. Meanwhile, the controllers obtained by the above two algorithms are used to design the suboptimal controllers for original systems. Furthermore, the suboptimality of the proposed controllers is also discussed. Finally, an example of the electric circuit model is shown to illustrate the applicability of the proposed method.
引用
收藏
页码:2026 / 2030
页数:5
相关论文
共 18 条
[1]   PARALLEL ALGORITHMS FOR OPTIMAL-CONTROL OF WEAKLY COUPLED AND SINGULARLY PERTURBED JUMP LINEAR-SYSTEMS [J].
BORNO, I ;
GAJIC, Z .
AUTOMATICA, 1995, 31 (07) :985-988
[2]   DECOMPOSITION OF NEAR-OPTIMUM REGULATORS FOR SYSTEMS WITH SLOW AND FAST MODES [J].
CHOW, JH ;
KOKOTOVIC, PV .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1976, 21 (05) :701-705
[3]   Finite-Time Time-Variant Feedback Stabilization of Logical Control Networks With Markov Jump Disturbances [J].
Ding, Xueying ;
Li, Haitao .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2020, 67 (10) :2079-2083
[4]   Reinforcement learning and adaptive optimization of a class of Markov jump systems with completely unknown dynamic information [J].
He, Shuping ;
Zhang, Maoguang ;
Fang, Haiyang ;
Liu, Fei ;
Luan, Xiaoli ;
Ding, Zhengtao .
NEURAL COMPUTING & APPLICATIONS, 2020, 32 (18) :14311-14320
[5]   Online adaptive optimal control for continuous-time Markov jump linear systems using a novel policy iteration algorithm [J].
He, Shuping ;
Song, Jun ;
Ding, Zhengtao ;
Liu, Fei .
IET CONTROL THEORY AND APPLICATIONS, 2015, 9 (10) :1536-1543
[6]   Computational adaptive optimal control for continuous-time linear systems with completely unknown dynamics [J].
Jiang, Yu ;
Jiang, Zhong-Ping .
AUTOMATICA, 2012, 48 (10) :2699-2704
[7]   Stabilization of singularly perturbed fuzzy systems [J].
Li, THS ;
Lin, KJ .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2004, 12 (05) :579-595
[8]   Secure Consensus of Multiagent Systems via Impulsive Control Subject to Deception Attacks [J].
Liu, Zhi-Wei ;
Shi, Yu-Lun ;
Yan, Huaicheng ;
Han, Bing-Xin ;
Guan, Zhi-Hong .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2023, 70 (01) :166-170
[9]   Optimal tracking control of nonlinear partially-unknown constrained-input systems using integral reinforcement learning [J].
Modares, Hamidreza ;
Lewis, Frank L. .
AUTOMATICA, 2014, 50 (07) :1780-1792
[10]   Non-Fragile $H_{∞ }$ Synchronization for Markov Jump Singularly Perturbed Coupled Neural Networks Subject to Double-Layer Switching Regulation [J].
Shen, Hao ;
Hu, Xiaohui ;
Wang, Jing ;
Cao, Jinde ;
Qian, Wenhua .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (05) :2682-2692