Reinforcement learning and adaptive optimization of a class of Markov jump systems with completely unknown dynamic information

被引:0
|
作者
Shuping He
Maoguang Zhang
Haiyang Fang
Fei Liu
Xiaoli Luan
Zhengtao Ding
机构
[1] Anhui University,School of Electrical Engineering and Automation
[2] Anhui University,Institute of Physical Science and Information Technology
[3] Jiangnan University,Key Laboratory of Advanced Process Control for Light Industry (Ministry of Education), Institute of Automation
[4] The University of Manchester,School of Electrical and Electronic Engineering
来源
Neural Computing and Applications | 2020年 / 32卷
关键词
Markov jump linear systems (MJLSs); Adaptive optimal control; Online; Reinforcement learning (RL); Coupled algebraic Riccati equations (AREs);
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, an online adaptive optimal control problem of a class of continuous-time Markov jump linear systems (MJLSs) is investigated by using a parallel reinforcement learning (RL) algorithm with completely unknown dynamics. Before collecting and learning the subsystems information of states and inputs, the exploration noise is firstly added to describe the actual control input. Then, a novel parallel RL algorithm is used to parallelly compute the corresponding N coupled algebraic Riccati equations by online learning. By this algorithm, we will not need to know the dynamic information of the MJLSs. The convergence of the proposed algorithm is also proved. Finally, the effectiveness and applicability of this novel algorithm is illustrated by two simulation examples.
引用
收藏
页码:14311 / 14320
页数:9
相关论文
共 50 条
  • [41] Robust Output Regulation and Reinforcement Learning-Based Output Tracking Design for Unknown Linear Discrete-Time Systems
    Chen, Ci
    Xie, Lihua
    Jiang, Yi
    Xie, Kan
    Xie, Shengli
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (04) : 2391 - 2398
  • [42] Optimal Control of Stochastic Markovian Jump Systems With Wiener and Poisson Noises: Two Reinforcement Learning Approaches
    Yan, Zhiguo
    Sun, Tingkun
    Hu, Guolin
    IEEE Transactions on Artificial Intelligence, 2024, 5 (12): : 6591 - 6600
  • [43] A train trajectory optimization method based on the safety reinforcement learning with a relaxed dynamic reward
    Cheng, Ligang
    Cao, Jie
    Yang, Xiaofeng
    Wang, Wenxian
    Zhou, Zijian
    DISCOVER APPLIED SCIENCES, 2024, 6 (09)
  • [44] A Dynamic Resource Allocation Strategy with Reinforcement Learning for Multimodal Multi-objective Optimization
    Qian-Long Dang
    Wei Xu
    Yang-Fei Yuan
    Machine Intelligence Research, 2022, 19 : 138 - 152
  • [45] A Dynamic Resource Allocation Strategy with Reinforcement Learning for Multimodal Multi-objective Optimization
    Dang, Qian-Long
    Xu, Wei
    Yuan, Yang-Fei
    MACHINE INTELLIGENCE RESEARCH, 2022, 19 (02) : 138 - 152
  • [46] Adaptive Neural Network Optimized Control Using Reinforcement Learning of Critic-Actor Architecture for a Class of Non-Affine Nonlinear Systems
    Yang, Xue
    Li, Bin
    Wen, Guoxing
    IEEE ACCESS, 2021, 9 : 141758 - 141765
  • [47] H∞ Control for Interconnected Systems With Unknown System Dynamics: A Two-Stage Reinforcement Learning Method
    Liu, Jinxu
    Shen, Hao
    Wang, Jing
    Cao, Jinde
    Rutkowski, Leszek
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2025, 22 : 6388 - 6397
  • [48] Event-Triggered Optimal Neuro-Controller Design With Reinforcement Learning for Unknown Nonlinear Systems
    Yang, Xiong
    He, Haibo
    Liu, Derong
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2019, 49 (09): : 1866 - 1878
  • [49] Fuzzy Adaptive Tracking of Constrained Nonlinear Systems With Event-Sampling Reinforcement Learning
    Zhu, Hao-Yang
    Li, Yuan-Xin
    Tong, Shaocheng
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2024, 32 (02) : 536 - 546
  • [50] Reinforcement learning for adaptive optimal control of continuous-time linear periodic systems
    Pang, Bo
    Jiang, Zhong-Ping
    Mareels, Iven
    AUTOMATICA, 2020, 118