Reinforcement learning and adaptive optimization of a class of Markov jump systems with completely unknown dynamic information

被引:0
|
作者
Shuping He
Maoguang Zhang
Haiyang Fang
Fei Liu
Xiaoli Luan
Zhengtao Ding
机构
[1] Anhui University,School of Electrical Engineering and Automation
[2] Anhui University,Institute of Physical Science and Information Technology
[3] Jiangnan University,Key Laboratory of Advanced Process Control for Light Industry (Ministry of Education), Institute of Automation
[4] The University of Manchester,School of Electrical and Electronic Engineering
来源
Neural Computing and Applications | 2020年 / 32卷
关键词
Markov jump linear systems (MJLSs); Adaptive optimal control; Online; Reinforcement learning (RL); Coupled algebraic Riccati equations (AREs);
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, an online adaptive optimal control problem of a class of continuous-time Markov jump linear systems (MJLSs) is investigated by using a parallel reinforcement learning (RL) algorithm with completely unknown dynamics. Before collecting and learning the subsystems information of states and inputs, the exploration noise is firstly added to describe the actual control input. Then, a novel parallel RL algorithm is used to parallelly compute the corresponding N coupled algebraic Riccati equations by online learning. By this algorithm, we will not need to know the dynamic information of the MJLSs. The convergence of the proposed algorithm is also proved. Finally, the effectiveness and applicability of this novel algorithm is illustrated by two simulation examples.
引用
收藏
页码:14311 / 14320
页数:9
相关论文
共 50 条
  • [1] Reinforcement learning and adaptive optimization of a class of Markov jump systems with completely unknown dynamic information
    He, Shuping
    Zhang, Maoguang
    Fang, Haiyang
    Liu, Fei
    Luan, Xiaoli
    Ding, Zhengtao
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (18) : 14311 - 14320
  • [2] Reinforcement learning-based optimal control for Markov jump systems with completely unknown dynamics
    Shi, Xiongtao
    Li, Yanjie
    Du, Chenglong
    Chen, Chaoyang
    Zong, Guangdeng
    Gui, Weihua
    AUTOMATICA, 2025, 171
  • [3] Adaptive optimization algorithm for nonlinear Markov jump systems with partial unknown dynamics
    Fang, Haiyang
    Zhu, Guozheng
    Stojanovic, Vladimir
    Nie, Rong
    He, Shuping
    Luan, Xiaoli
    Liu, Fei
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2021, 31 (06) : 2126 - 2140
  • [4] Fuzzy-Based Adaptive Optimization of Unknown Discrete-Time Nonlinear Markov Jump Systems With Off-Policy Reinforcement Learning
    Fang, Haiyang
    Tu, Yidong
    Wang, Hai
    He, Shuping
    Liu, Fei
    Ding, Zhengtao
    Cheng, Shing Shin
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2022, 30 (12) : 5276 - 5290
  • [5] Reinforcement Learning-Based Near Optimization for Continuous-Time Markov Jump Singularly Perturbed Systems
    Wang, Jing
    Peng, Chuanjun
    Park, Ju H.
    Shen, Hao
    Shi, Kaibo
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2023, 70 (06) : 2026 - 2030
  • [6] Event-Triggered Reinforcement Learning-Based Adaptive Tracking Control for Completely Unknown Continuous-Time Nonlinear Systems
    Guo, Xinxin
    Yan, Weisheng
    Cui, Rongxin
    IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (07) : 3231 - 3242
  • [7] H∞ Tracking Control of Completely Unknown Continuous-Time Systems via Off-Policy Reinforcement Learning
    Modares, Hamidreza
    Lewis, Frank L.
    Jiang, Zhong-Ping
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (10) : 2550 - 2562
  • [8] Reinforcement Learning-Based Adaptive Optimal Exponential Tracking Control of Linear Systems With Unknown Dynamics
    Chen, Ci
    Modares, Hamidreza
    Xie, Kan
    Lewis, Frank L.
    Wan, Yan
    Xie, Shengli
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2019, 64 (11) : 4423 - 4438
  • [9] Optimal control for continuous-time Markov jump singularly perturbed systems : A hybrid reinforcement learning scheme
    Huang, Yaling
    Li, Wenqian
    Wang, Yun
    Shen, Hao
    JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2024, 361 (07):
  • [10] Data-Driven Adaptive LQR for Completely Unknown LTI Systems
    Jha, Sumit Kumar
    Roy, Sayan Basu
    Bhasin, Shubhendu
    IFAC PAPERSONLINE, 2017, 50 (01): : 4156 - 4161