Data-driven policy iteration algorithm for optimal control of continuous-time Ito stochastic systems with Markovian jumps

被引:32
作者
Song, Jun [1 ]
He, Shuping [2 ]
Liu, Fei [3 ]
Niu, Yugang [1 ]
Ding, Zhengtao [4 ]
机构
[1] East China Univ Sci & Technol, Key Lab Adv Control & Optimizat Chem Proc, Minist Educ, Shanghai 200237, Peoples R China
[2] Anhui Univ, Sch Elect Engn & Automat, Hefei 230601, Peoples R China
[3] Jiangnan Univ, Inst Automat, Key Lab Adv Proc Control Light Ind, Minist Educ, Wuxi 214122, Peoples R China
[4] Univ Manchester, Sch Elect & Elect Engn, Control Syst Ctr, Sackville St Bldg, Manchester M13 9PL, Lancs, England
关键词
stochastic systems; continuous time systems; iterative methods; Markov processes; convergence of numerical methods; Riccati equations; transforms; optimal control; ST-based data-driven policy iteration algorithm; infinite horizon optimal control problem; continuous-time Ito stochastic systems; Markovian jumps; multiplicative noises; stochastic coupled algebraic Riccatic equation; stochastic CARE; offline iteration algorithm; implicit iterative algorithm; subsystems transformation technique; parallel Kleinman iterative equations; SLIDING MODE CONTROL; OPTIMAL TRACKING CONTROL; ADAPTIVE OPTIMAL-CONTROL; H-INFINITY CONTROL; LINEAR-SYSTEMS; NONLINEAR-SYSTEMS; NEURAL-NETWORKS; TRANSFORMATION; STABILITY;
D O I
10.1049/iet-cta.2015.0973
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This studies the infinite horizon optimal control problem for a class of continuous-time systems subjected to multiplicative noises and Markovian jumps by using a data-driven policy iteration algorithm. The optimal control problem is equivalent to solve a stochastic coupled algebraic Riccatic equation (CARE). An off-line iteration algorithm is first established to converge the solutions of the stochastic CARE, which is generalised from an implicit iterative algorithm. By applying subsystems transformation (ST) technique, the off-line iterative algorithm is decoupled into N parallel Kleinman's iterative equations. To learn the solution of the stochastic CARE from N decomposed linear subsystems data, an ST-based data-driven policy iteration algorithm is proposed and the convergence is proved. Finally, a numerical example is given to illustrate the effectiveness and applicability of the proposed two iterative algorithms.
引用
收藏
页码:1431 / 1439
页数:9
相关论文
共 49 条
  • [1] Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof
    Al-Tamimi, Asma
    Lewis, Frank L.
    Abu-Khalaf, Murad
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2008, 38 (04): : 943 - 949
  • [2] A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems
    Bhasin, S.
    Kamalapurkar, R.
    Johnson, M.
    Vamvoudakis, K. G.
    Lewis, F. L.
    Dixon, W. E.
    [J]. AUTOMATICA, 2013, 49 (01) : 82 - 92
  • [3] PARALLEL COMPUTATION OF THE SOLUTIONS OF COUPLED ALGEBRAIC LYAPUNOV EQUATIONS
    BORNO, I
    [J]. AUTOMATICA, 1995, 31 (09) : 1345 - 1347
  • [4] PARALLEL ALGORITHM FOR SOLVING COUPLED ALGEBRAIC LYAPUNOV EQUATIONS OF DISCRETE-TIME JUMP LINEAR-SYSTEMS
    BORNO, I
    GAJIC, Z
    [J]. COMPUTERS & MATHEMATICS WITH APPLICATIONS, 1995, 30 (07) : 1 - 4
  • [5] Sliding mode control for stochastic Markovian jumping systems with incomplete transition rate
    Chen, Bei
    Niu, Yugang
    Zou, Yuanyuan
    [J]. IET CONTROL THEORY AND APPLICATIONS, 2013, 7 (10) : 1330 - 1338
  • [6] Adaptive sliding mode control for stochastic Markovian jumping systems with actuator degradation
    Chen, Bei
    Niu, Yugang
    Zou, Yuanyuan
    [J]. AUTOMATICA, 2013, 49 (06) : 1748 - 1754
  • [7] Exponential H∞ filtering for stochastic Markovian jump systems with time delays
    Chen, Yun
    Zheng, Wei Xing
    [J]. INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2014, 24 (04) : 625 - 643
  • [8] Stochastic state estimation for neural networks with distributed delays and Markovian jump
    Chen, Yun
    Zheng, Wei Xing
    [J]. NEURAL NETWORKS, 2012, 25 : 14 - 20
  • [9] The linear quadratic optimization problems for a class of linear stochastic systems with multiplicative white noise and Markovian jumping
    Dragan, V
    Morozan, T
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2004, 49 (05) : 665 - 675
  • [10] LYAPUNOV ITERATIONS FOR OPTIMAL-CONTROL OF JUMP LINEAR-SYSTEMS AT STEADY-STATE
    GAJIC, Z
    BORNO, I
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1995, 40 (11) : 1971 - 1975