Data-driven policy iteration algorithm for optimal control of continuous-time Ito stochastic systems with Markovian jumps

被引:33
作者
Song, Jun [1 ]
He, Shuping [2 ]
Liu, Fei [3 ]
Niu, Yugang [1 ]
Ding, Zhengtao [4 ]
机构
[1] East China Univ Sci & Technol, Key Lab Adv Control & Optimizat Chem Proc, Minist Educ, Shanghai 200237, Peoples R China
[2] Anhui Univ, Sch Elect Engn & Automat, Hefei 230601, Peoples R China
[3] Jiangnan Univ, Inst Automat, Key Lab Adv Proc Control Light Ind, Minist Educ, Wuxi 214122, Peoples R China
[4] Univ Manchester, Sch Elect & Elect Engn, Control Syst Ctr, Sackville St Bldg, Manchester M13 9PL, Lancs, England
关键词
stochastic systems; continuous time systems; iterative methods; Markov processes; convergence of numerical methods; Riccati equations; transforms; optimal control; ST-based data-driven policy iteration algorithm; infinite horizon optimal control problem; continuous-time Ito stochastic systems; Markovian jumps; multiplicative noises; stochastic coupled algebraic Riccatic equation; stochastic CARE; offline iteration algorithm; implicit iterative algorithm; subsystems transformation technique; parallel Kleinman iterative equations; SLIDING MODE CONTROL; OPTIMAL TRACKING CONTROL; ADAPTIVE OPTIMAL-CONTROL; H-INFINITY CONTROL; LINEAR-SYSTEMS; NONLINEAR-SYSTEMS; NEURAL-NETWORKS; TRANSFORMATION; STABILITY;
D O I
10.1049/iet-cta.2015.0973
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This studies the infinite horizon optimal control problem for a class of continuous-time systems subjected to multiplicative noises and Markovian jumps by using a data-driven policy iteration algorithm. The optimal control problem is equivalent to solve a stochastic coupled algebraic Riccatic equation (CARE). An off-line iteration algorithm is first established to converge the solutions of the stochastic CARE, which is generalised from an implicit iterative algorithm. By applying subsystems transformation (ST) technique, the off-line iterative algorithm is decoupled into N parallel Kleinman's iterative equations. To learn the solution of the stochastic CARE from N decomposed linear subsystems data, an ST-based data-driven policy iteration algorithm is proposed and the convergence is proved. Finally, a numerical example is given to illustrate the effectiveness and applicability of the proposed two iterative algorithms.
引用
收藏
页码:1431 / 1439
页数:9
相关论文
共 49 条
[1]   Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof [J].
Al-Tamimi, Asma ;
Lewis, Frank L. ;
Abu-Khalaf, Murad .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2008, 38 (04) :943-949
[2]   A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems [J].
Bhasin, S. ;
Kamalapurkar, R. ;
Johnson, M. ;
Vamvoudakis, K. G. ;
Lewis, F. L. ;
Dixon, W. E. .
AUTOMATICA, 2013, 49 (01) :82-92
[3]   PARALLEL COMPUTATION OF THE SOLUTIONS OF COUPLED ALGEBRAIC LYAPUNOV EQUATIONS [J].
BORNO, I .
AUTOMATICA, 1995, 31 (09) :1345-1347
[4]   PARALLEL ALGORITHM FOR SOLVING COUPLED ALGEBRAIC LYAPUNOV EQUATIONS OF DISCRETE-TIME JUMP LINEAR-SYSTEMS [J].
BORNO, I ;
GAJIC, Z .
COMPUTERS & MATHEMATICS WITH APPLICATIONS, 1995, 30 (07) :1-4
[5]   Sliding mode control for stochastic Markovian jumping systems with incomplete transition rate [J].
Chen, Bei ;
Niu, Yugang ;
Zou, Yuanyuan .
IET CONTROL THEORY AND APPLICATIONS, 2013, 7 (10) :1330-1338
[6]   Adaptive sliding mode control for stochastic Markovian jumping systems with actuator degradation [J].
Chen, Bei ;
Niu, Yugang ;
Zou, Yuanyuan .
AUTOMATICA, 2013, 49 (06) :1748-1754
[7]   Exponential H∞ filtering for stochastic Markovian jump systems with time delays [J].
Chen, Yun ;
Zheng, Wei Xing .
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2014, 24 (04) :625-643
[8]   Stochastic state estimation for neural networks with distributed delays and Markovian jump [J].
Chen, Yun ;
Zheng, Wei Xing .
NEURAL NETWORKS, 2012, 25 :14-20
[9]   The linear quadratic optimization problems for a class of linear stochastic systems with multiplicative white noise and Markovian jumping [J].
Dragan, V ;
Morozan, T .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2004, 49 (05) :665-675
[10]   LYAPUNOV ITERATIONS FOR OPTIMAL-CONTROL OF JUMP LINEAR-SYSTEMS AT STEADY-STATE [J].
GAJIC, Z ;
BORNO, I .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1995, 40 (11) :1971-1975