Data-driven policy iteration algorithm for optimal control of continuous-time Ito stochastic systems with Markovian jumps

被引：33

作者：

Song, Jun ^{[1
]}

He, Shuping ^{[2
]}

Liu, Fei ^{[3
]}

Niu, Yugang ^{[1
]}

Ding, Zhengtao ^{[4
]}

机构：

[1] East China Univ Sci & Technol, Key Lab Adv Control & Optimizat Chem Proc, Minist Educ, Shanghai 200237, Peoples R China

[2] Anhui Univ, Sch Elect Engn & Automat, Hefei 230601, Peoples R China

[3] Jiangnan Univ, Inst Automat, Key Lab Adv Proc Control Light Ind, Minist Educ, Wuxi 214122, Peoples R China

[4] Univ Manchester, Sch Elect & Elect Engn, Control Syst Ctr, Sackville St Bldg, Manchester M13 9PL, Lancs, England

来源：

IET CONTROL THEORY AND APPLICATIONS | 2016年 / 10卷 / 12期

关键词：

stochastic systems; continuous time systems; iterative methods; Markov processes; convergence of numerical methods; Riccati equations; transforms; optimal control; ST-based data-driven policy iteration algorithm; infinite horizon optimal control problem; continuous-time Ito stochastic systems; Markovian jumps; multiplicative noises; stochastic coupled algebraic Riccatic equation; stochastic CARE; offline iteration algorithm; implicit iterative algorithm; subsystems transformation technique; parallel Kleinman iterative equations; SLIDING MODE CONTROL; OPTIMAL TRACKING CONTROL; ADAPTIVE OPTIMAL-CONTROL; H-INFINITY CONTROL; LINEAR-SYSTEMS; NONLINEAR-SYSTEMS; NEURAL-NETWORKS; TRANSFORMATION; STABILITY;

D O I：

10.1049/iet-cta.2015.0973

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This studies the infinite horizon optimal control problem for a class of continuous-time systems subjected to multiplicative noises and Markovian jumps by using a data-driven policy iteration algorithm. The optimal control problem is equivalent to solve a stochastic coupled algebraic Riccatic equation (CARE). An off-line iteration algorithm is first established to converge the solutions of the stochastic CARE, which is generalised from an implicit iterative algorithm. By applying subsystems transformation (ST) technique, the off-line iterative algorithm is decoupled into N parallel Kleinman's iterative equations. To learn the solution of the stochastic CARE from N decomposed linear subsystems data, an ST-based data-driven policy iteration algorithm is proposed and the convergence is proved. Finally, a numerical example is given to illustrate the effectiveness and applicability of the proposed two iterative algorithms.

引用

页码：1431 / 1439

页数：9

共 49 条

[1] Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof [J].

Al-Tamimi, Asma ;

Lewis, Frank L. ;

Abu-Khalaf, Murad .

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2008, 38 (04) :943-949

[2] A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems [J].

Bhasin, S. ;

Kamalapurkar, R. ;

Johnson, M. ;

Vamvoudakis, K. G. ;

Lewis, F. L. ;

Dixon, W. E. .

AUTOMATICA, 2013, 49 (01) :82-92

[3] PARALLEL COMPUTATION OF THE SOLUTIONS OF COUPLED ALGEBRAIC LYAPUNOV EQUATIONS [J].

BORNO, I .

AUTOMATICA, 1995, 31 (09) :1345-1347

[4] PARALLEL ALGORITHM FOR SOLVING COUPLED ALGEBRAIC LYAPUNOV EQUATIONS OF DISCRETE-TIME JUMP LINEAR-SYSTEMS [J].

BORNO, I ;

GAJIC, Z .

COMPUTERS & MATHEMATICS WITH APPLICATIONS, 1995, 30 (07) :1-4

[5] Sliding mode control for stochastic Markovian jumping systems with incomplete transition rate [J].

Chen, Bei ;

Niu, Yugang ;

Zou, Yuanyuan .

IET CONTROL THEORY AND APPLICATIONS, 2013, 7 (10) :1330-1338

[6] Adaptive sliding mode control for stochastic Markovian jumping systems with actuator degradation [J].

Chen, Bei ;

Niu, Yugang ;

Zou, Yuanyuan .

AUTOMATICA, 2013, 49 (06) :1748-1754

[7] Exponential H∞ filtering for stochastic Markovian jump systems with time delays [J].

Chen, Yun ;

Zheng, Wei Xing .

INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2014, 24 (04) :625-643

[8] Stochastic state estimation for neural networks with distributed delays and Markovian jump [J].

Chen, Yun ;

Zheng, Wei Xing .

NEURAL NETWORKS, 2012, 25 :14-20

[9] The linear quadratic optimization problems for a class of linear stochastic systems with multiplicative white noise and Markovian jumping [J].

Dragan, V ;

Morozan, T .

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2004, 49 (05) :665-675

[10] LYAPUNOV ITERATIONS FOR OPTIMAL-CONTROL OF JUMP LINEAR-SYSTEMS AT STEADY-STATE [J].

GAJIC, Z ;

BORNO, I .

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1995, 40 (11) :1971-1975

← 1 2 3 4 5 →