Decentralized Stabilization for a Class of Continuous-Time Nonlinear Interconnected Systems Using Online Learning Optimal Control Approach

被引:289
作者
Liu, Derong [1 ]
Wang, Ding [1 ]
Li, Hongliang [1 ]
机构
[1] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Adaptive dynamic programming; decentralized control; large-scale systems; neural networks; nonlinear interconnected systems; optimal control; policy iteration; reinforcement learning; LARGE-SCALE SYSTEMS; NEURAL-NETWORK; TRACKING CONTROL; ALGORITHM; IMPLEMENTATION;
D O I
10.1109/TNNLS.2013.2280013
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, using a neural-network-based online learning optimal control approach, a novel decentralized control strategy is developed to stabilize a class of continuous-time nonlinear interconnected large-scale systems. First, optimal controllers of the isolated subsystems are designed with cost functions reflecting the bounds of interconnections. Then, it is proven that the decentralized control strategy of the overall system can be established by adding appropriate feedback gains to the optimal control policies of the isolated subsystems. Next, an online policy iteration algorithm is presented to solve the Hamilton-Jacobi-Bellman equations related to the optimal control problem. Through constructing a set of critic neural networks, the cost functions can be obtained approximately, followed by the control policies. Furthermore, the dynamics of the estimation errors of the critic networks are verified to be uniformly and ultimately bounded. Finally, a simulation example is provided to illustrate the effectiveness of the present decentralized control scheme.
引用
收藏
页码:418 / 428
页数:11
相关论文
共 50 条
[1]   Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach [J].
Abu-Khalaf, M ;
Lewis, FL .
AUTOMATICA, 2005, 41 (05) :779-791
[2]   Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof [J].
Al-Tamimi, Asma ;
Lewis, Frank L. ;
Abu-Khalaf, Murad .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2008, 38 (04) :943-949
[3]  
[Anonymous], 1992, HDB INTELLIGENT CONT
[4]   Decentralized control: An overview [J].
Bakule, Lubomir .
ANNUAL REVIEWS IN CONTROL, 2008, 32 (01) :87-98
[5]   Issues on stability of ADP feedback controllers for dynamical systems [J].
Balakrishnan, S. N. ;
Ding, Jie ;
Lewis, Frank L. .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2008, 38 (04) :913-917
[6]   Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation [J].
Beard, RW ;
Saridis, GN ;
Wen, JT .
AUTOMATICA, 1997, 33 (12) :2159-2177
[7]   Missile defense and interceptor allocation by neuro-dynamic programming [J].
Bertsekas, DP ;
Homer, ML ;
Logan, DA ;
Patek, SD ;
Sandell, NR .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2000, 30 (01) :42-51
[8]   A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems [J].
Bhasin, S. ;
Kamalapurkar, R. ;
Johnson, M. ;
Vamvoudakis, K. G. ;
Lewis, F. L. ;
Dixon, W. E. .
AUTOMATICA, 2013, 49 (01) :82-92
[9]   A model-free robust policy iteration algorithm for optimal control of nonlinear systems [J].
Bhasin, S. ;
Johnson, M. ;
Dixon, W. E. .
49TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2010, :3060-3065
[10]   Online Optimal Control of Affine Nonlinear Discrete-Time Systems With Unknown Internal Dynamics by Using Time-Based Policy Update [J].
Dierks, Travis ;
Jagannathan, Sarangapani .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2012, 23 (07) :1118-1129