Data-Based Adaptive Critic Designs for Nonlinear Robust Optimal Control With Uncertain Dynamics

被引:177
作者
Wang, Ding [1 ]
Liu, Derong [2 ]
Zhang, Qichao [1 ]
Zhao, Dongbin [1 ]
机构
[1] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
[2] Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing 100083, Peoples R China
来源
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS | 2016年 / 46卷 / 11期
基金
中国国家自然科学基金;
关键词
Adaptive critic designs; adaptive dynamic programming; intelligent control; neural networks; policy iteration; robust optimal control; system identification; uncertain nonlinear systems; ONLINE OPTIMAL-CONTROL; DISCRETE-TIME-SYSTEMS; POLICY ITERATION; HJB SOLUTION; STABILIZATION; ALGORITHM;
D O I
10.1109/TSMC.2015.2492941
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, the infinite-horizon robust optimal control problem for a class of continuous-time uncertain non-linear systems is investigated by using data-based adaptive critic designs. The neural network identification scheme is combined with the traditional adaptive critic technique, in order to design the nonlinear robust optimal control under uncertain environment. First, the robust optimal controller of the original uncertain system with a specified cost function is established by adding a feedback gain to the optimal controller of the nominal system. Then, a neural network identifier is employed to reconstruct the unknown dynamics of the nominal system with stability analysis. Hence, the data-based adaptive critic designs can be developed to solve the Hamilton-Jacobi-Bellman equation corresponding to the transformed optimal control problem. The uniform ultimate boundedness of the closed-loop system is also proved by using the Lyapunov approach. Finally, two simulation examples are presented to illustrate the effectiveness of the developed control strategy.
引用
收藏
页码:1544 / 1555
页数:12
相关论文
共 50 条
[1]   Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach [J].
Abu-Khalaf, M ;
Lewis, FL .
AUTOMATICA, 2005, 41 (05) :779-791
[2]   Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof [J].
Al-Tamimi, Asma ;
Lewis, Frank L. ;
Abu-Khalaf, Murad .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2008, 38 (04) :943-949
[3]  
[Anonymous], 1992, HDB INTELLIGENT CONT
[4]   Missile defense and interceptor allocation by neuro-dynamic programming [J].
Bertsekas, DP ;
Homer, ML ;
Logan, DA ;
Patek, SD ;
Sandell, NR .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2000, 30 (01) :42-51
[5]   A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems [J].
Bhasin, S. ;
Kamalapurkar, R. ;
Johnson, M. ;
Vamvoudakis, K. G. ;
Lewis, F. L. ;
Dixon, W. E. .
AUTOMATICA, 2013, 49 (01) :82-92
[6]   Online Optimal Control of Affine Nonlinear Discrete-Time Systems With Unknown Internal Dynamics by Using Time-Based Policy Update [J].
Dierks, Travis ;
Jagannathan, Sarangapani .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2012, 23 (07) :1118-1129
[7]   Adaptive Learning and Control for MIMO System Based on Adaptive Dynamic Programming [J].
Fu, Jian ;
He, Haibo ;
Zhou, Xinmin .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2011, 22 (07) :1133-1148
[8]   A new design of robust H2 filters for uncertain systems [J].
Gao, Huijun ;
Meng, Xiangyu ;
Chen, Tongwen .
SYSTEMS & CONTROL LETTERS, 2008, 57 (07) :585-593
[9]   Finite-Horizon Control-Constrained Nonlinear Optimal Control Using Single Network Adaptive Critics [J].
Heydari, Ali ;
Balakrishnan, Sivasubramanya N. .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2013, 24 (01) :145-157
[10]   Robust Nonlinear Control of an Intrinsically Compliant Robotic Gait Training Orthosis [J].
Hussain, Shahid ;
Xie, Sheng Q. ;
Jamwal, Prashant K. .
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2013, 43 (03) :655-665