Data-Based Self-Learning Optimal Control for Continuous-Time Unknown Nonlinear Systems With Disturbance

被引:0
作者
Wei, Qinglai [1 ]
Liu, Derong [2 ]
Song, Ruizhuo [2 ]
Yan, Pengfei [1 ]
机构
[1] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
[2] Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing 100083, Peoples R China
来源
PROCEEDINGS OF THE 28TH CHINESE CONTROL AND DECISION CONFERENCE (2016 CCDC) | 2016年
关键词
Adaptive critic designs; Adaptive dynamic programming; Approximate dynamic programming; Neuro-dynamic programming; Recurrent neural network; Data-based control; Optimal control;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, a new data-based self-learning control scheme is developed to solve infinite horizon optimal control problems for continuous-time nonlinear systems. The developed optimal control scheme can be implement without knowing the mathematical model of the system. According to the input-output data of the nonlinear systems, a recurrent neural network (RNN) is employed to reconstruct the dynamics of the nonlinear system. According to the RNN model of the system, a new two-person zero-sum adaptive dynamic programming (ADP) algorithm is developed to obtain the optimal control, where the reconstruction error and the system disturbance are considered the control input of the system. Single-layer neural networks are used to construct the critic and action networks, which are presented to approximate the performance index function and the control law, respectively. Finally, simulation results will show the effectiveness of the developed data-based ADP methods.
引用
收藏
页码:6633 / 6638
页数:6
相关论文
共 22 条
[1]  
[Anonymous], 1999, Neural network control of robot manipulators and nonlinear systems
[2]   Internal structure of coalitions in competitive and altruistic graphical coalitional games [J].
Aurangzeb, Muhammad ;
Lewis, Frank L. .
AUTOMATICA, 2014, 50 (02) :335-348
[3]  
Basar T., 1999, Dynamic Noncooperative Game Theory, Vsecond
[4]  
Beard R. W., 1995, THESIS
[5]  
Bernhard P., 1995, H-optimal control and related minimax design problems, V2nd
[6]   An Equivalence Between Adaptive Dynamic Programming With a Critic and Backpropagation Through Time [J].
Fairbank, Michael ;
Alonso, Eduardo ;
Prokhorov, Danil .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2013, 24 (12) :2088-2100
[7]   Finite-Approximation-Error-Based Optimal Control Approach for Discrete-Time Nonlinear Systems [J].
Liu, Derong ;
Wei, Qinglai .
IEEE TRANSACTIONS ON CYBERNETICS, 2013, 43 (02) :779-789
[8]   A self-learning call admission control scheme for CDMA cellular networks [J].
Liu, DR ;
Zhang, Y ;
Zhang, HG .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2005, 16 (05) :1219-1228
[9]   Exponential H∞ Synchronization and State Estimation for Chaotic Systems via a Unified Model [J].
Liu, Meiqin ;
Zhang, Senlin ;
Fan, Zhen ;
Zheng, Shiyou ;
Sheng, Weihua .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2013, 24 (07) :1114-1126
[10]   Optimal tracking control of nonlinear partially-unknown constrained-input systems using integral reinforcement learning [J].
Modares, Hamidreza ;
Lewis, Frank L. .
AUTOMATICA, 2014, 50 (07) :1780-1792