Neural network-based adaptive critic designs for self-learning control

被引:0
作者
Liu, DR [1 ]
机构
[1] Univ Illinois, Dept Elect & Comp Engn, Chicago, IL 60607 USA
来源
ICONIP'02: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING: COMPUTATIONAL INTELLIGENCE FOR THE E-AGE | 2002年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We consider the implementation of adaptive critic designs using neural networks. The present scheme is within general framework of approximate dynamic programming where optimal/suboptimal control is achieved through learning using multilayer feedforward neural networks. We will develop a class of adaptive critic designs that can be classified as (model-free) action-dependent heuristic dynamic programming (ADHDP). We believe that the present ADHDP is equivalent to the conventional model-based HDP since the model network in the latter can be viewed as completely embedded in the critic network.
引用
收藏
页码:1252 / 1256
页数:5
相关论文
共 29 条
[1]  
Anderson C. W., 1989, IEEE Control Systems Magazine, V9, P31, DOI 10.1109/37.24809
[2]   Adaptive-critic-based neural networks for aircraft optimal control [J].
Balakrishnan, SN ;
Biega, V .
JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 1996, 19 (04) :893-898
[3]   NEURONLIKE ADAPTIVE ELEMENTS THAT CAN SOLVE DIFFICULT LEARNING CONTROL-PROBLEMS [J].
BARTO, AG ;
SUTTON, RS ;
ANDERSON, CW .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1983, 13 (05) :834-846
[4]   DYNAMIC PROGRAMMING [J].
BELLMAN, R .
SCIENCE, 1966, 153 (3731) :34-&
[5]  
Bertsekas D.P., 2005, DYNAMIC PROGRAMMING, V1
[6]  
Bertsekas D. P., 1996, Neuro Dynamic Programming, V1st
[7]  
Cox C, 1999, INT J ROBUST NONLIN, V9, P1071, DOI 10.1002/(SICI)1099-1239(19991215)9:14<1071::AID-RNC453>3.0.CO
[8]  
2-W
[9]  
Cybenko G., 1989, Mathematics of Control, Signals, and Systems, V2, P303, DOI 10.1007/BF02551274
[10]   A neighboring optimal adaptive critic for missile guidance [J].
Dalton, J ;
Balakrishnan, SN .
MATHEMATICAL AND COMPUTER MODELLING, 1996, 23 (1-2) :175-188