Discrete-time adaptive dynamic programming using wavelet basis function neural networks

被引:17
作者
Jin, Ning [1 ]
Liu, Derong [1 ]
Huang, Ting [1 ]
Pang, Zhongyu [1 ]
机构
[1] Univ Illinois, Dept Elect & Comp Engn, Chicago, IL 60607 USA
来源
2007 IEEE INTERNATIONAL SYMPOSIUM ON APPROXIMATE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING | 2007年
关键词
D O I
10.1109/ADPRL.2007.368180
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dynamic programming for discrete time systems is difficult due to the "curse of dimensionality": one has to find a series of control actions that must be taken in sequence, hoping that this sequence will lead to the optimal performance cost, but the total cost of those actions will be unknown until the end of that sequence. In this paper, we present our work on adaptive dynamic programming (ADP) for nonlinear discrete time system using neural networks. The neural network we adopted here is the wavelet basis function (WBF) neural network. We will exam the performance of an ADP algorithm using WBF neural networks. The comparison shows that when WBF neural networks are employed, the ADP algorithm gives faster training speed than when PBF neural networks are employed.
引用
收藏
页码:135 / +
页数:2
相关论文
共 34 条
[1]  
[Anonymous], 1990, NEURAL NETWORKS CONT
[2]   Adaptive-critic-based neural networks for aircraft optimal control [J].
Balakrishnan, SN ;
Biega, V .
JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 1996, 19 (04) :893-898
[3]  
BARTO AG, 1983, IEEE T SYS MAN CYBER, V3, P455
[4]  
Bellman R., 1957, DYNAMIC PROGRAMMING
[5]  
Bertsekas D., 1996, NEURO DYNAMIC PROGRA, V1st
[6]  
Cox C, 1999, INT J ROBUST NONLIN, V9, P1071, DOI 10.1002/(SICI)1099-1239(19991215)9:14<1071::AID-RNC453>3.0.CO
[7]  
2-W
[8]   A neighboring optimal adaptive critic for missile guidance [J].
Dalton, J ;
Balakrishnan, SN .
MATHEMATICAL AND COMPUTER MODELLING, 1996, 23 (1-2) :175-188
[9]  
Daubechies I., 1992, Society for Industrial and Applied Mathematics, DOI DOI 10.1137/1.9781611970104
[10]  
Dreyfus S.E., 1977, The Art and Theory of Dynamic Programming