Stochastic optimal control with neural networks and application to a retailer inventory problem

被引:0
|
作者
Huang, Zhongwu [1 ]
Wang, Xiaohua [1 ]
Balakrishnan, S. N. [1 ]
机构
[1] Univ Missouri, Dept Mech & Aerosp Engn, Rolla, MO 65401 USA
来源
2005 44TH IEEE CONFERENCE ON DECISION AND CONTROL & EUROPEAN CONTROL CONFERENCE, VOLS 1-8 | 2005年
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Overwhelming computational requirements of classical dynamic programming algorithms render them inapplicable to most practical stochastic problems. To overcome this problem a neural network based Dynamic Programming (DP) approach is described in this study. The cost function which is critical in a dynamic programming formulation is approximated by a neural network according to some designed weight-update rule based on Temporal Difference (TD) learning. A Lyapunov based theory is developed to guarantee an upper error bound between the output of the cost neural network and the true cost. We illustrate this approach through a retailer inventory problem.
引用
收藏
页码:4518 / 4523
页数:6
相关论文
共 50 条