Stochastic optimal control with neural networks and application to a retailer inventory problem

被引:0
作者
Huang, Zhongwu [1 ]
Wang, Xiaohua [1 ]
Balakrishnan, S. N. [1 ]
机构
[1] Univ Missouri, Dept Mech & Aerosp Engn, Rolla, MO 65401 USA
来源
2005 44TH IEEE CONFERENCE ON DECISION AND CONTROL & EUROPEAN CONTROL CONFERENCE, VOLS 1-8 | 2005年
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Overwhelming computational requirements of classical dynamic programming algorithms render them inapplicable to most practical stochastic problems. To overcome this problem a neural network based Dynamic Programming (DP) approach is described in this study. The cost function which is critical in a dynamic programming formulation is approximated by a neural network according to some designed weight-update rule based on Temporal Difference (TD) learning. A Lyapunov based theory is developed to guarantee an upper error bound between the output of the cost neural network and the true cost. We illustrate this approach through a retailer inventory problem.
引用
收藏
页码:4518 / 4523
页数:6
相关论文
共 50 条
[41]   Optimal stationary control of discrete processes and a polynomial time algorithm for stochastic control problem on networks [J].
Lozovanu, Dmitrii ;
Pickl, Stefan .
ICCS 2010 - INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, PROCEEDINGS, 2010, 1 (01) :1411-1420
[42]   A study on optimal control problem with ελ- error bound for stochastic systems with application to linear quadratic problem [J].
Boukaf S. ;
Hafayed M. ;
Ghebouli M. .
International Journal of Dynamics and Control, 2017, 5 (02) :297-305
[43]   Maximum principle for a stochastic optimal control problem and application to portfolio/consumption choice [J].
Department of Applied Mathematics, Zhejiang University, Hangzhou, China .
J. Optim. Theory Appl., 3 (719-731)
[44]   Maximum Principle for a Stochastic Optimal Control Problem and Application to Portfolio/Consumption Choice [J].
W. S. Xu .
Journal of Optimization Theory and Applications, 1998, 98 :719-731
[45]   Maximum principle for a stochastic optimal control problem and application to portfolio/consumption choice [J].
Xu, WS .
JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 1998, 98 (03) :719-731
[46]   Effects of the control bounds in the stochastic optimal control problem [J].
Crespo, LG ;
Sun, JQ .
PROCEEDINGS OF THE 2002 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2002, 1-6 :4238-4243
[47]   ON ONE STOCHASTIC OPTIMAL CONTROL PROBLEM WITH CONTROL DELAY [J].
Agayeva, Cherkez A. .
PROCEEDINGS OF THE INSTITUTE OF MATHEMATICS AND MECHANICS, 2005, 22 (30) :171-177
[48]   The application of dynamic programming to optimal inventory control [J].
Berovic, DP ;
Vinter, RB .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2004, 49 (05) :676-685
[49]   Application of optimal control in inventory management of production [J].
Sun, Pei-hong ;
Tang, Lei ;
Tang, Liying .
APPLIED MECHANICS AND MECHANICAL ENGINEERING, PTS 1-3, 2010, 29-32 :2503-+
[50]   Optimal payment time for retailer's inventory system [J].
Liao, HC ;
Chen, YK .
INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2003, 34 (04) :245-253