Stochastic optimal control with neural networks and application to a retailer inventory problem

被引：0

作者：

Huang, Zhongwu ^{[1
]}

Wang, Xiaohua ^{[1
]}

Balakrishnan, S. N. ^{[1
]}

机构：

[1] Univ Missouri, Dept Mech & Aerosp Engn, Rolla, MO 65401 USA

来源：

2005 44TH IEEE CONFERENCE ON DECISION AND CONTROL & EUROPEAN CONTROL CONFERENCE, VOLS 1-8 | 2005年

关键词：

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Overwhelming computational requirements of classical dynamic programming algorithms render them inapplicable to most practical stochastic problems. To overcome this problem a neural network based Dynamic Programming (DP) approach is described in this study. The cost function which is critical in a dynamic programming formulation is approximated by a neural network according to some designed weight-update rule based on Temporal Difference (TD) learning. A Lyapunov based theory is developed to guarantee an upper error bound between the output of the cost neural network and the true cost. We illustrate this approach through a retailer inventory problem.

引用

页码：4518 / 4523

页数：6

共 50 条

[41] Optimal stationary control of discrete processes and a polynomial time algorithm for stochastic control problem on networks [J].

Lozovanu, Dmitrii ;

Pickl, Stefan .

ICCS 2010 - INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, PROCEEDINGS, 2010, 1 (01) :1411-1420

[42] A study on optimal control problem with ελ- error bound for stochastic systems with application to linear quadratic problem [J].

Boukaf S. ;

Hafayed M. ;

Ghebouli M. .

International Journal of Dynamics and Control, 2017, 5 (02) :297-305

[43] Maximum principle for a stochastic optimal control problem and application to portfolio/consumption choice [J].

Department of Applied Mathematics, Zhejiang University, Hangzhou, China .

J. Optim. Theory Appl., 3 (719-731)

[44] Maximum Principle for a Stochastic Optimal Control Problem and Application to Portfolio/Consumption Choice [J].

W. S. Xu .

Journal of Optimization Theory and Applications, 1998, 98 :719-731

[45] Maximum principle for a stochastic optimal control problem and application to portfolio/consumption choice [J].

Xu, WS .

JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 1998, 98 (03) :719-731

[46] Effects of the control bounds in the stochastic optimal control problem [J].

Crespo, LG ;

Sun, JQ .

PROCEEDINGS OF THE 2002 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2002, 1-6 :4238-4243

[47] ON ONE STOCHASTIC OPTIMAL CONTROL PROBLEM WITH CONTROL DELAY [J].

Agayeva, Cherkez A. .

PROCEEDINGS OF THE INSTITUTE OF MATHEMATICS AND MECHANICS, 2005, 22 (30) :171-177

[48] The application of dynamic programming to optimal inventory control [J].

Berovic, DP ;

Vinter, RB .

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2004, 49 (05) :676-685

[49] Application of optimal control in inventory management of production [J].

Sun, Pei-hong ;

Tang, Lei ;

Tang, Liying .

APPLIED MECHANICS AND MECHANICAL ENGINEERING, PTS 1-3, 2010, 29-32 :2503-+

[50] Optimal payment time for retailer's inventory system [J].

Liao, HC ;

Chen, YK .

INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2003, 34 (04) :245-253

← 1 2 3 4 5 →