Stochastic optimal control with neural networks and application to a retailer inventory problem

被引:0
作者
Huang, Zhongwu [1 ]
Wang, Xiaohua [1 ]
Balakrishnan, S. N. [1 ]
机构
[1] Univ Missouri, Dept Mech & Aerosp Engn, Rolla, MO 65401 USA
来源
2005 44TH IEEE CONFERENCE ON DECISION AND CONTROL & EUROPEAN CONTROL CONFERENCE, VOLS 1-8 | 2005年
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Overwhelming computational requirements of classical dynamic programming algorithms render them inapplicable to most practical stochastic problems. To overcome this problem a neural network based Dynamic Programming (DP) approach is described in this study. The cost function which is critical in a dynamic programming formulation is approximated by a neural network according to some designed weight-update rule based on Temporal Difference (TD) learning. A Lyapunov based theory is developed to guarantee an upper error bound between the output of the cost neural network and the true cost. We illustrate this approach through a retailer inventory problem.
引用
收藏
页码:4518 / 4523
页数:6
相关论文
共 50 条
  • [21] A PROBLEM OF OPTIMAL CONTROL OF A STOCHASTIC SHEET
    Pepeljaeva, T. V.
    CYBERNETICS AND SYSTEMS ANALYSIS, 2010, 46 (01) : 140 - 144
  • [22] OPTIMAL STOCHASTIC CONTROL PROBLEM FOR A CARBON
    Huang, Wenlin
    Liang, Jin
    Dong, Yuchao
    SIAM JOURNAL ON APPLIED MATHEMATICS, 2023, 83 (03) : 1272 - 1295
  • [23] Optimal Control Problem of Stochastic Systems
    G. K. Vassilina
    Lobachevskii Journal of Mathematics, 2021, 42 : 641 - 648
  • [24] STOCHASTIC TIME OPTIMAL CONTROL PROBLEM
    HAUSSMANN, UG
    ANDERSON, WJ
    BOYARSKY, A
    SIAM REVIEW, 1974, 16 (04) : 581 - 581
  • [25] Optimal Control Problem of Stochastic Systems
    Vassilina, G. K.
    LOBACHEVSKII JOURNAL OF MATHEMATICS, 2021, 42 (03) : 641 - 648
  • [26] Optimal control for stochastic neural oscillators
    Rajabi, Faranak
    Gibou, Frederic
    Moehlis, Jeff
    BIOLOGICAL CYBERNETICS, 2025, 119 (2-3)
  • [27] Solving an Optimal Control Problem of Cancer Treatment by Artificial Neural Networks
    Heydarpour, F.
    Abbasi, E.
    Ebadi, M. J.
    Karbassi, S. M.
    INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2020, 6 (04): : 18 - 25
  • [28] OPTIMAL SCHEDULING IN SHRIMP MARICULTURE - A STOCHASTIC GROWING INVENTORY PROBLEM
    HOCHMAN, E
    LEUNG, PS
    ROWLAND, LW
    WYBAN, JA
    AMERICAN JOURNAL OF AGRICULTURAL ECONOMICS, 1990, 72 (02) : 382 - 393
  • [29] On optimal inventory control with independent stochastic item returns
    Fleischmann, M
    Kuik, R
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2003, 151 (01) : 25 - 37
  • [30] Nonlinear Optimal Control of Stochastic Recurrent Neural Networks with Multiple Time Delays
    Liu, Ziqian
    Wang, Qunjing
    Ansari, Nirwan
    Schurz, Henri
    2012 AMERICAN CONTROL CONFERENCE (ACC), 2012, : 6424 - 6429