High-dimensional stochastic control models for newsvendor problems and deep learning resolution

被引:0
作者
Ma, Jingtang [1 ,2 ]
Yang, Shan [3 ]
机构
[1] Southwestern Univ Finance & Econ, Sch Math, Chengdu 611130, Peoples R China
[2] Southwestern Univ Finance & Econ, Fintech Innovat Ctr, Chengdu 611130, Peoples R China
[3] Univ New South Wales, Sch Risk & Actuarial Studies, UNSW Business Sch, Sydney, NSW 2052, Australia
基金
中国国家自然科学基金;
关键词
Supply chain management; Newsvendor models; Stochastic control; Dynamic replenishment; Financial hedging; Stackelberg game; Deep learning; PARTIAL-DIFFERENTIAL-EQUATIONS; SUBGRADIENT METHODS; BACKWARD SCHEMES; RISK; APPROXIMATION; PROCUREMENT; ALGORITHMS; OPTIONS; COSTS;
D O I
10.1007/s10479-024-05872-2
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
This paper studies continuous-time models for newsvendor problems with dynamic replenishment, financial hedging and Stackelberg competition. These factors are considered simultaneously and the high-dimensional stochastic control models are established. High-dimensional Hamilton-Jacobi-Bellman (HJB) equations are derived for the value functions. To circumvent the curse of dimensionality, a deep learning algorithm is proposed to solve the HJB equations. A projection is introduced in the algorithm to avoid the gradient explosion during the training phase. The deep learning algorithm is implemented for HJB equations derived from the newsvendor models with dimensions up to six. Numerical outcomes validate the algorithm's accuracy and demonstrate that the high-dimensional stochastic control models can successfully mitigate the risk.
引用
收藏
页码:789 / 811
页数:23
相关论文
共 40 条
  • [1] Bach F, 2017, J MACH LEARN RES, V18
  • [2] Baydin AG, 2018, J MACH LEARN RES, V18
  • [3] Machine Learning Approximation Algorithms for High-Dimensional Fully Nonlinear Partial Differential Equations and Second-order Backward Stochastic Differential Equations
    Beck, Christian
    Weinan, E.
    Jentzen, Arnulf
    [J]. JOURNAL OF NONLINEAR SCIENCE, 2019, 29 (04) : 1563 - 1619
  • [4] Real options valuation principle in the multi-period base-stock problem
    Berling, Peter
    [J]. OMEGA-INTERNATIONAL JOURNAL OF MANAGEMENT SCIENCE, 2008, 36 (06): : 1086 - 1095
  • [5] Reducing transaction costs for interest rate risk hedging with stochastic programming
    Blomvall, Jorgen
    Hagenbjork, Johan
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2022, 302 (03) : 1282 - 1293
  • [6] Serial Inventory Systems with Markov-Modulated Demand: Derivative Bounds, Asymptotic Analysis, and Insights
    Chen, Li
    Song, Jing-Sheng
    Zhang, Yue
    [J]. OPERATIONS RESEARCH, 2017, 65 (05) : 1231 - 1249
  • [7] Optimal decisions in a retailer Stackelberg supply chain
    Chen, Xi
    Zhang, Hui
    Zhang, Michael
    Chen, Jing
    [J]. INTERNATIONAL JOURNAL OF PRODUCTION ECONOMICS, 2017, 187 : 260 - 270
  • [8] Risk aversion in inventory management
    Chen, Xin
    Sim, Melvyn
    Simchi-Levi, David
    Sun, Peng
    [J]. OPERATIONS RESEARCH, 2007, 55 (05) : 828 - 842
  • [9] A multi-product risk-averse newsvendor with exponential utility function
    Choi, Sungyong
    Ruszczynski, Andrzej
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2011, 214 (01) : 78 - 84
  • [10] On the integration of production and financial hedging decisions in global markets
    Ding, Qing
    Dong, Lingxiu
    Kouvelis, Panos
    [J]. OPERATIONS RESEARCH, 2007, 55 (03) : 470 - 489