High-dimensional stochastic control models for newsvendor problems and deep learning resolution

被引：0

作者：

Ma, Jingtang ^{[1
,2
]}

Yang, Shan ^{[3
]}

机构：

[1] Southwestern Univ Finance & Econ, Sch Math, Chengdu 611130, Peoples R China

[2] Southwestern Univ Finance & Econ, Fintech Innovat Ctr, Chengdu 611130, Peoples R China

[3] Univ New South Wales, Sch Risk & Actuarial Studies, UNSW Business Sch, Sydney, NSW 2052, Australia

来源：

ANNALS OF OPERATIONS RESEARCH | 2024年 / 339卷 / 1-2期

基金：

中国国家自然科学基金;

关键词：

Supply chain management; Newsvendor models; Stochastic control; Dynamic replenishment; Financial hedging; Stackelberg game; Deep learning; PARTIAL-DIFFERENTIAL-EQUATIONS; SUBGRADIENT METHODS; BACKWARD SCHEMES; RISK; APPROXIMATION; PROCUREMENT; ALGORITHMS; OPTIONS; COSTS;

D O I：

10.1007/s10479-024-05872-2

中图分类号：

C93 [管理学]; O22 [运筹学];

学科分类号：

070105 ; 12 ; 1201 ; 1202 ; 120202 ;

摘要：

This paper studies continuous-time models for newsvendor problems with dynamic replenishment, financial hedging and Stackelberg competition. These factors are considered simultaneously and the high-dimensional stochastic control models are established. High-dimensional Hamilton-Jacobi-Bellman (HJB) equations are derived for the value functions. To circumvent the curse of dimensionality, a deep learning algorithm is proposed to solve the HJB equations. A projection is introduced in the algorithm to avoid the gradient explosion during the training phase. The deep learning algorithm is implemented for HJB equations derived from the newsvendor models with dimensions up to six. Numerical outcomes validate the algorithm's accuracy and demonstrate that the high-dimensional stochastic control models can successfully mitigate the risk.

引用

页码：789 / 811

页数：23

共 40 条

[1] Bach F, 2017, J MACH LEARN RES, V18
[2] Baydin AG, 2018, J MACH LEARN RES, V18
[3] Machine Learning Approximation Algorithms for High-Dimensional Fully Nonlinear Partial Differential Equations and Second-order Backward Stochastic Differential Equations
Beck, Christian
Weinan, E.
Jentzen, Arnulf
[J]. JOURNAL OF NONLINEAR SCIENCE, 2019, 29 (04) : 1563 - 1619
[4] Real options valuation principle in the multi-period base-stock problem
Berling, Peter
[J]. OMEGA-INTERNATIONAL JOURNAL OF MANAGEMENT SCIENCE, 2008, 36 (06): : 1086 - 1095
[5] Reducing transaction costs for interest rate risk hedging with stochastic programming
Blomvall, Jorgen
Hagenbjork, Johan
[J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2022, 302 (03) : 1282 - 1293
[6] Serial Inventory Systems with Markov-Modulated Demand: Derivative Bounds, Asymptotic Analysis, and Insights
Chen, Li
Song, Jing-Sheng
Zhang, Yue
[J]. OPERATIONS RESEARCH, 2017, 65 (05) : 1231 - 1249
[7] Optimal decisions in a retailer Stackelberg supply chain
Chen, Xi
Zhang, Hui
Zhang, Michael
Chen, Jing
[J]. INTERNATIONAL JOURNAL OF PRODUCTION ECONOMICS, 2017, 187 : 260 - 270
[8] Risk aversion in inventory management
Chen, Xin
Sim, Melvyn
Simchi-Levi, David
Sun, Peng
[J]. OPERATIONS RESEARCH, 2007, 55 (05) : 828 - 842
[9] A multi-product risk-averse newsvendor with exponential utility function
Choi, Sungyong
Ruszczynski, Andrzej
[J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2011, 214 (01) : 78 - 84
[10] On the integration of production and financial hedging decisions in global markets
Ding, Qing
Dong, Lingxiu
Kouvelis, Panos
[J]. OPERATIONS RESEARCH, 2007, 55 (03) : 470 - 489

← 1 2 3 4 →