Solving Stochastic Optimal Control Problem via Stochastic Maximum Principle with Deep Learning Method

被引:0
作者
Shaolin Ji
Shige Peng
Ying Peng
Xichuan Zhang
机构
[1] Shandong University,Shandong University
[2] Shandong University,Zhongtai Securities Institute for Financial Studies
来源
Journal of Scientific Computing | 2022年 / 93卷
关键词
Stochastic control; Deep neural networks; Stochastic maximum principle; Hamiltonian system; PDEs; 49M25; 68T20;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, we aim to solve the high dimensional stochastic optimal control problem from the view of the stochastic maximum principle via deep learning. By introducing the extended Hamiltonian system which is essentially a Forward Backward Stochastic Differential Equation (FBSDE) with a maximum condition, we reformulate the original control problem as a new one. According to whether the optimal control has an explicit representation, three algorithms are proposed to solve the new control problem. Numerical results for different examples demonstrate the effectiveness of our proposed algorithms, especially in high dimensional cases. And even if the optimal control u~\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\tilde{u}$$\end{document} in the maximum condition may not be solved explicitly, our algorithms can still deal with the stochastic optimal control problem. An important application of our proposed method is to calculate the sub-linear expectations, which correspond to a kind of fully nonlinear Partial Differential Equations (PDEs).
引用
收藏
相关论文
共 39 条
[1]  
Bismut JM(1978)An introductory approach to duality in optimal stochastic control SIAM Rev. 20 62-undefined
[2]  
Bensoussan A(1983)Stochastic maximum principle for distributed parameter system J. Franklin Inst. 315 387-undefined
[3]  
Bellman Richard(1958)Dynamic programming and stochastic control processes Inf. Control 1 228-undefined
[4]  
Kushner HJ(1990)Numerical methods for stochastic control problems in continuous time, SIAM J. Control Optim. 28 888-undefined
[5]  
Dong H(2007)The rate of convergence of finite-difference approximations for parabolic bellman equations with lipschitz coefficients in cylindrical domains Appl. Math. Optim. 56 37-undefined
[6]  
Krylov NV(2003)On the rate of convergence of approximation schemes for Bellman equations associated with optimal stopping time problems Math. Models Methods Appl. Sci. 13 613-undefined
[7]  
Jakobsen ER(2005)The rate of convergence of finite-difference approximations for Bellman equations with Lipschitz coefficients Appl. Math. Optim. 52 365-undefined
[8]  
Krylov NV(2009)Panos, approximate dynamic programming: solving the curses of dimensionality Optim. Methods Softw. 24 155-undefined
[9]  
Pardalos M(2017)Deep learning-based numerical methods for high-dimensional parabolic partial differential equations and backward stochastic differential equations Commun. Math. Stat. 5 349-undefined
[10]  
J Han W.E(2018)Solving high-dimensional partial differential equations using deep learning Proc. Natl. Acad. Sci. 115 8505-undefined