A Stochastic Gradient Descent Approach for Stochastic Optimal Control

被引:11
作者
Archibald, Richard [1 ]
Bao, Feng [2 ]
Yong, Jiongmin [3 ]
机构
[1] Oak Ridge Natl Lab, Computat Sci & Math Div, Oak Ridge, TN USA
[2] Florida State Univ, Dept Math, Tallahassee, FL 32306 USA
[3] Univ Cent Florida, Dept Math, Orlando, FL 32816 USA
基金
美国国家科学基金会;
关键词
Stochastic optimal control; stochastic gradient descent; maximum principle; forward backward stochastic differential equations; DIFFERENTIAL-EQUATIONS; MAXIMUM PRINCIPLE; NUMERICAL SCHEMES; PROJECTION METHOD; TIME;
D O I
10.4208/eajam.190420.200420
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
In this work, we introduce a stochastic gradient descent approach to solve the stochastic optimal control problem through stochastic maximum principle. The motivation that drives our method is the gradient of the cost functional in the stochastic optimal control problem is under expectation, and numerical calculation of such an expectation requires fully computation of a system of forward backward stochastic differential equations, which is computationally expensive. By evaluating the expectation with single-sample representation as suggested by the stochastic gradient descent type optimisation, we could save computational efforts in solving FBSDEs and only focus on the optimisation task which aims to determine the optimal control process.
引用
收藏
页码:635 / 658
页数:24
相关论文
共 41 条
[1]  
[Anonymous], FUTURE DIRECTIONS CO
[2]   Levy Backward SDE Filter for jump Diffusion Processes and Its Applications in Material Sciences [J].
Bao, Feng ;
Archibald, Richard ;
Maksymovych, Peter .
COMMUNICATIONS IN COMPUTATIONAL PHYSICS, 2020, 27 (02) :589-618
[3]   A Backward Doubly Stochastic Differential Equation Approach for Nonlinear Filtering Problems [J].
Bao, Feng ;
Cao, Yanzhao ;
Zhao, Weidong .
COMMUNICATIONS IN COMPUTATIONAL PHYSICS, 2018, 23 (05) :1573-1601
[4]   ADAPTIVE MESHFREE BACKWARD SDE FILTER [J].
Bao, Feng ;
Maroulas, Vasileios .
SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2017, 39 (06) :A2664-A2683
[5]   A First Order Scheme for Backward Doubly Stochastic Differential Equations [J].
Bao, Feng ;
Cao, Yanzhao ;
Meir, Amnon ;
Zhao, Weidong .
SIAM-ASA JOURNAL ON UNCERTAINTY QUANTIFICATION, 2016, 4 (01) :413-445
[6]   A FIRST ORDER SEMI-DISCRETE ALGORITHM FOR BACKWARD DOUBLY STOCHASTIC DIFFERENTIAL EQUATIONS [J].
Bao, Feng ;
Cao, Yanzhao ;
Zhao, Weidong .
DISCRETE AND CONTINUOUS DYNAMICAL SYSTEMS-SERIES B, 2015, 20 (05) :1297-1313
[7]   NUMERICAL SOLUTIONS FOR FORWARD BACKWARD DOUBLY STOCHASTIC DIFFERENTIAL EQUATIONS AND ZAKAI EQUATIONS [J].
Bao, Feng ;
Cao, Yanzhao ;
Zhao, Weidong .
INTERNATIONAL JOURNAL FOR UNCERTAINTY QUANTIFICATION, 2011, 1 (04) :351-367
[8]   Machine Learning Approximation Algorithms for High-Dimensional Fully Nonlinear Partial Differential Equations and Second-order Backward Stochastic Differential Equations [J].
Beck, Christian ;
Weinan, E. ;
Jentzen, Arnulf .
JOURNAL OF NONLINEAR SCIENCE, 2019, 29 (04) :1563-1619
[9]  
Bellman R., 1949, 153 RAND CORP
[10]  
Bellman R.E., 1957, DYNAMIC PROGRAMMING