Numerical approximations for stochastic differential games

Cited by: 38
Authors
Kushner, HJ [1]
Affiliation
[1] Brown Univ, Lefschetz Ctr Dynam Syst, Dept Appl Math, Providence, RI 02912 USA
Keywords
stochastic differential games; numerical methods; Markov chain approximations
DOI
10.1137/S0363012901389457
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
The Markov chain approximation method is a widely used, robust, relatively easy to use, and efficient family of methods for the bulk of stochastic control problems in continuous time for reflected-jump-diffusion-type models. It has been shown to converge under broad conditions, and there are good algorithms for solving the numerical problems if the dimension is not too high. Versions of these methods have been used in applications to various two-player differential and stochastic dynamic games for a long time, and proofs of convergence are available for some cases, mainly using PDE-type techniques. In this paper, purely probabilistic proofs of convergence are given for a broad class of such problems, where the controls for the two players are separated in the dynamics and cost function, and which cover a substantial class not dealt with in previous works. Discounted and stopping time cost functions are considered. Finite horizon problems and problems where the process is stopped on first hitting an a priori given boundary can be dealt with by adapting the methods of [H. J. Kushner and P. Dupuis, Numerical Methods for Stochastic Control Problems in Continuous Time, 2nd ed., Springer-Verlag, Berlin, New York, 2001] as done in this paper for the treated problems. The essential conditions are the weak-sense existence and uniqueness of solutions, an almost everywhere continuity condition, and that a weak local consistency condition holds almost everywhere for the numerical approximations, just as for the control problem. There are extensions to problems with controlled variance and jumps.
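As a rough illustration of the kind of scheme the abstract refers to, the following is a minimal sketch of a Markov chain approximation for a one-dimensional discounted zero-sum game. It is not the paper's algorithm. Everything specific is an assumption made for illustration: the drift `-x + u + v`, the running cost `x**2 + 0.5*u**2 - 0.5*v**2`, the finite control grids, the central-difference transition probabilities, the simplified reflection at the ends of the grid, and the name `solve_game`. Because the controls enter the assumed drift and cost additively (the "separated" structure mentioned in the abstract), the one-step upper (min-max) and lower (max-min) values should coincide on the grid.

```python
import numpy as np


def solve_game(n=41, sigma=1.0, beta=1.0, tol=1e-10, max_iter=100_000):
    """Value iteration on an approximating chain (illustrative assumptions only)."""
    xs = np.linspace(-1.0, 1.0, n)        # state grid on [-1, 1]
    h = xs[1] - xs[0]                      # grid spacing
    us = np.linspace(-1.0, 1.0, 5)         # minimizing player's controls (assumed)
    vs = np.linspace(-0.5, 0.5, 5)         # maximizing player's controls (assumed)

    dt = h**2 / sigma**2                   # interpolation interval of the chain
    disc = np.exp(-beta * dt)              # one-step discount factor

    # Assumed separated drift b(x, u, v) = -x + u + v, shape (n, len(us), len(vs)).
    b = -xs[:, None, None] + us[None, :, None] + vs[None, None, :]

    # Central-difference transition probabilities to x + h and x - h.
    # Mean step is b * dt and variance is sigma^2 * dt + o(dt) (local consistency).
    p_up = 0.5 + h * b / (2.0 * sigma**2)
    p_dn = 1.0 - p_up
    if p_up.min() < 0.0 or p_dn.min() < 0.0:
        raise ValueError("grid too coarse: need h * |b| <= sigma**2")

    # Assumed separated running cost k(x, u, v) = x^2 + u^2/2 - v^2/2.
    cost = (xs[:, None, None] ** 2
            + 0.5 * us[None, :, None] ** 2
            - 0.5 * vs[None, None, :] ** 2)

    V = np.zeros(n)
    for _ in range(max_iter):
        # Simplified reflection: a step off the grid stays at the end point.
        V_up = np.concatenate((V[1:], V[-1:]))
        V_dn = np.concatenate((V[:1], V[:-1]))
        q = cost * dt + disc * (p_up * V_up[:, None, None]
                                + p_dn * V_dn[:, None, None])
        upper = q.max(axis=2).min(axis=1)  # min over u of max over v
        lower = q.min(axis=1).max(axis=1)  # max over v of min over u
        err = np.max(np.abs(upper - V))
        V = upper
        if err < tol:
            break
    return xs, V, upper, lower


if __name__ == "__main__":
    xs, V, upper, lower = solve_game()
    print("max |upper - lower| =", np.max(np.abs(upper - lower)))
```

The central-difference form is chosen here only because it makes the one-step expression linear in the drift, so the additive structure in `u` and `v` is preserved exactly; an upwind scheme would be the usual choice when `h * |b|` is not small relative to `sigma**2`.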
Pages: 457-486
Page count: 30
Related papers
41 entries in total
[1] Altman, E; Kushner, HJ. Admission control for combined guaranteed performance and best effort communications systems under heavy traffic [J]. SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 1999, 37 (06): 1780-1807
[2] Ball, JA; Day, MV; Yu, TS; Kachroo, P. Robust L2-gain control for nonlinear systems with projection dynamics and input constraints: an example from traffic control [J]. AUTOMATICA, 1999, 35 (03): 429-444
[3] Ball, JA; Day, MV; Kachroo, P. Robust feedback control of a single server queueing system [J]. MATHEMATICS OF CONTROL SIGNALS AND SYSTEMS, 1999, 12 (04): 307-345
[4] BARDI M, 1994, ANN INT SOC DYN GAME, P89
[5] BARDI M, 1991, LECT NOTES CONTR INF, V156, P131
[6] Bardi M, 1995, ANN INT SOC DYN GAME, V3, P273
[7] BARDI M, 1999, STOCHASTIC DIFFERENT, P105
[8] BENES, VE. EXISTENCE OF OPTIMAL STOCHASTIC CONTROL LAWS [J]. SIAM JOURNAL ON CONTROL, 1971, 9 (03): 446-&
[9] Bernhard P., 1991, HINFINITYOPTIMAL CON
[10] Billingsley P., 1999, CONVERGENCE PROBABIL