Nonzero-Sum Risk-Sensitive Stochastic Games on a Countable State Space

被引:21
作者
Basu, Arnab [1 ]
Ghosh, Mrinal K. [2 ]
机构
[1] Indian Inst Management Bangalore, Decis Sci & Informat Syst Area, Bangalore 560076, Karnataka, India
[2] Indian Inst Sci, Dept Math, Bangalore 560012, Karnataka, India
关键词
noncooperative stochastic games; risk-sensitive payoff; Bellman equations; Nash equilibria; geometric ergodicity; MARKOV DECISION CHAINS; QUADRATIC-GAUSSIAN CONTROL; INFINITE-HORIZON RISK; DIFFERENTIAL-GAMES; NASH EQUILIBRIA; OPTIMAL STRATEGIES; DYNAMIC-GAMES; OPTIMALITY; AVERAGE; ERGODICITY;
D O I
10.1287/moor.2017.0870
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
The infinite horizon risk-sensitive discounted-cost and ergodic-cost nonzero-sum stochastic games for controlled Markov chains with countably many states are analyzed. For the discounted-cost game, we prove the existence of Nash equilibrium strategies in the class of Markov strategies under fairly general conditions. Under an additional weak geometric ergodicity condition and a small cost criterion, the existence of Nash equilibrium strategies in the class of stationary Markov strategies is proved for the ergodic-cost game. The key nontrivial contributions in the ergodic part are to prove the existence of a particular form of a (relative) value function solution to a player's Bellman equation and the continuity of this solution with respect to the opponent's strategies.
引用
收藏
页码:516 / 532
页数:17
相关论文
共 45 条
[1]  
ALIPRANTIS C., 2006, Infinite Dimensional Analysis, A Hitchhikers Guide
[2]   Contraction conditions for average and alpha-discount optimality in countable state Markov games with unbounded rewards [J].
Altman, E ;
Hordijk, A ;
Spieksma, FM .
MATHEMATICS OF OPERATIONS RESEARCH, 1997, 22 (03) :588-618
[3]  
[Anonymous], 2013, Probability Theory
[4]  
[Anonymous], 2009, MARKOV CHAINS STOCHA
[5]   More Risk-Sensitive Markov Decision Processes [J].
Baeuerle, Nicole ;
Rieder, Ulrich .
MATHEMATICS OF OPERATIONS RESEARCH, 2014, 39 (01) :105-120
[6]   Multiplicative ergodicity and large deviations for an irreducible Markov chain [J].
Balaji, S ;
Meyn, SP .
STOCHASTIC PROCESSES AND THEIR APPLICATIONS, 2000, 90 (01) :123-144
[7]   Nash equilibria of risk-sensitive nonlinear stochastic differential games [J].
Basar, T .
JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 1999, 100 (03) :479-498
[8]   Zero-sum risk-sensitive stochastic games on a countable state space [J].
Basu, Arnab ;
Ghosh, Mrinal Kanti .
STOCHASTIC PROCESSES AND THEIR APPLICATIONS, 2014, 124 (01) :961-983
[9]   Zero-Sum Risk-Sensitive Stochastic Differential Games [J].
Basu, Arnab ;
Ghosh, Mrinal K. .
MATHEMATICS OF OPERATIONS RESEARCH, 2012, 37 (03) :437-449
[10]  
Bellman R. E., 1957, Dynamic programming. Princeton landmarks in mathematics