Contraction conditions for average and alpha-discount optimality in countable state Markov games with unbounded rewards

被引:41
作者
Altman, E [1 ]
Hordijk, A [1 ]
Spieksma, FM [1 ]
机构
[1] LEIDEN UNIV,DEPT MATH & COMP SCI,NL-2300 RA LEIDEN,NETHERLANDS
关键词
noncooperative Markov games; mu-geometric recurrence; equilibrium policies; value iteration; birth-death control;
D O I
10.1287/moor.22.3.588
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
The goal of this paper is to provide a theory of N-person Markov games with unbounded cost, for a countable state space and compact action spaces. We investigate both the finite and infinite horizon problems. For the latter, we consider the discounted cost as well as the expected average cost. We present conditions for the infinite horizon problems for which equilibrium policies exist for all players within the stationary policies, and show that the costs in equilibrium policies exist for all players within the stationary policies, and show that the costs in equilibrium satisfy the optimality equations. Similar results are obtained for the finite horizon costs, for which equilibrium policies are shown to exist for all players within the Markov policies. As special case of N-person games, we investigate the zero-sum (2 players) game, for which we establish the convergence of the value iteration algorithm. We conclude by studying an application of a zero-sum Markov game in a queueing model.
引用
收藏
页码:588 / 618
页数:31
相关论文
共 45 条