Contraction conditions for average and alpha-discount optimality in countable state Markov games with unbounded rewards

被引：41

作者：

Altman, E ^{[1
]}

Hordijk, A ^{[1
]}

Spieksma, FM ^{[1
]}

机构：

[1] LEIDEN UNIV,DEPT MATH & COMP SCI,NL-2300 RA LEIDEN,NETHERLANDS

来源：

MATHEMATICS OF OPERATIONS RESEARCH | 1997年 / 22卷 / 03期

关键词：

noncooperative Markov games; mu-geometric recurrence; equilibrium policies; value iteration; birth-death control;

D O I：

10.1287/moor.22.3.588

中图分类号：

C93 [管理学]; O22 [运筹学];

学科分类号：

070105 ; 12 ; 1201 ; 1202 ; 120202 ;

摘要：

The goal of this paper is to provide a theory of N-person Markov games with unbounded cost, for a countable state space and compact action spaces. We investigate both the finite and infinite horizon problems. For the latter, we consider the discounted cost as well as the expected average cost. We present conditions for the infinite horizon problems for which equilibrium policies exist for all players within the stationary policies, and show that the costs in equilibrium policies exist for all players within the stationary policies, and show that the costs in equilibrium satisfy the optimality equations. Similar results are obtained for the finite horizon costs, for which equilibrium policies are shown to exist for all players within the Markov policies. As special case of N-person games, we investigate the zero-sum (2 players) game, for which we establish the convergence of the value iteration algorithm. We conclude by studying an application of a zero-sum Markov game in a queueing model.

引用

页码：588 / 618

页数：31

共 45 条

[21]

Hordijk A., 1983, International Journal of Game Theory, V12, P81, DOI 10.1007/BF01774298

[22] ASYMPTOTIC-BEHAVIOR OF MINIMAL TOTAL EXPECTED COST FOR DENUMERABLE STATE MARKOV DECISION MODEL [J].

HORDIJK, A ;

SCHWEITZER, PJ ;

TIJMS, H .

JOURNAL OF APPLIED PROBABILITY, 1975, 12 (02) :298-305

[23] APPLYING A NEW DEVICE IN OPTIMIZATION OF EXPONENTIAL QUEUING SYSTEMS [J].

LIPPMAN, SA .

OPERATIONS RESEARCH, 1975, 23 (04) :687-710

[24] BOREL STOCHASTIC GAMES WITH LIM SUP PAYOFF [J].

MAITRA, A ;

SUDDERTH, W .

ANNALS OF PROBABILITY, 1993, 21 (02) :861-885

[25]

Majumdar M., 1991, GAME THEORY MATH PRO, P175

[26]

Mertens J.-F., 1981, International Journal of Game Theory, V10, P53, DOI 10.1007/BF01769259

[27] EXISTENCE OF STATIONARY CORRELATED EQUILIBRIA WITH SYMMETRICAL INFORMATION FOR DISCOUNTED STOCHASTIC GAMES [J].

NOWAK, AS ;

RAGHAVAN, TES .

MATHEMATICS OF OPERATIONS RESEARCH, 1992, 17 (03) :519-526

[28]

NOWAK AS, 1994, ADV DYNAMIC GAMES AP, V1, P231

[29] EXISTENCE OF STATIONARY EQUILIBRIUM STRATEGIES IN NON-ZERO SUM DISCOUNTED STOCHASTIC GAMES WITH UNCOUNTABLE STATE-SPACE AND STATE-INDEPENDENT TRANSITIONS [J].

PARTHASARATHY, T ;

SINHA, S .

INTERNATIONAL JOURNAL OF GAME THEORY, 1989, 18 (02) :189-194

[30]

PARTHASARATHY T, 1977, DIFFERENTIAL GAMES C, V2, P1

← 1 2 3 4 5 →