A policy-improvement type algorithm for solving zero-sum two-person stochastic games of perfect information

被引:10
|
作者
Raghavan, TES [1 ]
Syed, Z [1 ]
机构
[1] Univ Illinois, Dept Math Stat & Comp Sci, Chicago, IL 60680 USA
关键词
stochastic games; MDP; perfect information; policy iteration;
D O I
10.1007/s10107-002-0312-3
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
We give a policy-improvement type algorithm to locate an optimal pure stationary strategy for discounted stochastic games with perfect information. A graph theoretic motivation for our algorithm is presented as well.
引用
收藏
页码:513 / 532
页数:20
相关论文
共 30 条