Zero-Sum Discounted Reward Criterion Games for Piecewise Deterministic Markov Processes

被引:2
作者
Costa, O. L. V. [1 ]
Dufour, F. [2 ]
机构
[1] Univ Sao Paulo, Dept Engn Telecomunicacoes & Controle, Escola Politecn, BR-05508900 Sao Paulo, Brazil
[2] INRIA Bordeaux Sud Ouest, Inst Polytech Bordeaux, Inst Math Bordeaux, 200 Ave Vieille Tour, F-33405 Talence, France
基金
巴西圣保罗研究基金会;
关键词
Zero sum games; Continuous-time; Discounted reward criterion; General borel spaces; TRANSITION-PROBABILITIES;
D O I
10.1007/s00245-017-9416-2
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
This papers deals with the zero-sum game with a discounted reward criterion for piecewise deterministic Markov process (PDMPs) in general Borel spaces. The two players can act on the jump rate and transition measure of the process, with the decisions being taken just after a jump of the process. The goal of this paper is to derive conditions for the existence of min-max strategies for the infinite horizon total expected discounted reward function, which is composed of running and boundary parts. The basic idea is, by using the special features of the PDMPs, to re-write the problem via an embedded discrete-time Markov chain associated to the PDMP and re-formulate the problem as a discrete-stage zero sum game problem.
引用
收藏
页码:587 / 611
页数:25
相关论文
共 22 条
[1]  
Costa O., 2013, Continuous average control of piecewise deterministic markov processes
[2]  
Davis M. H. A., 1993, MARKOV MODELS OPTIMI
[3]  
DAVIS MHA, 1984, J ROY STAT SOC B MET, V46, P353
[4]   OPTIMAL CAPACITY EXPANSION UNDER UNCERTAINTY [J].
DAVIS, MHA ;
DEMPSTER, MAH ;
SETHI, SP ;
VERMES, D .
ADVANCES IN APPLIED PROBABILITY, 1987, 19 (01) :156-176
[6]  
Filar J., 1997, Competitive Markov Decision Processes
[7]   MINIMAX CONTROL OF DISCRETE-TIME STOCHASTIC SYSTEMS [J].
Gonzalez-Trejo, J. I. ;
Hernandez-Lerma, O. ;
Hoyos-Reyes, L. F. .
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2003, 41 (05) :1626-1659
[8]   Zero-sum games for continuous-time jump Markov processes in polish spaces: Discounted payoffs [J].
Guo, Xianping ;
Hernandez-Lerma, Onesimo .
ADVANCES IN APPLIED PROBABILITY, 2007, 39 (03) :645-668
[9]   New optimality conditions for average-payoff continuous-time Markov games in Polish spaces [J].
Guo XianPing ;
Hernandez-Lerma, Onesimo .
SCIENCE CHINA-MATHEMATICS, 2011, 54 (04) :793-816
[10]  
Hernández-Lerma O, 2000, SIAM J CONTROL OPTIM, V39, P1520