Zero-Sum Discounted Reward Criterion Games for Piecewise Deterministic Markov Processes

被引：2

作者：

Costa, O. L. V. ^{[1
]}

Dufour, F. ^{[2
]}

机构：

[1] Univ Sao Paulo, Dept Engn Telecomunicacoes & Controle, Escola Politecn, BR-05508900 Sao Paulo, Brazil

[2] INRIA Bordeaux Sud Ouest, Inst Polytech Bordeaux, Inst Math Bordeaux, 200 Ave Vieille Tour, F-33405 Talence, France

来源：

APPLIED MATHEMATICS AND OPTIMIZATION | 2018年 / 78卷 / 03期

基金：

巴西圣保罗研究基金会;

关键词：

Zero sum games; Continuous-time; Discounted reward criterion; General borel spaces; TRANSITION-PROBABILITIES;

D O I：

10.1007/s00245-017-9416-2

中图分类号：

O29 [应用数学];

学科分类号：

070104 ;

摘要：

This papers deals with the zero-sum game with a discounted reward criterion for piecewise deterministic Markov process (PDMPs) in general Borel spaces. The two players can act on the jump rate and transition measure of the process, with the decisions being taken just after a jump of the process. The goal of this paper is to derive conditions for the existence of min-max strategies for the infinite horizon total expected discounted reward function, which is composed of running and boundary parts. The basic idea is, by using the special features of the PDMPs, to re-write the problem via an embedded discrete-time Markov chain associated to the PDMP and re-formulate the problem as a discrete-stage zero sum game problem.

引用

页码：587 / 611

页数：25

共 22 条

[1]

Costa O., 2013, Continuous average control of piecewise deterministic markov processes

[2]

Davis M. H. A., 1993, MARKOV MODELS OPTIMI

[3]

DAVIS MHA, 1984, J ROY STAT SOC B MET, V46, P353

[4] OPTIMAL CAPACITY EXPANSION UNDER UNCERTAINTY [J].