Minimizing Risk Models in Denumerable Semi-Markov Decision Processes with a Target Set

被引：0

作者：

Huang Yonghui ^{[1
]}

Guo Xianping ^{[1
]}

机构：

[1] Sun Yat Sen Univ, Sch Math & Computat Sci, Guangzhou 510275, Guangdong, Peoples R China

来源：

PROCEEDINGS OF THE 29TH CHINESE CONTROL CONFERENCE | 2010年

关键词：

Semi-Markov Decision Processes; Target Set; Risk Probability; Optimality Equation; Optimal Policy; POLICIES; PROBABILITY; TIME;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper deals with the risk minimization problem in semi-Markov decision processes with denumerable states. The criterion to be minimized is the risk probability that a total reward over a first passage time to some target set doesn't exceed a level. We first characterize the optimal value function, and then establish the optimality equation and the existence of optimal policies under mild conditions. Moreover, we give some sufficient conditions for the existence of an optimal policy, and these conditions are imposed on the primitive data of the model and are thus easy to verify. Finally, a numerical example is given to illustrate our results.

引用

页码：1576 / 1581

页数：6

共 50 条

[1] OPTIMIZATION OF DENUMERABLE SEMI-MARKOV DECISION PROCESSES.
Staniewski, Piotr
Weinfeld, Roman
Systems Science, 1980, 6 (02): : 129 - 141
[2] First passage models for denumerable semi-Markov decision processes with nonnegative discounted costs
Yong-hui Huang
Guo Xian-ping
Acta Mathematicae Applicatae Sinica, English Series, 2011, 27 : 177 - 190
[3] First passage models for denumerable semi-Markov decision processes with nonnegative discounted costs
Huang, Yong-hui
Guo, Xian-ping
ACTA MATHEMATICAE APPLICATAE SINICA-ENGLISH SERIES, 2011, 27 (02): : 177 - 190
[4] DENUMERABLE UNDISCOUNTED SEMI-MARKOV DECISION-PROCESSES WITH UNBOUNDED REWARDS
FEDERGRUEN, A
SCHWEITZER, PJ
TIJMS, HC
MATHEMATICS OF OPERATIONS RESEARCH, 1983, 8 (02) : 298 - 313
[5] DENUMERABLE SEMI-MARKOV DECISION CHAINS WITH SMALL INTEREST RATES
Dekker, Rommert
Hordijk, Arie
ANNALS OF OPERATIONS RESEARCH, 1991, 28 (01) : 185 - 211
[6] CONTINUITY OF MEAN RECURRENCE TIMES IN DENUMERABLE SEMI-MARKOV PROCESSES
DEPPE, H
ZEITSCHRIFT FUR WAHRSCHEINLICHKEITSTHEORIE UND VERWANDTE GEBIETE, 1985, 69 (04): : 581 - 592
[7] Average criteria in denumerable semi-Markov decision chains under risk-aversion
Rolando Cavazos-Cadena
Hugo Cruz-Suárez
Raúl Montes-De-Oca
Discrete Event Dynamic Systems, 2023, 33 : 221 - 256
[8] TRUNCATION APPROXIMATION OF LIMIT PROBABILITIES FOR DENUMERABLE SEMI-MARKOV PROCESSES
TWEEDIE, RL
JOURNAL OF APPLIED PROBABILITY, 1975, 12 (01) : 161 - 163
[9] Optimal risk probability for first passage models in semi-Markov decision processes
Huang, Yonghui
Guo, Xianping
JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 2009, 359 (01) : 404 - 420
[10] Risk-aware semi-Markov decision processes
Isohaetaelae, Jukka
Haskell, William B.
2017 IEEE 56TH ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2017,

← 1 2 3 4 5 →