Minimizing Risk Models in Denumerable Semi-Markov Decision Processes with a Target Set

被引：0

作者：

Huang Yonghui ^{[1
]}

Guo Xianping ^{[1
]}

机构：

[1] Sun Yat Sen Univ, Sch Math & Computat Sci, Guangzhou 510275, Guangdong, Peoples R China

来源：

PROCEEDINGS OF THE 29TH CHINESE CONTROL CONFERENCE | 2010年

关键词：

Semi-Markov Decision Processes; Target Set; Risk Probability; Optimality Equation; Optimal Policy; POLICIES; PROBABILITY; TIME;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper deals with the risk minimization problem in semi-Markov decision processes with denumerable states. The criterion to be minimized is the risk probability that a total reward over a first passage time to some target set doesn't exceed a level. We first characterize the optimal value function, and then establish the optimality equation and the existence of optimal policies under mild conditions. Moreover, we give some sufficient conditions for the existence of an optimal policy, and these conditions are imposed on the primitive data of the model and are thus easy to verify. Finally, a numerical example is given to illustrate our results.

引用

页码：1576 / 1581

页数：6

共 50 条

[31] Computing semi-stationary optimal policies for multichain semi-Markov decision processes
Mondal, Prasenjit
ANNALS OF OPERATIONS RESEARCH, 2020, 287 (02) : 843 - 865
[32] Dynamical fluctuations for semi-Markov processes
Maes, Christian
Netocny, Karel
Wynants, Bram
JOURNAL OF PHYSICS A-MATHEMATICAL AND THEORETICAL, 2009, 42 (36)
[33] New Average Optimality Conditions for Semi-Markov Decision Processes in Borel Spaces
Qingda Wei
Xianping Guo
Journal of Optimization Theory and Applications, 2012, 153 : 709 - 732
[34] New Average Optimality Conditions for Semi-Markov Decision Processes in Borel Spaces
Wei, Qingda
Guo, Xianping
JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2012, 153 (03) : 709 - 732
[35] Performance Sensitivity Analysis and Optimization for a Class of Countable Semi-Markov Decision Processes
Kang, Yu
Yin, Baoqun
Shang, Weike
Xi, Hongsheng
2011 9TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA 2011), 2011, : 799 - 804
[36] Performance optimization of semi-Markov decision processes with discounted-cost criteria
Yin, Baoqun
Li, Yanjie
Zhou, Yaping
Xi, Hongsheng
EUROPEAN JOURNAL OF CONTROL, 2008, 14 (03) : 213 - 222
[37] MINIMIZING RISK PROBABILITY FOR INFINITE DISCOUNTED PIECEWISE DETERMINISTIC MARKOV DECISION PROCESSES
Huo, Haifeng
Cui, Jinhua
Wen, Xian
KYBERNETIKA, 2024, 60 (03) : 357 - 378
[38] Mean-Variance Problems for Finite Horizon Semi-Markov Decision Processes
Huang, Yonghui
Guo, Xianping
APPLIED MATHEMATICS AND OPTIMIZATION, 2015, 72 (02) : 233 - 259
[39] Adaptive Honeypot Engagement Through Reinforcement Learning of Semi-Markov Decision Processes
Huang, Linan
Zhu, Quanyan
DECISION AND GAME THEORY FOR SECURITY, 2019, 11836 : 196 - 216
[40] ON THE 2ND OPTIMALITY EQUATION FOR SEMI-MARKOV DECISION-MODELS
SCHAL, M
MATHEMATICS OF OPERATIONS RESEARCH, 1992, 17 (02) : 470 - 486

← 1 2 3 4 5 →