Minimizing Risk Models in Denumerable Semi-Markov Decision Processes with a Target Set

被引:0
作者
Huang Yonghui [1 ]
Guo Xianping [1 ]
机构
[1] Sun Yat Sen Univ, Sch Math & Computat Sci, Guangzhou 510275, Guangdong, Peoples R China
来源
PROCEEDINGS OF THE 29TH CHINESE CONTROL CONFERENCE | 2010年
关键词
Semi-Markov Decision Processes; Target Set; Risk Probability; Optimality Equation; Optimal Policy; POLICIES; PROBABILITY; TIME;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper deals with the risk minimization problem in semi-Markov decision processes with denumerable states. The criterion to be minimized is the risk probability that a total reward over a first passage time to some target set doesn't exceed a level. We first characterize the optimal value function, and then establish the optimality equation and the existence of optimal policies under mild conditions. Moreover, we give some sufficient conditions for the existence of an optimal policy, and these conditions are imposed on the primitive data of the model and are thus easy to verify. Finally, a numerical example is given to illustrate our results.
引用
收藏
页码:1576 / 1581
页数:6
相关论文
共 50 条
  • [31] Computing semi-stationary optimal policies for multichain semi-Markov decision processes
    Mondal, Prasenjit
    ANNALS OF OPERATIONS RESEARCH, 2020, 287 (02) : 843 - 865
  • [32] Dynamical fluctuations for semi-Markov processes
    Maes, Christian
    Netocny, Karel
    Wynants, Bram
    JOURNAL OF PHYSICS A-MATHEMATICAL AND THEORETICAL, 2009, 42 (36)
  • [33] New Average Optimality Conditions for Semi-Markov Decision Processes in Borel Spaces
    Qingda Wei
    Xianping Guo
    Journal of Optimization Theory and Applications, 2012, 153 : 709 - 732
  • [34] New Average Optimality Conditions for Semi-Markov Decision Processes in Borel Spaces
    Wei, Qingda
    Guo, Xianping
    JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2012, 153 (03) : 709 - 732
  • [35] Performance Sensitivity Analysis and Optimization for a Class of Countable Semi-Markov Decision Processes
    Kang, Yu
    Yin, Baoqun
    Shang, Weike
    Xi, Hongsheng
    2011 9TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA 2011), 2011, : 799 - 804
  • [36] Performance optimization of semi-Markov decision processes with discounted-cost criteria
    Yin, Baoqun
    Li, Yanjie
    Zhou, Yaping
    Xi, Hongsheng
    EUROPEAN JOURNAL OF CONTROL, 2008, 14 (03) : 213 - 222
  • [37] MINIMIZING RISK PROBABILITY FOR INFINITE DISCOUNTED PIECEWISE DETERMINISTIC MARKOV DECISION PROCESSES
    Huo, Haifeng
    Cui, Jinhua
    Wen, Xian
    KYBERNETIKA, 2024, 60 (03) : 357 - 378
  • [38] Mean-Variance Problems for Finite Horizon Semi-Markov Decision Processes
    Huang, Yonghui
    Guo, Xianping
    APPLIED MATHEMATICS AND OPTIMIZATION, 2015, 72 (02) : 233 - 259
  • [39] Adaptive Honeypot Engagement Through Reinforcement Learning of Semi-Markov Decision Processes
    Huang, Linan
    Zhu, Quanyan
    DECISION AND GAME THEORY FOR SECURITY, 2019, 11836 : 196 - 216