Minimizing Risk Models in Denumerable Semi-Markov Decision Processes with a Target Set

被引:0
|
作者
Huang Yonghui [1 ]
Guo Xianping [1 ]
机构
[1] Sun Yat Sen Univ, Sch Math & Computat Sci, Guangzhou 510275, Guangdong, Peoples R China
来源
PROCEEDINGS OF THE 29TH CHINESE CONTROL CONFERENCE | 2010年
关键词
Semi-Markov Decision Processes; Target Set; Risk Probability; Optimality Equation; Optimal Policy; POLICIES; PROBABILITY; TIME;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper deals with the risk minimization problem in semi-Markov decision processes with denumerable states. The criterion to be minimized is the risk probability that a total reward over a first passage time to some target set doesn't exceed a level. We first characterize the optimal value function, and then establish the optimality equation and the existence of optimal policies under mild conditions. Moreover, we give some sufficient conditions for the existence of an optimal policy, and these conditions are imposed on the primitive data of the model and are thus easy to verify. Finally, a numerical example is given to illustrate our results.
引用
收藏
页码:1576 / 1581
页数:6
相关论文
共 50 条