Reachability analysis of uncertain systems using bounded-parameter Markov decision processes

被引:31
作者
Wu, Di [1 ]
Koutsoukos, Xenofon [1 ]
机构
[1] Vanderbilt Univ, Dept EECS, Nashville, TN 37235 USA
基金
美国国家科学基金会;
关键词
reachability analysis; uncertain systems; Markov decision processes;
D O I
10.1016/j.artint.2007.12.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Verification of reachability properties for probabilistic systems is usually based on variants of Markov processes. Current methods assume an exact model of the dynamic behavior and are not suitable for realistic systems that operate in the presence of uncertainty and variability. This research note extends existing methods for Bounded-parameter Markov Decision Processes (BMDPs) to solve the reachability problem. BMDPs are a generalization of MDPs that allows modeling uncertainty. Our results show that interval value iteration converges in the case of an undiscounted reward criterion that is required to formulate the problems of maximizing the probability of reaching a set of desirable states or minimizing the probability of reaching an unsafe set. Analysis of the computational complexity is also presented. (C) 2007 Elsevier B.V. All rights reserved.
引用
收藏
页码:945 / 954
页数:10
相关论文
共 24 条
[1]  
BAGNELL J, 2001, CMURITR0125
[2]  
Bertsekas Dimitri P., 1989, PARALLEL DISTRIBUTED
[3]  
Buffet O, 2005, PROC INT C TOOLS ART, P515
[4]   Markov decision processes and regular events [J].
Courcoubetis, C ;
Yannakakis, M .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1998, 43 (10) :1399-1418
[5]  
DEALFARO L, 1997, STANCSTR981601
[6]  
Feinberg EA, 2002, HDB MARKOV DECISION
[7]   Bounded-parameter markov decision processes [J].
Givan, R ;
Leach, S ;
Dean, T .
ARTIFICIAL INTELLIGENCE, 2000, 122 (1-2) :71-109
[8]  
HEATH J, 2006, P INT C COMP METH SY
[9]   Probabilistic decision graphs - Combining verification and AI techniques for probabilistic inference [J].
Jaeger, M .
INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2004, 12 :19-42
[10]  
KALYANASUNDARAM S, 2002, P 41 IEEE C DEC CONT