A Risk-Sensitive Finite-Time Reachability Approach for Safety of Stochastic Dynamic Systems

被引:34
作者
Chapman, Margaret P. [1 ,3 ]
Lacotte, Jonathan [4 ]
Tamar, Aviv [1 ]
Lee, Donggun [5 ]
Smith, Kevin M. [6 ,7 ]
Cheng, Victoria [8 ]
Fisac, Jaime F. [1 ]
Jha, Susmit [2 ]
Pavone, Marco [9 ]
Tomlin, Claire J. [1 ]
机构
[1] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA
[2] SRI Int, Comp Sci Lab, 333 Ravenswood Ave, Menlo Pk, CA 94025 USA
[3] SRI Int, 333 Ravenswood Ave, Menlo Pk, CA 94025 USA
[4] Stanford Univ, Dept Elect Engn, Stanford, CA 94305 USA
[5] Univ Calif Berkeley, Dept Mech Engn, Berkeley, CA 94720 USA
[6] OptiRTC Inc, Boston, MA USA
[7] Tufts Univ, Dept Civil & Environm Engn, Medford, MA 02155 USA
[8] Univ Calif Berkeley, Dept Civil & Environm Engn, Berkeley, CA 94720 USA
[9] Stanford Univ, Dept Aeronaut & Astronaut, Stanford, CA 94305 USA
来源
2019 AMERICAN CONTROL CONFERENCE (ACC) | 2019年
基金
美国国家科学基金会;
关键词
CONSISTENT;
D O I
10.23919/acc.2019.8815169
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A classic reachability problem for safety of dynamic systems is to compute the set of initial states from which the state trajectory is guaranteed to stay inside a given constraint set over a given time horizon. In this paper, we leverage existing theory of reachability analysis and risk measures to devise a risk-sensitive reachability approach for safety of stochastic dynamic systems under non-adversarial disturbances over a finite time horizon. Specifically, we first introduce the notion of a risk-sensitive safe set as a set of initial states from which the risk of large constraint violations can be reduced to a required level via a control policy, where risk is quantified using the Conditional Value-at-Risk (CVaR) measure. Second, we show how the computation of a risk-sensitive safe set can be reduced to the solution to a Markov Decision Process (MDP), where cost is assessed according to CVaR. Third, leveraging this reduction, we devise a tractable algorithm to approximate a risk-sensitive safe set and provide arguments about its correctness. Finally, we present a realistic example inspired from stormwater catchment design to demonstrate the utility of risk-sensitive reachability analysis. In particular, our approach allows a practitioner to tune the level of risk sensitivity from worst-case (which is typical for Hamilton-Jacobi reachability analysis) to risk-neutral (which is the case for stochastic reachability analysis).
引用
收藏
页码:2958 / 2963
页数:6
相关论文
共 27 条
[11]  
Chow YL, 2014, P AMER CONTR CONF, P4204, DOI 10.1109/ACC.2014.6859437
[12]  
Chow Yinlam, 2015, ADV NEURAL INFORM PR, P1522, DOI DOI 10.48550/ARXIV.1506.02188
[13]  
Grant M., 2014, Cvx: Matlab software for disciplined convex programming
[14]   A CONVEX ANALYTIC APPROACH TO RISK-AWARE MARKOV DECISION PROCESSES [J].
Haskell, William B. ;
Jain, Rahul .
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2015, 53 (03) :1569-1598
[15]  
Herbert Sylvia L., 2017, 2017 IEEE 56th Annual Conference on Decision and Control (CDC), P1517, DOI 10.1109/CDC.2017.8263867
[16]  
Kisiala J., 2015, THESIS
[17]  
Mitchell IA, 2005, LECT NOTES COMPUT SC, V3414, P480
[18]  
Osogami Takayuki, 2012, Advances in Neural Information Processing Systems, V25, P233
[19]   Time-Consistent Decisions and Temporal Decomposition of Coherent Risk Functionals [J].
Pflug, Georg Ch. ;
Pichler, Alois .
MATHEMATICS OF OPERATIONS RESEARCH, 2016, 41 (02) :682-699
[20]  
Ratliff L. J., 2017, ARXIV170309842