Probabilities Are Not Enough: Formal Controller Synthesis for Stochastic Dynamical Models with Epistemic Uncertainty

被引:0
作者
Badings, Thom [1 ]
Romao, Licio [2 ]
Abate, Alessandro [2 ]
Jansen, Nils [1 ]
机构
[1] Radboud Univ Nijmegen, Nijmegen, Netherlands
[2] Univ Oxford, Oxford, England
来源
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 12 | 2023年
基金
英国工程与自然科学研究理事会; 欧洲研究理事会;
关键词
REACHABILITY; VERIFICATION; SAFETY;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Capturing uncertainty in models of complex dynamical systems is crucial to designing safe controllers. Stochastic noise causes aleatoric uncertainty, whereas imprecise knowledge of model parameters leads to epistemic uncertainty. Several approaches use formal abstractions to synthesize policies that satisfy temporal specifications related to safety and reachability. However, the underlying models exclusively capture aleatoric but not epistemic uncertainty, and thus require that model parameters are known precisely. Our contribution to overcoming this restriction is a novel abstraction-based controller synthesis method for continuous-state models with stochastic noise and uncertain parameters. By sampling techniques and robust analysis, we capture both aleatoric and epistemic uncertainty, with a user-specified confidence level, in the transition probability intervals of a so-called interval Markov decision process (iMDP). We synthesize an optimal policy on this iMDP, which translates (with the specified confidence level) to a feedback controller for the continuous model with the same performance guarantees. Our experimental benchmarks confirm that accounting for epistemic uncertainty leads to controllers that are more robust against variations in parameter values.
引用
收藏
页码:14701 / 14710
页数:10
相关论文
共 60 条
  • [51] Trustworthy artificial intelligence
    Thiebes, Scott
    Lins, Sebastian
    Sunyaev, Ali
    [J]. ELECTRONIC MARKETS, 2021, 31 (02) : 447 - 464
  • [52] Probabilistic robotics
    Thrun, S
    [J]. COMMUNICATIONS OF THE ACM, 2002, 45 (03) : 52 - 57
  • [53] Tsiamis A, 2019, IEEE DECIS CONTR P, P3648, DOI [10.1109/CDC40024.2019.9029499, 10.1109/cdc40024.2019.9029499]
  • [54] Vinod A. P., 2022, IEEE Transactions on Automatic Control., P1
  • [55] SReachTools: A MATLAB Stochastic Reachability Toolbox
    Vinod, Abraham P.
    Gleason, Joseph D.
    Oishi, Meeko M. K.
    [J]. PROCEEDINGS OF THE 2019 22ND ACM INTERNATIONAL CONFERENCE ON HYBRID SYSTEMS: COMPUTATION AND CONTROL (HSCC '19), 2019, : 33 - 38
  • [56] Distributionally Robust Convex Optimization
    Wiesemann, Wolfram
    Kuhn, Daniel
    Sim, Melvyn
    [J]. OPERATIONS RESEARCH, 2014, 62 (06) : 1358 - 1376
  • [57] Wolff EM, 2012, IEEE DECIS CONTR P, P3372, DOI 10.1109/CDC.2012.6426174
  • [58] Yedavalli R.K., 2014, AMC, V10, P12
  • [59] Safe Reinforcement Learning Using Robust MPC
    Zanon, Mario
    Gros, Sebastien
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2021, 66 (08) : 3638 - 3652
  • [60] Zikelic D., 2022, CoRR, abs/2210.05308.