Probabilities Are Not Enough: Formal Controller Synthesis for Stochastic Dynamical Models with Epistemic Uncertainty

被引：0

作者：

Badings, Thom ^{[1
]}

Romao, Licio ^{[2
]}

Abate, Alessandro ^{[2
]}

Jansen, Nils ^{[1
]}

机构：

[1] Radboud Univ Nijmegen, Nijmegen, Netherlands

[2] Univ Oxford, Oxford, England

来源：

THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 12 | 2023年

基金：

英国工程与自然科学研究理事会; 欧洲研究理事会;

关键词：

REACHABILITY; VERIFICATION; SAFETY;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Capturing uncertainty in models of complex dynamical systems is crucial to designing safe controllers. Stochastic noise causes aleatoric uncertainty, whereas imprecise knowledge of model parameters leads to epistemic uncertainty. Several approaches use formal abstractions to synthesize policies that satisfy temporal specifications related to safety and reachability. However, the underlying models exclusively capture aleatoric but not epistemic uncertainty, and thus require that model parameters are known precisely. Our contribution to overcoming this restriction is a novel abstraction-based controller synthesis method for continuous-state models with stochastic noise and uncertain parameters. By sampling techniques and robust analysis, we capture both aleatoric and epistemic uncertainty, with a user-specified confidence level, in the transition probability intervals of a so-called interval Markov decision process (iMDP). We synthesize an optimal policy on this iMDP, which translates (with the specified confidence level) to a feedback controller for the continuous model with the same performance guarantees. Our experimental benchmarks confirm that accounting for epistemic uncertainty leads to controllers that are more robust against variations in parameter values.

引用

页码：14701 / 14710

页数：10

共 60 条

[51] Trustworthy artificial intelligence
Thiebes, Scott
Lins, Sebastian
Sunyaev, Ali
[J]. ELECTRONIC MARKETS, 2021, 31 (02) : 447 - 464
[52] Probabilistic robotics
Thrun, S
[J]. COMMUNICATIONS OF THE ACM, 2002, 45 (03) : 52 - 57
[53] Tsiamis A, 2019, IEEE DECIS CONTR P, P3648, DOI [10.1109/CDC40024.2019.9029499, 10.1109/cdc40024.2019.9029499]
[54] Vinod A. P., 2022, IEEE Transactions on Automatic Control., P1
[55] SReachTools: A MATLAB Stochastic Reachability Toolbox
Vinod, Abraham P.
Gleason, Joseph D.
Oishi, Meeko M. K.
[J]. PROCEEDINGS OF THE 2019 22ND ACM INTERNATIONAL CONFERENCE ON HYBRID SYSTEMS: COMPUTATION AND CONTROL (HSCC '19), 2019, : 33 - 38
[56] Distributionally Robust Convex Optimization
Wiesemann, Wolfram
Kuhn, Daniel
Sim, Melvyn
[J]. OPERATIONS RESEARCH, 2014, 62 (06) : 1358 - 1376
[57] Wolff EM, 2012, IEEE DECIS CONTR P, P3372, DOI 10.1109/CDC.2012.6426174
[58] Yedavalli R.K., 2014, AMC, V10, P12
[59] Safe Reinforcement Learning Using Robust MPC
Zanon, Mario
Gros, Sebastien
[J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2021, 66 (08) : 3638 - 3652
[60] Zikelic D., 2022, CoRR, abs/2210.05308.

← 1 2 3 4 5 6 →