An active inference approach to on-line agent monitoring in safety-critical systems

Cited by: 6
Authors
Avila, Luis [1]
Martinez, Ernesto [1]
Affiliations
[1] INGAR CONICET UTN, Rosario, Santa Fe, Argentina
Keywords
Active inference; Bayesian surprise; On-line monitoring; Twin Gaussian processes; MODEL;
DOI
10.1016/j.aei.2015.07.008
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The current trend towards integrating software agents into safety-critical systems such as drones, autonomous cars and medical devices, which must operate in uncertain environments, creates a need for on-line detection of unexpected behavior. In this work, on-line monitoring is carried out by comparing environmental state transitions with prior beliefs that describe optimal behavior. The agent policy is computed analytically using linearly solvable Markov decision processes. Active inference using prior beliefs allows a monitor to proactively rehearse future agent actions on-line over a rolling horizon, generating expectations against which surprising behaviors can be discovered. A Bayesian surprise metric based on twin Gaussian processes is proposed to measure the difference between prior and posterior beliefs about state transitions in the agent's environment. Using a sliding window of sampled data, beliefs are updated a posteriori by comparing a sequence of state transitions with those predicted using the optimal policy. An artificial pancreas for diabetic patients is used as a representative example. (C) 2015 Elsevier Ltd. All rights reserved.
Pages: 1083-1095
Number of pages: 13