Behavior monitoring under uncertainty using Bayesian surprise and optimal action selection

被引:6
作者
Avila, Luis [1 ]
Martinez, Ernesto [1 ]
机构
[1] Consejo Nacl Invest Cient & Tecn, INGAR, UTN, Santa Fe, Gjc, Argentina
关键词
Artificial pancreas; Bayesian surprise; Behavior monitoring; Kullback-Leibler divergence; Optimal action selection; GLUCOSE DYNAMICS; BLOOD-GLUCOSE; INSULIN; SIMULATION; MODEL;
D O I
10.1016/j.eswa.2014.04.031
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The increasing trend towards delegating tasks to autonomous artificial agents in safety-critical socio-technical systems makes monitoring an action selection policy of paramount importance. Agent behavior monitoring may profit from a stochastic specification of an optimal policy under uncertainty. A probabilistic monitoring approach is proposed to assess if an agent behavior (or policy) respects its specification. The desired policy is modeled by a prior distribution for state transitions in an optimally-controlled stochastic process. Bayesian surprise is defined as the Kullback-Leibler divergence between the state transition distribution for the observed behavior and the distribution for optimal action selection. To provide a sensitive on-line estimation of Bayesian surprise with small samples twin Gaussian processes are used. Timely detection of a deviant behavior or anomaly in an artificial pancreas highlights the sensitivity of Bayesian surprise to a meaningful discrepancy regarding the stochastic optimal policy when there exist excessive glycemic variability, sensor errors, controller ill-tuning and infusion pump malfunctioning. To reject outliers and leave out redundant information, on-line sparsification of data streams is proposed. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:6327 / 6345
页数:19
相关论文
共 54 条
[1]   Blood glucose regulation with stochastic optimal control for insulin-dependent diabetic patients [J].
Acikgoz, Saadet Ulas ;
Diwekar, Urmila M. .
CHEMICAL ENGINEERING SCIENCE, 2010, 65 (03) :1227-1236
[2]  
[Anonymous], 2009, Vision Res., DOI [DOI 10.1016/J.VISRES.2008.09.007, 10.1016/j.visres.2008.09.007]
[3]  
[Anonymous], 2002, Methods Appl. Anal, DOI DOI 10.4310/MAA.2002.V9.N3.A8
[4]  
Baldi P, 2002, KLUW COMMUN, V687, P1
[5]   Of bits and wows: A Bayesian theory of surprise with applications to attention [J].
Baldi, Pierre ;
Itti, Laurent .
NEURAL NETWORKS, 2010, 23 (05) :649-666
[6]  
Bayarri M., 1997, DUKE U I STAT DECISI, P97
[7]   COMPUTER-SIMULATION OF PLASMA-INSULIN AND GLUCOSE DYNAMICS AFTER SUBCUTANEOUS INSULIN INJECTION [J].
BERGER, M ;
RODBARD, D .
DIABETES CARE, 1989, 12 (10) :725-736
[8]   Twin Gaussian Processes for Structured Prediction [J].
Bo, Liefeng ;
Sminchisescu, Cristian .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 87 (1-2) :28-52
[9]   FACTORS INVOLVED IN CATHETER OBSTRUCTION DURING LONG-TERM PERITONEAL INSULIN INFUSION [J].
BOUSQUETROUAUD, R ;
CASTEX, F ;
COSTALAT, G ;
BASTIDE, M ;
HEDON, B ;
BOUANANI, M ;
JOUVERT, S ;
MIROUZE, J .
DIABETES CARE, 1993, 16 (05) :801-805
[10]   Is blood glucose predictable from previous values? A solicitation for data [J].
Bremer, T ;
Gough, DA .
DIABETES, 1999, 48 (03) :445-451