A multivariate Bayesian scan statistic for early event detection and characterization

被引:46
作者
Neill, Daniel B. [1 ]
Cooper, Gregory F. [2 ]
机构
[1] Carnegie Mellon Univ, Sch Publ Policy & Management, HJ Heinz III Coll, Pittsburgh, PA 15213 USA
[2] Univ Pittsburgh, Dept Biomed Informat, Pittsburgh, PA 15260 USA
基金
美国国家科学基金会;
关键词
Event detection; Event characterization; Biosurveillance; Scan statistics;
D O I
10.1007/s10994-009-5144-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present the multivariate Bayesian scan statistic (MBSS), a general framework for event detection and characterization in multivariate spatial time series data. MBSS integrates prior information and observations from multiple data streams in a principled Bayesian framework, computing the posterior probability of each type of event in each space-time region. MBSS learns a multivariate Gamma-Poisson model from historical data, and models the effects of each event type on each stream using expert knowledge or labeled training examples. We evaluate MBSS on various disease surveillance tasks, detecting and characterizing outbreaks injected into three streams of Pennsylvania medication sales data. We demonstrate that MBSS can be used both as a "general" event detector, with high detection power across a variety of event types, and a "specific" detector that incorporates prior knowledge of an event's effects to achieve much higher detection power. MBSS has many other advantages over previous event detection approaches, including faster computation and easy interpretation and visualization of results, and allows faster and more accurate event detection by integrating information from the multiple streams. Most importantly, MBSS can model and differentiate between multiple event types, thus distinguishing between events requiring urgent responses and other, less relevant patterns in the data.
引用
收藏
页码:261 / 282
页数:22
相关论文
共 33 条
[1]  
Buckeridge David L, 2004, MMWR Suppl, V53, P137
[2]  
Burkom Howard S, 2005, MMWR Suppl, V54, P55
[3]  
Burkom HS, 2003, J URBAN HEALTH, V80, pI57
[4]   EMPIRICAL BAYES ESTIMATES OF AGE-STANDARDIZED RELATIVE RISKS FOR USE IN DISEASE MAPPING [J].
CLAYTON, D ;
KALDOR, J .
BIOMETRICS, 1987, 43 (03) :671-681
[5]  
COOPER GF, 2007, ADV DIS SURVEILLANCE, V2, P45
[6]  
COOPER GF, 2004, P C UNC ART INT
[7]   A simulated annealing strategy for the detection of arbitrarily shaped spatial clusters [J].
Duczmal, L ;
Assunçao, R .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2004, 45 (02) :269-286
[8]  
JIANG X, 2008, BAYESIAN NETWORK MOD
[9]   A model-adjusted space-time scan statistic with an application to syndromic surveillance [J].
Kleinman, KP ;
Abrams, AM ;
Kulldorff, M ;
Platt, R .
EPIDEMIOLOGY AND INFECTION, 2005, 133 (03) :409-419
[10]   A space-time permutation scan statistic for disease outbreak detection [J].
Kulldorff, M ;
Heffernan, R ;
Hartman, J ;
Assunçao, R ;
Mostashari, F .
PLOS MEDICINE, 2005, 2 (03) :216-224