Anomaly detection in streaming environmental sensor data: A data-driven modeling approach

被引:182
作者
Hill, David J. [1 ]
Minsker, Barbara S. [2 ]
机构
[1] Rutgers State Univ, Dept Civil & Environm Engn, Piscataway, NJ 08854 USA
[2] Univ Illinois, Dept Civil & Environm Engn, Urbana, IL 61801 USA
关键词
Coastal environment; Data-driven modeling; Anomaly detection; Machine learning; Real-time data; Sensor networks; Data quality control; Artificial intelligence; TIME; VALIDATION;
D O I
10.1016/j.envsoft.2009.08.010
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The deployment of environmental sensors has generated an interest in real-time applications of the data they collect. This research develops a real-time anomaly detection method for environmental data streams that can be used to identify data that deviate from historical patterns. The method is based on an autoregressive data-driven model of the data stream and its corresponding prediction interval. It performs fast, incremental evaluation of data as it becomes available, scales to large quantities of data, and requires no pre-classification of anomalies. Furthermore, this method can be easily deployed on a large heterogeneous sensor network. Sixteen instantiations of this method are compared based on their ability to identify measurement errors in a windspeed data stream from Corpus Christi, Texas. The results indicate that a multilayer perceptron model of the data stream, coupled with replacement of anomalous data points, performs well at identifying erroneous data in this data stream. (C) 2009 Published by Elsevier Ltd.
引用
收藏
页码:1014 / 1022
页数:9
相关论文
共 40 条
[1]   Hydrologic applications of MRAN algorithm [J].
Amenu, Geremew G. ;
Markus, Momcilo ;
Kumar, Praveen ;
Demissie, Misganaw .
JOURNAL OF HYDROLOGIC ENGINEERING, 2007, 12 (01) :124-129
[2]  
[Anonymous], 2001, Credit scoring and credit control IN
[3]  
[Anonymous], INTRO TIME SERIES FO
[4]  
[Anonymous], 1986, PARALLEL DISTRIBUTED
[5]  
[Anonymous], 2011, Pei. data mining concepts and techniques
[6]  
[Anonymous], 2000, Pattern Classification
[7]  
[Anonymous], P IEEE SENS
[8]   Weather analogue: A tool for real-time prediction of daily weather data realizations based on a modified k-nearest neighbor approach [J].
Bannayan, Mohammad ;
Hoogenboom, Gerrit .
ENVIRONMENTAL MODELLING & SOFTWARE, 2008, 23 (06) :703-713
[9]  
Belle R., 1983, Nuclear Technology, V64, P70
[10]  
Bifet A, 2006, LECT NOTES ARTIF INT, V4265, P29