Contextual Anomaly Detection in Big Sensor Data

被引:48
作者
Hayes, Michael A. [1 ]
Capretz, Miriam A. M. [1 ]
机构
[1] Univ Western Ontario, Dept Elect & Comp Engn, London, ON N6A 5B9, Canada
来源
2014 IEEE INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS) | 2014年
关键词
Big Data Analytics; Contextual Anomaly Detection; Predictive Modelling; Multivariate Clustering;
D O I
10.1109/BigData.Congress.2014.19
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Performing predictive modelling, such as anomaly detection, in Big Data is a difficult task. This problem is compounded as more and more sources of Big Data are generated from environmental sensors, logging applications, and the Internet of Things. Further, most current techniques for anomaly detection only consider the content of the data source, i.e. the data itself, without concern for the context of the data. As data becomes more complex it is increasingly important to bias anomaly detection techniques for the context, whether it is spatial, temporal, or semantic. The work proposed in this paper outlines a contextual anomaly detection technique for use in streaming sensor networks. The technique uses a well-defined content anomaly detection algorithm for real-time point anomaly detection. Additionally, we present a post-processing context-aware anomaly detection algorithm based on sensor profiles, which are groups of contextually similar sensors generated by a multivariate clustering algorithm. Our proposed research has been implemented and evaluated with real-world data provided by Powersmiths, located in Brampton, Ontario, Canada.
引用
收藏
页码:64 / 71
页数:8
相关论文
共 17 条
[1]  
[Anonymous], POW POW FUT
[2]  
[Anonymous], P SIAM C DAT MIN
[3]  
Arthur D., 2007, P 18 ANN ACM SIAM S, DOI DOI 10.1145/1283383.1283494
[4]  
Dean J, 2004, USENIX ASSOCIATION PROCEEDINGS OF THE SIXTH SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDE '04), P137
[5]  
Hartigan J. A., 1979, Applied Statistics, V28, P100, DOI 10.2307/2346830
[6]  
He ZY, 2004, LECT NOTES COMPUT SC, V3129, P589
[7]   Anomaly detection in streaming environmental sensor data: A data-driven modeling approach [J].
Hill, David J. ;
Minsker, Barbara S. .
ENVIRONMENTAL MODELLING & SOFTWARE, 2010, 25 (09) :1014-1022
[8]   Real-time Bayesian anomaly detection in streaming environmental data [J].
Hill, David J. ;
Minsker, Barbara S. ;
Amir, Eyal .
WATER RESOURCES RESEARCH, 2009, 45
[9]   Detecting Anomaly Teletraffic Using Stochastic Self-similarity Based on Hadoop [J].
Lee, JongSuk R. ;
Ye, Sang-Kug ;
Jeong, Hae-Duck J. .
2013 16TH INTERNATIONAL CONFERENCE ON NETWORK-BASED INFORMATION SYSTEMS (NBIS 2013), 2013, :282-287
[10]  
Machine Learning Group, 2012, WEK 3 DAT MIN OP SOU