Anomaly detection based on a dynamic Markov model

被引:56
作者
Ren, Huorong [1 ,2 ]
Ye, Zhixing [1 ,2 ]
Li, Zhiwu [1 ,3 ]
机构
[1] Xidian Univ, Sch Electromech Engn, Xian 710071, Peoples R China
[2] Minist Educ, Key Lab Elect Equipment Struct Design, Xian 710071, Peoples R China
[3] Macau Univ Sci & Technol, Inst Syst Engn, Taipa 999078, Macau, Peoples R China
关键词
Sequence data; Anomaly detection; Markov model; Higher order Markov model; TIME-SERIES; STATISTICS; ALGORITHMS; SYSTEMS;
D O I
10.1016/j.ins.2017.05.021
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Anomaly detection in sequence data is becoming more and more important in a wide variety of application domains such as credit card fraud detection, health care in medical field, and intrusion detection in cyber security. In the existing anomaly detection approaches, Markov chain techniques are widely accepted for their simple realization and few parameters. However, the short memory property of a classical Markov model ignores the interaction among data, and the long memory property of a higher order Markov model clouds the relationship between the previous data and current test data, and reduces the reliability of the model. Besides, both of these models cannot successfully describe the sequences changing with a tendency. In this paper, we propose an anomaly detection approach based on a dynamic Markov model. This approach segments sequence data by a sliding window. In the sliding window, we define the states of data according to the value of the data and establish a higher order Markov model with a proper order consequently, to balance the length of the memory property and keep up with the trend of sequences. In addition, an anomaly substitution strategy is proposed to prevent the detected anomalies from impacting the building of the models and keep anomaly detection continuously. The experimental results using simulated datasets and real-world datasets have demonstrated that the proposed approach improves the adaptability and stability of anomaly detection in sequence data. (C) 2017 Elsevier Inc. All rights reserved.
引用
收藏
页码:52 / 65
页数:14
相关论文
共 43 条
[1]  
Azha I.I.N., 2016, P IEEE INT C CONTR S, P24, DOI [10.1109/ICSGRC.2016.7813299, DOI 10.1109/ICSGRC.2016.7813299]
[2]   A multi-step outlier-based anomaly detection approach to network-wide traffic [J].
Bhuyan, Monowar H. ;
Bhattacharyya, D. K. ;
Kalita, J. K. .
INFORMATION SCIENCES, 2016, 348 :243-271
[3]   LOF: Identifying density-based local outliers [J].
Breunig, MM ;
Kriegel, HP ;
Ng, RT ;
Sander, J .
SIGMOD RECORD, 2000, 29 (02) :93-104
[4]   Adaptive Hidden Markov Model With Anomaly States for Price Manipulation Detection [J].
Cao, Yi ;
Li, Yuhua ;
Coleman, Sonya ;
Belatreche, Ammar ;
McGinnity, Thomas Martin .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (02) :318-330
[5]   Anomaly Detection: A Survey [J].
Chandola, Varun ;
Banerjee, Arindam ;
Kumar, Vipin .
ACM COMPUTING SURVEYS, 2009, 41 (03)
[6]   Anomaly Detection for Discrete Sequences: A Survey [J].
Chandola, Varun ;
Banerjee, Arindam ;
Kumar, Vipin .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2012, 24 (05) :823-839
[7]  
Chen DQ, 2015, 2015 12th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD), P1532, DOI 10.1109/FSKD.2015.7382172
[8]   Outlier Detection with the Kernelized Spatial Depth Function [J].
Chen, Yixin ;
Dang, Xin ;
Peng, Hanxiang ;
Bart, Henry L., Jr. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2009, 31 (02) :288-305
[9]   Deadlock recovery for flexible manufacturing systems modeled with Petri nets [J].
Chen, YuFeng ;
Li, ZhiWu ;
Al-Ahmari, Abdulrahman ;
Wu, Naiqi ;
Qu, Ting .
INFORMATION SCIENCES, 2017, 381 :290-303
[10]   A formal framework for positive and negative detection schemes [J].
Esponda, F ;
Forrest, S ;
Helman, P .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2004, 34 (01) :357-373