Adaptive Anomaly Detection with Kernel Eigenspace Splitting and Merging

被引:15
作者
O'Reilly, Colin [1 ]
Gluhak, Alexander [2 ]
Imran, Muhammad Ali [1 ]
机构
[1] Univ Surrey, Ctr Commun Syst Res, Guildford GU2 7XH, Surrey, England
[2] Intel Labs Europe, London, England
基金
英国工程与自然科学研究理事会;
关键词
Adaptive; non-stationary; anomaly detection; outlier detection; kernel principal component analysis; kernel methods; COMPONENT ANALYSIS; SUPPORT; PCA;
D O I
10.1109/TKDE.2014.2324594
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Kernel principal component analysis and the reconstruction error is an effective anomaly detection technique for non-linear data sets. In an environment where a phenomenon is generating data that is non-stationary, anomaly detection requires a recomputation of the kernel eigenspace in order to represent the current data distribution. Recomputation is a computationally complex operation and reducing computational complexity is therefore a key challenge. In this paper, we propose an algorithm that is able to accurately remove data from a kernel eigenspace without performing a batch recomputation. Coupled with a kernel eigenspace update, we demonstrate that our technique is able to remove and add data to a kernel eigenspace more accurately than existing techniques. An adaptive version determines an appropriately sized sliding window of data and when a model update is necessary. Experimental evaluations on both synthetic and real-world data sets demonstrate the superior performance of the proposed approach in comparison to alternative incremental KPCA approaches and alternative anomaly detection techniques.
引用
收藏
页码:3 / 16
页数:14
相关论文
共 40 条
[1]  
Aizerman M. A., 1964, AUTOMAT REM CONTR, V25, P821, DOI DOI 10.1234/12345678
[2]  
Bache K, 2013, UCI machine learning repository
[3]  
Barnett V., 1994, Outliers in statistical data, V3
[4]   Variable window adaptive Kernel Principal Component Analysis for nonlinear nonstationary process monitoring [J].
Ben Khediri, Issam ;
Limam, Mohamed ;
Weihs, Claus .
COMPUTERS & INDUSTRIAL ENGINEERING, 2011, 61 (03) :437-446
[5]   The use of the area under the roc curve in the evaluation of machine learning algorithms [J].
Bradley, AP .
PATTERN RECOGNITION, 1997, 30 (07) :1145-1159
[6]   LOF: Identifying density-based local outliers [J].
Breunig, MM ;
Kriegel, HP ;
Ng, RT ;
Sander, J .
SIGMOD RECORD, 2000, 29 (02) :93-104
[7]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[8]   Diagnosing anomalies and identifying faulty nodes in sensor networks [J].
Chatzigiannakis, Vassilis ;
Papavassiliou, Symeon .
IEEE SENSORS JOURNAL, 2007, 7 (5-6) :637-645
[9]   Incremental kernel principal component analysis [J].
Chin, Tat-Jun ;
Suter, David .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2007, 16 (06) :1662-1674
[10]   A Compressed PCA Subspace Method for Anomaly Detection in High-Dimensional Data [J].
Ding, Qi ;
Kolaczyk, Eric D. .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2013, 59 (11) :7419-7433