Applying PCA for Traffic Anomaly Detection: Problems and Solutions

被引:74
作者
Brauckhoff, Daniela [1 ]
Salamatian, Kave [2 ]
May, Martin [1 ]
机构
[1] ETH, Zurich, Switzerland
[2] Univ Lancaster, Lancaster LA1 4YW, England
来源
IEEE INFOCOM 2009 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS, VOLS 1-5 | 2009年
关键词
D O I
10.1109/INFCOM.2009.5062248
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Spatial Principal Component Analysis (PCA) has been proposed for network-wide anomaly detection. A recent work has shown that PCA is very sensitive to calibration settings. Unfortunately, the authors did not provide further explanations for this observation. In this paper, we fill this gap and provide the reasoning behind the found discrepancies. We revisit PCA for anomaly detection and evaluate its performance on our data. We develop a slightly modified version of PCA that uses only data from a single router. Instead of correlating data across different spatial measurement points, we correlate the data across different metrics. With the help of the analyzed data, we explain the pitfalls of PCA and underline our argumentation with measurement results. We show that the main problem is that PCA fails to capture temporal correlation. We propose a solution to deal with this problem by replacing PCA with the Karhunen-Loeve Transform. We find that when we consider temporal correlation, anomaly detection results are significantly improved.
引用
收藏
页码:2866 / +
页数:2
相关论文
共 10 条
[1]  
[Anonymous], 2001, INTRO MATH STAT ITS
[2]  
Gray R.M., 2005, INTRO STAT SIGNAL PR
[3]   CONTROL PROCEDURES FOR RESIDUALS ASSOCIATED WITH PRINCIPAL COMPONENT ANALYSIS [J].
JACKSON, JE ;
MUDHOLKAR, GS .
TECHNOMETRICS, 1979, 21 (03) :341-349
[4]   GAUSSIAN APPROXIMATION TO DISTRIBUTION OF A DEFINITE QUADRATIC FORM [J].
JENSEN, DR ;
SOLOMON, H .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1972, 67 (340) :898-902
[5]   Galerkin proper orthogonal decomposition methods for a general equation in fluid dynamics [J].
Kunisch, K ;
Volkwein, S .
SIAM JOURNAL ON NUMERICAL ANALYSIS, 2002, 40 (02) :492-515
[6]  
Lakhina A., 2005, ACM SIGCOMM 2005
[7]  
LAKHINA A, 2004, SIGCOMM 20004
[8]  
RINGBERG H, 2007, SIGMETRICS 2007
[9]  
Ruppert D., 2004, J AM STAT ASSOC, V99, P567, DOI [10.1198/jasa.2004.s339, DOI 10.1198/JASA.2004.S339]
[10]  
SOULE A, 2005, IMC 05