Non-Gaussian and long memory statistical characterizations for Internet traffic with anomalies

被引:82
作者
Scherrer, Antoine
Larrieu, Nicolas
Owezarski, Philippe
Borgnat, Pierre
Abry, Patrice
机构
[1] Ecole Normale Super Lyon, Phys Lab, F-69364 Lyon 07, France
[2] Ecole Normale Super Lyon, Lab Informat Parallelisme, F-69364 Lyon 07, France
[3] CNRS, LAAS, F-31077 Toulouse 4, France
[4] Ecole Normale Super Lyon, Phys Lab, F-69364 Lyon 07, France
关键词
traffic statistical modeling; DoS attack; flash crowd; non-Gaussian long-range dependent process;
D O I
10.1109/TDSC.2007.12
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The goals of the present contribution are twofold. First, we propose the use of a non-Gaussian long-range dependent process to model Internet traffic aggregated time series. We give the definitions and intuition behind the use of this model. We detail numerical procedures that can be used to synthesize artificial traffic exactly following the model prescription. We also propose original and practically effective procedures to estimate the corresponding parameters from empirical data. We show that this empirical model relevantly describes a large variety of Internet traffic, including both regular traffic obtained from public reference repositories and traffic containing legitimate ( flash crowd) or illegitimate (DDoS attack) anomalies. We observe that the proposed model accurately fits the data for a wide range of aggregation levels. The model provides us with a meaningful multiresolution (i.e., aggregation level dependent) statistics to characterize the traffic: the evolution of the estimated parameters with respect to the aggregation level. It opens the track to the second goal of the paper: anomaly detection. We propose the use of a quadratic distance computed on these statistics to detect the occurrences of DDoS attack and study the statistical performance of these detection procedures. Traffic with anomalies was produced and collected by us so as to create a controlled and reproducible database, allowing for a relevant assessment of the statistical performance of the proposed (modeling and detection) procedures.
引用
收藏
页码:56 / 70
页数:15
相关论文
共 60 条
[1]   Multiscale nature of network traffic [J].
Abry, P ;
Baraniuk, R ;
Flandrin, P ;
Riedi, R ;
Veitch, D .
IEEE SIGNAL PROCESSING MAGAZINE, 2002, 19 (03) :28-46
[2]   Wavelet analysis of long-range-dependent traffic [J].
Abry, P ;
Veitch, D .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1998, 44 (01) :2-15
[3]  
Abry P., 2000, SELF SIMILAR NETWORK, P39, DOI [10.1002/047120644X.ch2, DOI 10.1002/047120644X.CH2]
[4]   A Markovian approach for modeling packet traffic with long-range dependence [J].
Andersen, AT ;
Nielsen, BF .
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 1998, 16 (05) :719-732
[5]  
[Anonymous], P IEEE INFOCOM
[6]  
[Anonymous], P INT C COMP COMM NE
[7]  
[Anonymous], P USENIX SYST ADM C
[8]  
BARFORD P, 2002, P ACM SIGCOMM INT ME
[9]   DISTANCE MEASURES FOR SIGNAL-PROCESSING AND PATTERN-RECOGNITION [J].
BASSEVILLE, M .
SIGNAL PROCESSING, 1989, 18 (04) :349-369
[10]  
Beran J., 1994, Statistics for long-memory processes