DETECTION AND LOCALIZATION OF CHANGE-POINTS IN HIGH-DIMENSIONAL NETWORK TRAFFIC DATA

被引:59
作者
Levy-Leduc, Celine
Roueff, Francois
机构
[1] CNRS, LTCI, F-75700 Paris, France
[2] Telecom ParisTech, Paris, France
关键词
Network anomaly detection; change-point detection; rank tests; high-dimensional data;
D O I
10.1214/08-AOAS232
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We propose a novel and efficient method, that we shall call TopRank in the following paper, for detecting change-points in high-dimensional data. This issue is of growing concern to the network security community since network anomalies such as Denial of Service (DoS) attacks lead to changes in Internet traffic. Our method consists of a data reduction stage based on record filtering, followed by a nonparametric change-point detection test based on U-statistics. Using this approach, we can address massive data streams and perform anomaly detection and localization on the fly. We show how it applies to some real Internet traffic provided by France-Telecom (a French Internet service provider) in the framework of the ANR-RNRT OSCAR project. This approach is very attractive since it benefits from a low computational load and is able to detect and localize several types of network anomalies. We also assess the performance of the TopRank algorithm using synthetic data and compare it with alternative approaches based on random aggregation.
引用
收藏
页码:637 / 662
页数:26
相关论文
共 23 条
  • [1] ABRY P, 2007, P 2007 INT S APPL IN
  • [2] [Anonymous], 1997, LIMIT THEOREMS CHANG
  • [3] Balachander K., 2003, P 3 ACM SIGCOMM C IN, P234, DOI [DOI 10.1145/948205.948236, 10.1145/948205.948236]
  • [4] Basseville Michele, 1993, Detection of abrupt changes: theory and application, V104
  • [5] BICKEL P, 1976, MATH STAT
  • [6] Brodsky B., 1993, NONPARAMETRIC METHOD
  • [7] GEHAN EA, 1965, BIOMETRIKA, V52, P203, DOI 10.1093/biomet/52.1-2.203
  • [8] A nonparametric test for change in randomly censored data
    Gombay, E
    Liu, SQ
    [J]. CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2000, 28 (01): : 113 - 121
  • [9] Diagnosing network-wide traffic anomalies
    Lakhina, A
    Crovella, M
    Diot, C
    [J]. ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2004, 34 (04) : 219 - 230
  • [10] LEVYLEDUC C, 2008, TOPRANK ALGORITHM RE