Internet Traffic Classification by Aggregating Correlated Naive Bayes Predictions

被引:133
作者
Zhang, Jun [1 ]
Chen, Chao [1 ]
Xiang, Yang [1 ]
Zhou, Wanlei [1 ]
Xiang, Yong [1 ]
机构
[1] Deakin Univ, Sch Informat Technol, Melbourne, Vic 3125, Australia
基金
澳大利亚研究理事会;
关键词
Traffic classification; network security; naive Bayes; SUPPORT VECTOR MACHINES;
D O I
10.1109/TIFS.2012.2223675
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper presents a novel traffic classification scheme to improve classification performance when few training data are available. In the proposed scheme, traffic flows are described using the discretized statistical features and flow correlation information is modeled by bag-of-flow (BoF). We solve the BoF-based traffic classification in a classifier combination framework and theoretically analyze the performance benefit. Furthermore, a new BoF-based traffic classification method is proposed to aggregate the naive Bayes (NB) predictions of the correlated flows. We also present an analysis on prediction error sensitivity of the aggregation strategies. Finally, a large number of experiments are carried out on two large-scale real-world traffic datasets to evaluate the proposed scheme. The experimental results show that the proposed scheme can achieve much better classification performance than existing state-of-the-art traffic classification methods.
引用
收藏
页码:5 / 15
页数:11
相关论文
共 34 条
  • [1] [Anonymous], 2004, P 4 ACM SIGCOMM C IN, DOI DOI 10.1145/1028788.1028805
  • [2] [Anonymous], 2003, Statistical pattern recognition
  • [3] [Anonymous], 2011, WEKA 3 DATA MINING S
  • [4] [Anonymous], P 13 INT JOINT C ART
  • [5] [Anonymous], 2000, Pattern Classification
  • [6] [Anonymous], 2008, PROC CONEXT, DOI DOI 10.1145/1544012.1544023
  • [7] [Anonymous], 2007, P 16 INT C WORLD WID
  • [8] Bayesian neural networks for Internet traffic classification
    Auld, Tom
    Moore, Andrew W.
    Gull, Stephen F.
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2007, 18 (01): : 223 - 239
  • [9] Traffic classification on the fly
    Bernaille, Laurent
    Teixeira, Renata
    Akodkenou, Ismael
    Soule, Augustin
    Salamatian, Kave
    [J]. ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2006, 36 (02) : 23 - 26
  • [10] Bernaille L, 2007, LECT NOTES COMPUT SC, V4427, P165