An Accurate and Extensible Machine Learning Classifier for Flow-Level Traffic Classification

被引:4
作者
Lu, Gang [1 ]
Guo, Ronghua [1 ]
Zhou, Ying [1 ]
Du, Jing [1 ]
机构
[1] Chinese Luoyang Elect Equipment Ctr, Luoyang 471003, Peoples R China
基金
中国国家自然科学基金;
关键词
traffic classification; class imbalance; dircriminator bias; encrypted traffic; machine learning; IDENTIFICATION;
D O I
10.1109/CC.2018.8398510
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
Machine Learning (ML) techniques have been widely applied in recent traffic classification. However, the problems of both discriminator bias and class imbalance decrease the accuracies of ML based traffic classifier. In this paper, we propose an accurate and extensible traffic classifier. Specifically, to address the discriminator bias issue, our classifier is built by making an optimal cascade of binary sub-classifiers, where each binary sub-classifier is trained independently with the discriminators used for identifying application specific traffic. Moreover, to balance a training dataset, we apply SMOTE algorithm in generating artificial training samples for minority classes. We evaluate our classifier on two datasets collected from different network border routers. Compared with the previous multi-class traffic classifiers built in one-time training process, our classifier achieves much higher F-Measure and AUC for each application.
引用
收藏
页码:125 / 138
页数:14
相关论文
共 31 条
  • [1] Identification of VoIP encrypted traffic using a machine learning approach
    Alshammari, Riyad
    Zincir-Heywood, A. Nur
    [J]. JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2015, 27 (01) : 77 - 92
  • [2] [Anonymous], 2005, TECHNICAL REPORT
  • [3] [Anonymous], IEEE T PARALLEL DIST
  • [4] [Anonymous], 2010, P ACM C EM NETW EXP
  • [5] BERNAILLE L., 2006, CONEXT 06
  • [6] Traffic classification on the fly
    Bernaille, Laurent
    Teixeira, Renata
    Akodkenou, Ismael
    Soule, Augustin
    Salamatian, Kave
    [J]. ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2006, 36 (02) : 23 - 26
  • [7] SMOTE: Synthetic minority over-sampling technique
    Chawla, Nitesh V.
    Bowyer, Kevin W.
    Hall, Lawrence O.
    Kegelmeyer, W. Philip
    [J]. 2002, American Association for Artificial Intelligence (16)
  • [8] Erman J., 2007, P 3 ANN ACM WORKSHOP, P35
  • [9] On the Stability of the Information Carried by Traffic Flow Features at the Packet Level
    Este, Alice
    Gringoli, Francesco
    Salgarelli, Luca
    [J]. ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2009, 39 (03) : 13 - 18
  • [10] Gil G. D., 2016, INT C INF SYST SEC P