An Accurate and Extensible Machine Learning Classifier for Flow-Level Traffic Classification

被引:3
作者
Gang Lu [1 ]
Ronghua Guo [1 ]
Ying Zhou [1 ]
Jing Du [1 ]
机构
[1] Chinese Luoyang electronic equipment center
基金
中国国家自然科学基金;
关键词
traffic classification; class imbalance; dircriminator bias; encrypted traffic; machine learning;
D O I
暂无
中图分类号
TP181 [自动推理、机器学习];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Machine Learning(ML) techniques have been widely applied in recent traffic classification.However, the problems of both discriminator bias and class imbalance decrease the accuracies of ML based traffic classifier. In this paper, we propose an accurate and extensible traffic classifier. Specifically, to address the discriminator bias issue, our classifier is built by making an optimal cascade of binary sub-classifiers, where each binary sub-classifier is trained independently with the discriminators used for identifying application specific traffic. Moreover, to balance a training dataset,we apply SMOTE algorithm in generating artificial training samples for minority classes.We evaluate our classifier on two datasets collected from different network border routers.Compared with the previous multi-class traffic classifiers built in one-time training process,our classifier achieves much higher F-Measure and AUC for each application.
引用
收藏
页码:125 / 138
页数:14
相关论文
共 13 条
[1]   分类不平衡协议流的机器学习算法评估与比较 [J].
张宏莉 ;
鲁刚 .
软件学报, 2012, 23 (06) :1500-1516
[2]   Identification of VoIP encrypted traffic using a machine learning approach [J].
Alshammari, Riyad ;
Zincir-Heywood, A. Nur .
JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2015, 27 (01) :77-92
[3]   On the detection of card-sharing traffic through wavelet analysis and Support Vector Machines [J].
Palmieri, Francesco ;
Fiore, Ugo ;
Castiglione, Aniello ;
De Santis, Alfredo .
APPLIED SOFT COMPUTING, 2013, 13 (01) :615-627
[4]   Feature selection for optimizing traffic classification [J].
Zhang, Hongli ;
Lu, Gang ;
Qassrawi, Mahmoud T. ;
Zhang, Yu ;
Yu, Xiangzhan .
COMPUTER COMMUNICATIONS, 2012, 35 (12) :1457-1471
[5]   A Modular Machine Learning System for Flow-Level Traffic Classification in Large Networks [J].
Jin, Yu ;
Duffield, Nick ;
Erman, Jeffrey ;
Haffner, Patrick ;
Sen, Subhabrata ;
Zhang, Zhi-Li .
ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2012, 6 (01)
[6]  
GT[J] . F. Gringoli,Luca Salgarelli,M. Dusi,N. Cascarano,F. Risso,k. c. claffy. ACM SIGCOMM Computer Communication Review . 2009 (5)
[7]   On the Stability of the Information Carried by Traffic Flow Features at the Packet Level [J].
Este, Alice ;
Gringoli, Francesco ;
Salgarelli, Luca .
ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2009, 39 (03) :13-18
[8]  
Efficient application identification and the temporal and spatial stability of classification schema[J] . Wei Li,Marco Canini,Andrew W. Moore,Raffaele Bolla. Computer Networks . 2008 (6)
[9]  
A nonlinear, recurrence-based approach to traffic classification[J] . Francesco Palmieri,Ugo Fiore. Computer Networks . 2008 (6)
[10]   Traffic classification on the fly [J].
Bernaille, Laurent ;
Teixeira, Renata ;
Akodkenou, Ismael ;
Soule, Augustin ;
Salamatian, Kave .
ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2006, 36 (02) :23-26