Improved Network Traffic Classification Using Ensemble Learning

被引:16
作者
Possebon, Isadora P. [1 ]
Silva, Anderson S. [1 ]
Granville, Lisandro Z. [1 ]
Schaeffer-Filho, Alberto [1 ]
Marnerides, Angelos [2 ]
机构
[1] Univ Fed Rio Grande do Sul, Inst Informat, Porto Alegre, RS, Brazil
[2] Univ Lancaster, Sch Comp & Commun, Lancaster, England
来源
2019 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (ISCC) | 2019年
关键词
D O I
10.1109/iscc47284.2019.8969637
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite the large number of research efforts that applied specific machine learning algorithms for network traffic classification, recent work has highlighted limitations and particularities of individual algorithms that make them more suitable to specific types of traffic and scenarios. As such, an important topic in this area is how to combine individual algorithms using meta-learning techniques in order to obtain more robust traffic classification metrics. This paper presents a comparative analysis among meta-learning approaches and individual classifiers to classify network traffic. We investigate and evaluate a range of meta-learning techniques, including Voting, Stacking, Bagging and Boosting. We then propose a new experimental analysis of different meta-learning techniques - also known as ensemble learners - and compare them with their own base classifiers when used individually. Finally, considering the emerging popularity of Neural Networks, we analyze this scenario using the Multi-layer Perceptron classifier. The experiments were performed with data provided by the UCI Machine Learning Repository. The best performance was obtained by an ensemble technique (Bagging), which obtained accuracy of 99.972% and false positive rate of 0.00018%.
引用
收藏
页码:431 / 436
页数:6
相关论文
共 19 条
[1]   A survey of cross-validation procedures for model selection [J].
Arlot, Sylvain ;
Celisse, Alain .
STATISTICS SURVEYS, 2010, 4 :40-79
[2]   Machine Learning for Cognitive Network Management [J].
Ayoubi, Sara ;
Limam, Noura ;
Salahuddin, Mohammad A. ;
Shahriar, Nashid ;
Boutaba, Raouf ;
Estrada-Solano, Felipe ;
Caicedo, Oscar M. .
IEEE COMMUNICATIONS MAGAZINE, 2018, 56 (01) :158-165
[3]   A comprehensive survey on machine learning for networking: evolution, applications and research opportunities [J].
Boutaba, Raouf ;
Salahuddin, Mohammad A. ;
Limam, Noura ;
Ayoubi, Sara ;
Shahriar, Nashid ;
Estrada-Solano, Felipe ;
Caicedo, Oscar M. .
JOURNAL OF INTERNET SERVICES AND APPLICATIONS, 2018, 9 (01)
[4]   A Survey of Data Mining and Machine Learning Methods for Cyber Security Intrusion Detection [J].
Buczak, Anna L. ;
Guven, Erhan .
IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2016, 18 (02) :1153-1176
[5]  
Chan P. K., 1993, CIKM 93. Proceedings of the Second International Conference on Information and Knowledge Management, P314, DOI 10.1145/170088.170160
[6]  
Changyu Wang, 2017, 2017 IFIP/IEEE Symposium on Integrated Network and Service Management (IM), P588, DOI 10.23919/INM.2017.7987336
[7]  
da Silva AS, 2016, IEEE IFIP NETW OPER, P27, DOI 10.1109/NOMS.2016.7502793
[8]  
Didaci L., 2002, AI IA WORKSH APPR AU
[9]   An experimental comparison of three methods for constructing ensembles of decision trees: Bagging, boosting, and randomization [J].
Dietterich, TG .
MACHINE LEARNING, 2000, 40 (02) :139-157
[10]  
Dzeroski S, 2004, MACH LEARN, V54, P255, DOI 10.1023/B.MAC.0000015881.36452.6e