Performance evaluation of secured network traffic classification using a machine learning approach

被引:44
作者
Afuwape, Afeez Ajani [1 ]
Xu, Ying [1 ]
Anajemba, Joseph Henry [2 ]
Srivastava, Gautam [3 ,4 ]
机构
[1] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha 410082, Hunan, Peoples R China
[2] Hohai Univ, Coll Internet Things, Changzhou Campus, Changzhou, Jiangsu, Peoples R China
[3] Brandon Univ, Dept Math & Comp Sci, 270 18th St, Brandon, MB R7A 6A9, Canada
[4] China Med Univ, Res Ctr Interneural Comp, Taichung 40402, Taiwan
关键词
Intrusion detection system; VPN Traffic; 5G; Machine learning; Multilayer perceptron; Random forest; Gradient boosting; ALGORITHM;
D O I
10.1016/j.csi.2021.103545
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Network traffic classification is a significant and problematic aspect of network resource management arising from an investigation of network developments, planning, and design for 5G and beyond. Recently, traffic investigation systems for network monitoring and user access restrictions to Virtual Private Networks (VPN) and non-Virtual Private Networks (non-VPN) have gained widespread attention. In this paper, different algorithms for classifying and detecting VPN traffic are considered. A few existing machine learning procedures were tested concerning their performance in network traffic classification and security. The purpose is to improve Precision, Recall, and F1-score in VPN Network Traffic using Ensemble Classifiers. Therefore, the parameters of the ensemble classifier were changed to obtain high Precision, Recall, and F1-score. Bagging Decision Tree and Gradient Boosting algorithms were used for classification which produced promising results when compared to single classifiers like k-Nearest Neighbors (kNN), Multilayer Perceptron (MLP), and Decision Tree. The proposed classifier demonstrates recognition accuracy on a test sample of up to 93.80% which outperforms all other single algorithms used in previous work. The MLP, Random Forest (RF), and Gradient Boosting (GB) algorithms had almost identical performance in all experiments. Furthermore, the proposed classifiers are found to perform better when the network traffic flows are generated using different values of time parameters (timeout). Our results show that the ensemble algorithms (Random Forest and the Gradient Boosting) outperform the single machine learning classifier previously used by other researchers, and we achieved the highest accuracy with the random forest classifier with better results while using non-VPN traffic and VPN traffic. The novelty lies in the application of an ensemble algorithm to secure a network traffic classification performed in comparison with single classifiers to determine Accuracy, Precision, and F1-score of a given dataset, contrary to the known process of selection of features and generation.
引用
收藏
页数:16
相关论文
共 47 条
[1]  
Abu Taher K, 2019, 2019 1ST INTERNATIONAL CONFERENCE ON ROBOTICS, ELECTRICAL AND SIGNAL PROCESSING TECHNIQUES (ICREST), P643, DOI [10.1109/ICREST.2019.8644161, 10.1109/icrest.2019.8644161]
[2]   Threat of Adversarial Attacks on Deep Learning in Computer Vision: A Survey [J].
Akhtar, Naveed ;
Mian, Ajmal .
IEEE ACCESS, 2018, 6 :14410-14430
[3]   Realizing Efficient Security and Privacy in IoT Networks [J].
Anajemba, Joseph Henry ;
Tang, Yue ;
Iwendi, Celestine ;
Ohwoekevwo, Akpesiri ;
Srivastava, Gautam ;
Jo, Ohyun .
SENSORS, 2020, 20 (09)
[4]  
[Anonymous], 2009, J COMPUTER SCI SYSTE
[5]  
[Anonymous], 2020, VPN DATASET
[6]  
[Anonymous], 2010, Int. J. Comput. Appl., DOI DOI 10.5120/758-993
[7]  
[Anonymous], 2017, 2017 14 IEEE IND COU
[8]   On Internet Traffic Classification: A Two-Phased Machine Learning Approach [J].
Bakhshi, Taimur ;
Ghita, Bogdan .
JOURNAL OF COMPUTER NETWORKS AND COMMUNICATIONS, 2016, 2016
[9]  
Battalov R.I., 2019, INT C INF TECHN NAN, P445
[10]   Issues and Future Directions in Traffic Classification [J].
Dainotti, Alberto ;
Pescape, Antonio ;
Claffy, Kimberly C. .
IEEE NETWORK, 2012, 26 (01) :35-40