A Real-Time Network Traffic Classifier for Online Applications Using Machine Learning

被引:14
作者
Ahmed, Ahmed Abdelmoamen [1 ]
Agunsoye, Gbenga [1 ]
机构
[1] Prairie View A&M Univ, Dept Comp Sci, Prairie View, TX 77446 USA
基金
美国国家科学基金会;
关键词
real-time; traffic classifier; network flow; machine learning; KNN; RF; ANN;
D O I
10.3390/a14080250
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The increasing ubiquity of network traffic and the new online applications' deployment has increased traffic analysis complexity. Traditionally, network administrators rely on recognizing well-known static ports for classifying the traffic flowing their networks. However, modern network traffic uses dynamic ports and is transported over secure application-layer protocols (e.g., HTTPS, SSL, and SSH). This makes it a challenging task for network administrators to identify online applications using traditional port-based approaches. One way for classifying the modern network traffic is to use machine learning (ML) to distinguish between the different traffic attributes such as packet count and size, packet inter-arrival time, packet send-receive ratio, etc. This paper presents the design and implementation of NetScrapper, a flow-based network traffic classifier for online applications. NetScrapper uses three ML models, namely K-Nearest Neighbors (KNN), Random Forest (RF), and Artificial Neural Network (ANN), for classifying the most popular 53 online applications, including Amazon, Youtube, Google, Twitter, and many others. We collected a network traffic dataset containing 3,577,296 packet flows with different 87 features for training, validating, and testing the ML models. A web-based user-friendly interface is developed to enable users to either upload a snapshot of their network traffic to NetScrapper or sniff the network traffic directly from the network interface card in real time. Additionally, we created a middleware pipeline for interfacing the three models with the Flask GUI. Finally, we evaluated NetScrapper using various performance metrics such as classification accuracy and prediction time. Most notably, we found that our ANN model achieves an overall classification accuracy of 99.86% in recognizing the online applications in our dataset.
引用
收藏
页数:20
相关论文
共 32 条
[11]   Constructing 3D Maps for Dynamic Environments using Autonomous UAVs [J].
Ahmed, Ahmed Abdelmoamen ;
Olumide, Abel ;
Akinwa, Adeoluwa ;
Chouikha, Mohamed .
PROCEEDINGS OF THE 16TH EAI INTERNATIONAL CONFERENCE ON MOBILE AND UBIQUITOUS SYSTEMS: COMPUTING, NETWORKING AND SERVICES (MOBIQUITOUS'19), 2019, :504-513
[12]  
[Anonymous], 2017, EAI ENDORSED T MOB C, DOI DOI 10.4108/EAI.13-9-2017.153070
[13]  
Chang L.-H., 2020, Adv. Technol. Innov, V5, P216, DOI DOI 10.46604/AITI.2020.4286
[14]   Predicting Network Flow Characteristics Using Deep Learning and Real-World Network Traffic [J].
Hardegen, Christoph ;
Pfuelb, Benedikt ;
Rieger, Sebastian ;
Gepperth, Alexander .
IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2020, 17 (04) :2662-2676
[15]   Online classification of user activities using machine learning on network traffic [J].
Labayen, Victor ;
Magana, Eduardo ;
Morato, Daniel ;
Izal, Mikel .
COMPUTER NETWORKS, 2020, 181
[16]   Adversarial Multi-task Learning for Text Classification [J].
Liu, Pengfei ;
Qiu, Xipeng ;
Huang, Xuanjing .
PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, :1-10
[17]   Network Traffic Classifier With Convolutional and Recurrent Neural Networks for Internet of Things [J].
Lopez-Martin, Manuel ;
Carro, Belen ;
Sanchez-Esguevillas, Antonio ;
Lloret, Jaime .
IEEE ACCESS, 2017, 5 :18042-18050
[18]   Deep packet: a novel approach for encrypted traffic classification using deep learning [J].
Lotfollahi, Mohammad ;
Siavoshani, Mahdi Jafari ;
Zade, Ramin Shirali Hossein ;
Saberian, Mohammdsadegh .
SOFT COMPUTING, 2020, 24 (03) :1999-2012
[19]  
Mo'men A. M. A., 2010, 7th International Symposium on High Capacity Optical Networks and Enabling Technologies (HONET 2010), P51, DOI 10.1109/HONET.2010.5715791
[20]  
Mo'men Ahmed M. Abdel, 2010, 7th International Symposium on High Capacity Optical Networks and Enabling Technologies (HONET 2010), P262, DOI 10.1109/HONET.2010.5715786