A review on machine learning–based approaches for Internet traffic classification

被引:7
作者
Ola Salman
Imad H. Elhajj
Ayman Kayssi
Ali Chehab
机构
[1] American University of Beirut,Department of Electrical and Computer Engineering
来源
Annals of Telecommunications | 2020年 / 75卷
关键词
Machine learning; Internet traffic; Classification; Obfuscation; Survey; Data representation;
D O I
暂无
中图分类号
学科分类号
摘要
Traffic classification acquired the interest of the Internet community early on. Different approaches have been proposed to classify Internet traffic to manage both security and Quality of Service (QoS). However, traditional classification approaches consisting of modifying the Transmission Control Protocol/Internet Protocol (TCP/IP) scheme have not been adopted due to their complex management. In addition, port-based methods and deep packet inspection have limitations in dealing with new traffic characteristics (e.g., dynamic port allocation, tunneling, encryption). Conversely, machine learning (ML) solutions effectively classify traffic down to the device type and specific user action. Another research direction aims to anonymize Internet traffic and thwart classification to maintain user privacy. Existing traffic surveys focus on classification and do not consider anonymization. Here, we review the Internet traffic classification and obfuscation techniques, largely considering the ML-based solutions. In addition, this paper presents a comprehensive review of various data representation methods, and the different objectives of Internet traffic classification. Finally, we present the key findings, limitations, and recommendations for future research.
引用
收藏
页码:673 / 710
页数:37
相关论文
共 393 条
[1]  
Leiner BM(2009)A brief history of the internet ACM SIGCOMM Comput Commun Rev 39 22-31
[2]  
Cerf VG(2018)Iot survey: an sdn and fog computing perspective Comput Netw 143 221-246
[3]  
Clark DD(2013)Fine-grained traffic classification based on functional separation Int J Netw Manag 23 350-381
[4]  
Kahn RE(2015)Recent advancement in machine learning based internet traffic classification Procedia Computer Science 60 784-791
[5]  
Kleinrock L(2015)How robust can a machine learning approach be for classifying encrypted voip? J Netw Syst Manag 23 830-869
[6]  
Lynch DC(2011)Realtime encrypted traffic identification using machine learning JSW 6 1009-1016
[7]  
Postel J(2018)Mobile app identification for encrypted network flows by traffic correlation Int J Distrib Sen Netw 14 1550147718817292-43
[8]  
Roberts LG(2018)Encrypted traffic classification using statistical features. ISeCure 10 29-81
[9]  
Wolff S(2019)Deep learning for encrypted traffic classification: an overview IEEE communications magazine 57 76-78
[10]  
Salman Ola(2017)Robust smartphone app identification via encrypted network traffic analysis IEEE Transactions on Information Forensics and Security 13 63-374