A review on machine learning–based approaches for Internet traffic classification

被引:7
作者
Ola Salman
Imad H. Elhajj
Ayman Kayssi
Ali Chehab
机构
[1] American University of Beirut,Department of Electrical and Computer Engineering
来源
Annals of Telecommunications | 2020年 / 75卷
关键词
Machine learning; Internet traffic; Classification; Obfuscation; Survey; Data representation;
D O I
暂无
中图分类号
学科分类号
摘要
Traffic classification acquired the interest of the Internet community early on. Different approaches have been proposed to classify Internet traffic to manage both security and Quality of Service (QoS). However, traditional classification approaches consisting of modifying the Transmission Control Protocol/Internet Protocol (TCP/IP) scheme have not been adopted due to their complex management. In addition, port-based methods and deep packet inspection have limitations in dealing with new traffic characteristics (e.g., dynamic port allocation, tunneling, encryption). Conversely, machine learning (ML) solutions effectively classify traffic down to the device type and specific user action. Another research direction aims to anonymize Internet traffic and thwart classification to maintain user privacy. Existing traffic surveys focus on classification and do not consider anonymization. Here, we review the Internet traffic classification and obfuscation techniques, largely considering the ML-based solutions. In addition, this paper presents a comprehensive review of various data representation methods, and the different objectives of Internet traffic classification. Finally, we present the key findings, limitations, and recommendations for future research.
引用
收藏
页码:673 / 710
页数:37
相关论文
共 393 条
[51]  
Deng H(2018)A machine learning approach for feature selection traffic classification using security analysis J Supercomput 74 4867-2943
[52]  
Chen J(2019)Unsupervised machine learning for networking: Techniques, applications and research challenges IEEE Access 7 65579-1270
[53]  
Han L(2019)Multivariate network traffic analysis using clustered patterns Computing 101 339-147
[54]  
Wang W(2010)Unsupervised host behavior classification from connection patterns Int J Netw Manag 20 317-585
[55]  
Ertam F(2006)Traffic classification on the fly ACM SIGCOMM Comput Commun Rev 36 23-12
[56]  
Avci E(2013)Internet traffic classification using constrained clustering IEEE Trans Parall Distr Syst 25 2932-1213
[57]  
Cao J(2015)Robust network traffic classification IEEE/ACM Trans Netw (TON) 23 1257-28
[58]  
Fang Z(2013)An effective network traffic classification method with unknown flow detection IEEE Trans Netw Serv Manag 10 133-81
[59]  
Qu G(2013)Unsupervised traffic classification using flow statistical properties and ip packet payload J Comput Syst Sci 79 573-564
[60]  
Sun H(2019)Semtra: a semi-supervised approach to traffic flow labeling with minimal human effort Pattern Recogn 91 1-16