Network traffic analysis using machine learning: an unsupervised approach to understand and slice your network

Cited by: 10
Authors
Aouedi, Ons [1]
Piamrat, Kandaraj [1]
Hamma, Salima [1]
Perera, J. K. Menuka [1]
Affiliations
[1] Lab Sci Numer Nantes, Nantes, France
Keywords
Machine learning; Feature selection; Clustering; Unsupervised learning; Network traffic; Traffic analysis; Network slicing; 5G; SELECTION
DOI
10.1007/s12243-021-00889-1
Chinese Library Classification number
TN [Electronic technology, communication technology]
Discipline classification code
0809
Abstract
Recent developments in smart devices have led to an explosion in data generation and heterogeneity, which requires new network solutions for better analyzing and understanding traffic. These solutions should be intelligent and scalable in order to handle the huge amount of data automatically. With the progress of high-performance computing (HPC), it has become feasible to deploy machine learning (ML) easily to solve complex problems, and its efficiency has been validated in several domains (e.g., healthcare or computer vision). At the same time, network slicing (NS) has drawn significant attention from both industry and academia, as it is essential to address the diversity of service requirements. Therefore, the adoption of ML within NS management is an interesting issue. In this paper, we focus on analyzing network data with the objective of defining network slices according to traffic flow behaviors. For dimensionality reduction, feature selection is applied to select the most relevant features (15 out of 87) from a real dataset of more than 3 million instances. Then, K-means clustering is applied to better understand and distinguish traffic behaviors. The results demonstrate a good correlation among instances in the same cluster generated by the unsupervised learning. This solution can be further integrated in a real environment using network function virtualization.
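The two-stage pipeline described in the abstract (feature selection followed by K-means clustering) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not name the feature selection method, so a simple variance ranking is used as a stand-in, and the function names (`select_top_variance`, `project`, `kmeans`, `make_toy_flows`) and the synthetic four-feature flows are hypothetical.

```python
import random


def select_top_variance(rows, k):
    """Return sorted indices of the k highest-variance features.

    Assumption: variance ranking is only a stand-in selector; the
    abstract does not say which selection method the paper uses.
    """
    n = len(rows)
    variances = []
    for j in range(len(rows[0])):
        col = [r[j] for r in rows]
        mean = sum(col) / n
        variances.append(sum((x - mean) ** 2 for x in col) / n)
    ranked = sorted(range(len(variances)), key=lambda j: variances[j], reverse=True)
    return sorted(ranked[:k])


def project(rows, keep):
    """Keep only the selected feature columns."""
    return [[r[j] for j in keep] for r in rows]


def sq_dist(a, b):
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points, k, iters=50, seed=0):
    """Plain Lloyd's algorithm; returns (labels, centroids)."""
    rng = random.Random(seed)
    centroids = [list(p) for p in rng.sample(points, k)]
    labels = None
    for _ in range(iters):
        # Assignment step: each point goes to its nearest centroid.
        new_labels = [
            min(range(k), key=lambda c: sq_dist(p, centroids[c])) for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, new_labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
        if new_labels == labels:  # assignments stable: converged
            break
        labels = new_labels
    return labels, centroids


def make_toy_flows(seed=42):
    """Synthetic flows (hypothetical data, not the paper's dataset).

    Columns: mean packet size, duration, a constant, near-constant noise.
    """
    rng = random.Random(seed)
    flows = []
    for _ in range(20):  # short flows with small packets
        flows.append([rng.gauss(100, 5), rng.gauss(1, 0.1), 0.5, rng.gauss(0, 0.01)])
    for _ in range(20):  # long flows with large packets
        flows.append([rng.gauss(1400, 5), rng.gauss(30, 0.1), 0.5, rng.gauss(0, 0.01)])
    return flows


if __name__ == "__main__":
    flows = make_toy_flows()
    keep = select_top_variance(flows, 2)  # the paper keeps 15 of 87 features
    labels, _ = kmeans(project(flows, keep), k=2)
    print("kept feature indices:", keep)
    print("cluster labels:", labels)
```

The features here are left unscaled to keep the sketch short; with real traffic data, standardizing the features before clustering would likely be needed so that no single feature dominates the distance.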
Pages: 297-309
Number of pages: 13
Related papers
32 in total
[1]   Optimal 5G network slicing using machine learning and deep learning concepts [J].
Abidi, Mustufa Haider ;
Alkhalefah, Hisham ;
Moiduddin, Khaja ;
Alazab, Mamoun ;
Mohammed, Muneer Khan ;
Ameen, Wadea ;
Gadekallu, Thippa Reddy .
COMPUTER STANDARDS & INTERFACES, 2021, 76
[2]   Network Slicing and Softwarization: A Survey on Principles, Enabling Technologies, and Solutions [J].
Afolabi, Ibrahim ;
Taleb, Tarik ;
Samdanis, Konstantinos ;
Ksentini, Adlen ;
Flinck, Hannu .
IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2018, 20 (03) :2429-2453
[3]   Survey of State-of-the-Art Mixed Data Clustering Algorithms [J].
Ahmad, Amir ;
Khan, Shehroz S. .
IEEE ACCESS, 2019, 7 :31883-31902
[4]   Performance evaluation of feature selection and tree-based algorithms for traffic classification [J].
Aouedi, Ons ;
Piamrat, Kandaraj ;
Parrein, Benoit .
2021 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS), 2021,
[5]   Selection of relevant features and examples in machine learning [J].
Blum, AL ;
Langley, P .
ARTIFICIAL INTELLIGENCE, 1997, 97 (1-2) :245-271
[6]   A comprehensive survey on machine learning for networking: evolution, applications and research opportunities [J].
Boutaba, Raouf ;
Salahuddin, Mohammad A. ;
Limam, Noura ;
Ayoubi, Sara ;
Shahriar, Nashid ;
Estrada-Solano, Felipe ;
Caicedo, Oscar M. .
JOURNAL OF INTERNET SERVICES AND APPLICATIONS, 2018, 9 (09)
[7]   CLUSTER SEPARATION MEASURE [J].
DAVIES, DL ;
BOULDIN, DW .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1979, 1 (02) :224-227
[8]   A Few Useful Things to Know About Machine Learning [J].
Domingos, Pedro .
COMMUNICATIONS OF THE ACM, 2012, 55 (10) :78-87
[9]
Ester M., 1996, P 2 INT C KNOWL DISC, DOI 10.5555/3001460.3001507
[10]   When Network Slicing Meets Prospect Theory: A Service Provider Revenue Maximization Framework [J].
Fantacci, Romano ;
Picano, Benedetta .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (03) :3179-3189