Comparison of Machine Learning Techniques Applied to Traffic Prediction of Real Wireless Network

Cited by: 23
Authors
Alekseeva, Daria [1]
Stepanov, Nikolai [2]
Veprev, Albert [2]
Sharapova, Alexandra [2]
Lohan, Elena Simona [1]
Ometov, Aleksandr [1]
Affiliations
[1] Tampere Univ, Fac Informat Technol & Commun Sci, Tampere 33720, Finland
[2] Natl Res Univ Higher Sch Econ, HSE Tikhonov Moscow Inst Elect & Math, Moscow 101000, Russia
Funding
Academy of Finland
Keywords
Measurement; Support vector machines; Prediction algorithms; Predictive models; Boosting; Computational modeling; Bayes methods; Machine learning; communication system traffic; prediction algorithms; optimization; next generation networking; ARTIFICIAL-INTELLIGENCE; ANOMALY DETECTION; 5G MOBILE; SCENARIOS;
DOI
10.1109/ACCESS.2021.3129850
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Subject classification code
0812
Abstract
Traffic volume is growing inexorably as the number of devices on the network increases. Researchers analyze traffic by identifying sophisticated dependencies, anomalies, and novel traffic patterns to improve the performance of systems as a whole. One of the fastest-developing niches in this domain concerns classic and deep Machine Learning techniques, which are expected to improve network operation in the most complex heterogeneous environments. In this work, we first outline existing applications of Machine Learning in the communications domain and then list the most significant challenges in implementing them, along with potential solutions. Finally, we compare classical methods for predicting traffic at the LTE network edge: Linear Regression, Gradient Boosting, Random Forest, Bootstrap Aggregation (Bagging), Huber Regression, Bayesian Regression, and Support Vector Machines (SVM). We develop the corresponding Machine Learning environment based on a public cellular traffic dataset and present a comparison table of the quality metrics and execution time for each model. In our analysis, SVM trained much faster than the other algorithms. Gradient Boosting showed the best prediction quality, as it has the most efficient data determination. Random Forest showed the worst result, since it depends on the number of features, which may be limited. The probabilistic Bayesian Regression method performed slightly worse than Gradient Boosting but trained in less time. Linear models with the Huber loss function also performed well, as this loss optimizes the model parameters better. As a standalone contribution, we offer the source code of the analyzed algorithms in Open Access.
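The kind of comparison the abstract describes (one quality metric plus training time per model) can be sketched in a few lines. This is a minimal illustration only, not the authors' published code or dataset: it uses a synthetic one-dimensional series, covers just two of the listed methods (ordinary linear regression and Huber regression), and fits the Huber model by iteratively reweighted least squares, which is an assumption made here for brevity. All function names (`fit_ols`, `fit_huber`, `mae`, `compare`) are hypothetical.

```python
import time


def fit_ols(xs, ys):
    """Closed-form least squares for the line y = a*x + b."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    a = sxy / sxx
    return a, my - a * mx


def fit_huber(xs, ys, delta=1.0, iters=100):
    """Huber regression via iteratively reweighted least squares (assumed solver)."""
    a, b = fit_ols(xs, ys)  # start from the ordinary least-squares fit
    for _ in range(iters):
        # Huber weights: 1 for small residuals, delta/|r| for large ones.
        ws = []
        for x, y in zip(xs, ys):
            r = abs(y - (a * x + b))
            ws.append(1.0 if r <= delta else delta / r)
        sw = sum(ws)
        mx = sum(w * x for w, x in zip(ws, xs)) / sw
        my = sum(w * y for w, y in zip(ws, ys)) / sw
        sxx = sum(w * (x - mx) ** 2 for w, x in zip(ws, xs))
        sxy = sum(w * (x - mx) * (y - my) for w, x, y in zip(ws, xs, ys))
        a = sxy / sxx
        b = my - a * mx
    return a, b


def mae(model, xs, ys):
    """Mean absolute error of a fitted line (a, b) on the given points."""
    a, b = model
    return sum(abs(y - (a * x + b)) for x, y in zip(xs, ys)) / len(xs)


def compare(models, train, test):
    """Return (name, test MAE, training time in seconds) for each model."""
    rows = []
    for name, fit in models.items():
        start = time.perf_counter()
        model = fit(*train)
        elapsed = time.perf_counter() - start
        rows.append((name, mae(model, *test), elapsed))
    return rows


# Synthetic data (an assumption): a clean linear trend with one large outlier
# in the training targets, evaluated against the clean trend.
xs = list(range(20))
clean = [2 * x + 1 for x in xs]
noisy = clean[:]
noisy[-1] += 60

for name, err, secs in compare({"ols": fit_ols, "huber": fit_huber},
                               (xs, noisy), (xs, clean)):
    print(f"{name:6s} MAE={err:.3f} train_time={secs:.6f}s")
```

In this toy setting the Huber fit is pulled far less by the outlier than the least-squares fit. A comparison on the real dataset would use the full set of models and metrics reported in the paper.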
Pages: 159495-159514
Page count: 20