Anomaly Detection and Classification in Cellular Networks Using Automatic Labeling Technique for Applying Supervised Learning

被引:17
作者
Al Mamun, S. M. Abdullah [1 ]
Valimaki, Juha [1 ]
机构
[1] TTG Int Ltd, TR-34799 Istanbul, Turkey
来源
CYBER PHYSICAL SYSTEMS AND DEEP LEARNING | 2018年 / 140卷
关键词
Anomaly Detection; AD; Telecommunications; Machine learning; ML; Automation; Quality Assurance; QA; Key Performace Indicator; KPI; Big Data; Analytics; Diagnostics; Self-Diagnostics; LTE; LTE-A; CDMA; WCDMA; UMTS; GSM; 4G; 3G; 2G; IP; Packet Data; Cellular; Wireless; Networks; Hiding HW-Fault; Hiding SW-Bug;
D O I
10.1016/j.procs.2018.10.328
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Anomaly Detection (AD) is a promising new approach for quality control in e.g. operational telecommunications and data networks. In this paper we have applied Supervised Machine Learning (SML) to a set of long term observation time series from a Cellular/Wireless network. We have shown that periodically collected Key Performance Indicators (KPIs) can be analyzed by supervised ML. Generally, the network creates a new big data periodically when different KPIs from e.g. all the cells (sectors of each 2G/3G/4G/5G base station) are output to a remote Database. We have applied a single class support vector machine in the first phase to find out outliers in range based KPI values. Then LSTM RNN (Recurrent Neural Network) is used for deeper understanding of their behavior over time. Both profile based KPIs and range based KPIs are used to filter out the FP (False Positive) or FN (False Negative) anomaly candidates. In this study, we have applied a novel approach to automatically label the huge data into a supervised training set. This is possible when the meaning of major KPIs is well understood. Both a time series profile based prediction and a logical combination of acceptable value ranges (Min/Max) are used for Anomaly Filtering (AF). A Min or a Max condition is omitted in a single threshold case. AF is used both for AD and for automatic labelling of the training set for ML. Automated labelling with AF performed well also for any large dataset The pure time series graph profile based KPIs without applicable limits were not used for labelling nor for AF. This technique gave us better results than unsupervised learning based AD. Our enhanced supervised AD decreased the number of FP anomalies from 33 to 0, while the total anomalies decreased from 35 uncertain cases to 2 TP (True Positive), 0 FN. Finally, KNN algorithm is used to classify test data sets. Our proposed method seems to solve several major problems in the field of Cellular/ Wireless, Fixed, [Packet (e.g. IP)] Data Networks as well as within related network side and user equipment. Automation in general, including medical/ any critical systems and equipment is another possible application domain Automated labelling with AF performed well also for any large dataset. (C) 2018 The Authors. Published by Elsevier B.V.
引用
收藏
页码:186 / 195
页数:10
相关论文
共 18 条
[1]  
[Anonymous], 2014, P IEEE NETW OP MAN S
[2]  
Ben Slimen Y., 2017, P 2017 IEEE GLOBAL C, P1
[3]  
Bouillard A., 2012, 2012 8th International Conference on Network and Service Management (CNSM 2012), P82
[4]  
Brutlag JD, 2000, USENIX ASSOCIATION PROCEEDINGS OF THE FOURTEENTH SYSTEMS ADMINISTRATION CONFERENCE (LISA XIV), P139
[5]  
Cherla S., 2015, 2015 INT JOINT C NEU, P1
[6]  
Ciocarlie GF, 2013, INT CONF NETW SER, P171, DOI 10.1109/CNSM.2013.6727831
[7]  
Ciocarlie GF, 2014, 2014 11TH INTERNATIONAL SYMPOSIUM ON WIRELESS COMMUNICATIONS SYSTEMS (ISWCS), P611, DOI 10.1109/ISWCS.2014.6933426
[8]   A Self-Adaptive Deep Learning-Based System for Anomaly Detection in 5G Networks [J].
Fernandez Maimo, Lorenzo ;
Perales Gomez, Angel Luis ;
Garcia Clemente, Felix J. ;
Gil Perez, Manuel ;
Martinez Perez, Gregorio .
IEEE ACCESS, 2018, 6 :7700-7712
[9]  
Himura Y, 2009, IEEE ICC, P1003
[10]  
Karatepe I.A., 2014, European Wireless 2014