Automated identification of network anomalies and their causes with interpretable machine learning: The CIAN methodology and TTrees implementation

被引:2
作者
Moulay, Mohamed [1 ]
Leiva, Rafael Garcia [2 ]
Maroni, Pablo J. Rojo [3 ]
Diez, Fernando [4 ]
Mancuso, Vincenzo [5 ]
Anta, Antonio Fernandez [5 ]
机构
[1] Univ Carlos III Madrid, Madrid, Spain
[2] Vodafone, Madrid, Spain
[3] Nokia Cloud & Networks Serv, Madrid, Spain
[4] Univ Politecn Madrid, Madrid, Spain
[5] IMDEA Networks Inst, Madrid, Spain
关键词
Troubleshooting; Anomaly detection; Feature selection; Interpretable machine learning; INFORMATION; DIAGNOSIS; FRAMEWORK;
D O I
10.1016/j.comcom.2022.05.013
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Leveraging machine learning (ML) for the detection of network problems dates back to handling call-dropping issues in telephony. However, troubleshooting cellular networks is still a manual task, assigned to experts who monitor the network around the clock. To help in this task we present CIAN (from Causality Inference of Anomalies in Networks), a practical and interpretable ML methodology, which we implement in the form of a software tool named TTrees (from Troubleshooting Trees). We have designed CIAN to automate the identification of the causes of performance anomalies in cellular networks. Our methodology is unsupervised and combines multiple ML algorithms (e.g., decision trees and clustering) and Kolmogorov complexity-inspired data analysis tools that we have developed for this work. CIAN can be used with small volumes of data and is quick at training.Our experiments use diverse data sets obtained from measurements in operational commercial mobile networks. They show that the TTrees implementation of CIAN can automatically identify and accurately classify network anomalies - e.g., cases for which a network low performance is not apparently justified by operational conditions - training with just a few hundreds of data samples. The resulting information hence enables precise troubleshooting actions. In particular, we showcase how TTrees can be flexibly used to monitor the performance of TCP and QUIC protocols when they are adopted to serve mobile users.
引用
收藏
页码:327 / 348
页数:22
相关论文
共 50 条
  • [41] An Automated Strategy for Early Risk Identification of Sudden Cardiac Death by Using Machine Learning Approach on Measurable Arrhythmic Risk Markers
    Lai, Dakun
    Zhang, Yifei
    Zhang, Xinshu
    Su, Ye
    Bin Heyat, Md Belal
    IEEE ACCESS, 2019, 7 : 94701 - 94716
  • [42] Identification of suicidality in patients with major depressive disorder via dynamic functional network connectivity signatures and machine learning
    Xu, Manxi
    Zhang, Xiaojing
    Li, Yanqing
    Chen, Shengli
    Zhang, Yingli
    Zhou, Zhifeng
    Lin, Shiwei
    Dong, Tianfa
    Hou, Gangqiang
    Qiu, Yingwei
    TRANSLATIONAL PSYCHIATRY, 2022, 12 (01)
  • [43] Identification of factors influencing net primary productivity of terrestrial ecosystems based on interpretable machine learning --evidence from the county-level administrative districts in China
    Yi, Zhaoqiang
    Wu, Lihua
    JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2023, 326
  • [44] Interpretable machine learning tools to analyze PM2.5 sensor network data so as to quantify local source impacts and long-range transport
    de Foy, Benjamin
    Edwards, Ross
    Joy, Khaled Shaifullah
    Zaman, Shahid Uz
    Salam, Abdus
    Schauer, James J.
    ATMOSPHERIC RESEARCH, 2024, 311
  • [45] Identification of multi-element geochemical anomalies using unsupervised machine learning algorithms: A case study from Ag-Pb-Zn deposits in north-western Zhejiang, China
    Wang, Jun
    Zhou, Yongzhang
    Xiao, Fan
    APPLIED GEOCHEMISTRY, 2020, 120
  • [46] Automated Identification of Sleep Disorder Types Using Triplet Half-Band Filter and Ensemble Machine Learning Techniques with EEG Signals
    Sharma, Manish
    Tiwari, Jainendra
    Patel, Virendra
    Acharya, U. Rajendra
    ELECTRONICS, 2021, 10 (13)
  • [47] Identification of Potential Genes and Critical Pathways in Postoperative Recurrence of Crohn's Disease by Machine Learning And WGCNA Network Analysis
    Rajalingam, Aruna
    Sekar, Kanagaraj
    Ganjiwale, Anjali
    CURRENT GENOMICS, 2023, 24 (02) : 84 - 99
  • [48] Development of machine learning and natural language processing algorithms for preoperative prediction and automated identification of intraoperative vascular injury in anterior lumbar spine surgery
    Karhade, Aditya V.
    Bongers, Michiel E. R.
    Groot, Olivier Q.
    Cha, Thomas D.
    Doorly, Terence P.
    Fogel, Harold A.
    Hershman, Stuart H.
    Tobert, Daniel G.
    Srivastava, Sunita D.
    Bono, Christopher M.
    Kang, James D.
    Harris, Mitchel B.
    Schwab, Joseph H.
    SPINE JOURNAL, 2021, 21 (10) : 1635 - 1642
  • [49] Construction of regulatory network for alopecia areata progression and identification of immune monitoring genes based on multiple machine-learning algorithms
    Xiong, Jiachao
    Chen, Guodong
    Liu, Zhixiao
    Wu, Xuemei
    Xu, Sha
    Xiong, Jun
    Ji, Shizhao
    Wu, Minjuan
    PRECISION CLINICAL MEDICINE, 2023, 6 (02)
  • [50] An Explainable Machine Learning Network for Classification of Autism Spectrum Disorder Using Optimal Frequency Band Identification From Brain EEG
    Saranya, S.
    Menaka, R.
    IEEE ACCESS, 2025, 13 : 32016 - 32030