Machine Learning for Failure Management in Microwave Networks: A Data-Centric Approach

被引:0
|
作者
Di Cicco, Nicola [1 ]
Ibrahimi, Memedhe [1 ]
Musumeci, Francesco [1 ]
Bruschetta, Federica [2 ]
Milano, Michele [2 ]
Passera, Claudio [2 ]
Tornatore, Massimo [1 ]
机构
[1] Politecn Milan, Dept Elect Informat & Bioengn, I-20133 Milan, Italy
[2] SIAE Microelettron, I-20093 Cologno Monzese, Italy
来源
IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT | 2024年 / 21卷 / 05期
关键词
Microwave networks; machine learning; failure management; DATA AUGMENTATION; PERFORMANCE; SMOTE;
D O I
10.1109/TNSM.2024.3406934
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider the problem of classifying hardware failures in microwave networks given a collection of alarms using Machine Learning (ML). While ML models have been shown to work extremely well on similar tasks, an ML model is, at most, as good as its training data. In microwave networks, building a good-quality dataset is significantly harder than training a good classifier: annotating data is a costly and time-consuming procedure. We, therefore, shift the perspective from a Model-Centric approach, i.e., how to train the best ML model from a given dataset, to a Data-Centric approach, i.e., how to make the best use of the data at our disposal. To this end, we explore two orthogonal Data-Centric approaches for hardware failure identification in microwave networks. At training time, we leverage synthetic data generation with Conditional Variational Autoencoders to cope with extreme data imbalance and ensure fair performance in all failure classes. At inference time, we leverage Batch Uncertainty-based Active Learning to guide the data annotation procedure of multiple concurrent domain-expert labelers and achieve the best possible classification performance with the smallest possible training dataset. Illustrative experimental results on a real-world dataset show that our Data-Centric approaches allow for training top-performing models with similar to 4.5x less annotated data, while improving the classifier's F1-Score by similar to 2.5% in a condition of extreme data scarcity. Finally, for the first time to the best of our knowledge, we make our dataset (curated by microwave industry experts) publicly available, aiming to foster research in data-driven failure management.
引用
收藏
页码:5420 / 5431
页数:12
相关论文
共 50 条
  • [1] Data-centric approach to improve machine learning models for inorganic materials
    Bartel, Christopher J.
    PATTERNS, 2021, 2 (11):
  • [2] A Data-Centric Optimization Framework for Machine Learning
    Rausch, Oliver
    Ben-Nun, Tal
    Dryden, Nikoli
    Ivanov, Andrei
    Li, Shigang
    Hoefler, Torsten
    PROCEEDINGS OF THE 36TH ACM INTERNATIONAL CONFERENCE ON SUPERCOMPUTING, ICS 2022, 2022,
  • [3] A Data-Centric Reinforcement Learning Approach for Self-Updating Machine Learning Models
    Sack, Mandy
    OPEN ARCHITECTURE/OPEN BUSINESS MODEL NET-CENTRIC SYSTEMS AND DEFENSE TRANSFORMATION 2022, 2022, 12119
  • [4] A Data-Centric Machine Learning Approach for Controlling Exploration in Estimation of Distribution Algorithms
    Bolufe-Rohler, Antonio
    Luke, Jordan
    2022 IEEE INTERNATIONAL IOT, ELECTRONICS AND MECHATRONICS CONFERENCE (IEMTRONICS), 2022, : 72 - 80
  • [5] A Data-Centric Approach to Generate Invariants for a Smart Grid Using Machine Learning
    Hudani, Danish
    Haseeb, Muhammad
    Taufiq, Muhammad
    Umer, Muhammad Azmi
    Kandasamy, Nandha Kumar
    SAT-CPS'22: PROCEEDINGS OF THE 2022 ACM WORKSHOP ON SECURE AND TRUSTWORTHY CYBER-PHYSICAL SYSTEMS, 2022, : 31 - 36
  • [6] Data-Centric Machine Learning in Nursing: A Concept Clarification
    Ball Dunlap, Patricia A.
    Nahm, Eun-Shim
    Umberfield, Elizabeth E.
    CIN-COMPUTERS INFORMATICS NURSING, 2024, 42 (05) : 325 - 333
  • [7] Data-Centric Approaches to Radio Frequency Machine Learning
    Kuzdeba, Scott
    Robinson, Josh
    2022 IEEE MILITARY COMMUNICATIONS CONFERENCE (MILCOM), 2022,
  • [8] Data-centric machine learning in quantum information science
    Lohani, Sanjaya
    Lukens, Joseph M.
    Glasser, Ryan T.
    Searles, Thomas A.
    Kirby, Brian T.
    MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2022, 3 (04):
  • [9] Data-Centric Machine Learning Pipeline for Hardware Verification
    Shin, Hongsup
    2022 IEEE 35TH INTERNATIONAL SYSTEM-ON-CHIP CONFERENCE (IEEE SOCC 2022), 2022, : 11 - 12
  • [10] Taxonomy of machine learning paradigms: A data-centric perspective
    Emmert-Streib, Frank
    Dehmer, Matthias
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2022, 12 (05)