Semi-supervised anomaly detection methods for leakage identification in water distribution networks: A comparative study

被引:4
作者
Tornyeviadzi, Hoese Michel [1 ]
Mohammed, Hadi [1 ]
Seidu, Razak [1 ]
机构
[1] Norwegian Univ Sci & Technol, Dept Ocean Operat & Civil Engn, Smart Water Lab, Alesund, Norway
来源
MACHINE LEARNING WITH APPLICATIONS | 2023年 / 14卷
关键词
Anomaly detection; Leakage detection; Semi -supervised learning; Water distribution networks; ALGORITHMS;
D O I
10.1016/j.mlwa.2023.100501
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This study presents a comprehensive evaluation of 10 state of the art semi-supervised anomaly detection (AD) methods for leakage identification in water distribution networks (WDNs). The performances of the semisupervised AD methods is evaluated on LeakDB, a benchmark consisting of independent leakage scenarios that also account for the various sources of uncertainties arising in WDNs. Three performance metrics (F beta Measure, PR AUC Score, and Identification Lag Time) that collectively capture the different facets of leakage identification in WDNs is utilised to measure the efficacy of semi-supervised AD methods. Additionally, the TOPSIS MCDM tool supported with two weighting approaches is implemented to simultaneously consider all performance metrics in ranking the performance of semi-supervised AD methods. The results of this extensive comparative study shows that Local Outlier factor (LOF) is the overall best performing semi-supervised AD method on LeakDB. It is also evident that proximity based semi-supervised AD methods are superior to linear and probabilistic AD methods due to their ability to unearth leak events in the neighbourhood of normal operational data points. Finally, the impact of uncertainties on the performance of the semi-supervised AD models is discussed in addition to general recommendations on the usage of semi-supervised AD methods in leakage identification.
引用
收藏
页数:16
相关论文
共 56 条
  • [1] A Novel Outlier Detection Method for Multivariate Data
    Almardeny, Yahya
    Boujnah, Noureddine
    Cleary, Frances
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (09) : 4052 - 4062
  • [2] AWWA W. A., 2016, Loss control programs (M36): Awwa manual of practice
  • [3] Kernelized technique for outliers detection to monitoring water pipeline based on WSNs
    Ayadi, Aya
    Ghorbel, Oussama
    BenSalah, M. S.
    Abid, Mohamed
    [J]. COMPUTER NETWORKS, 2019, 150 : 179 - 189
  • [4] Bekkar M., 2013, Journal of Information Security and Applications, V3
  • [5] A dynamic multi-criteria decision-making model for the maintenance planning of reinforced concrete structures
    Benitez, Pablo
    Rocha, Eugenio
    Varum, Humberto
    Rodrigues, Fernanda
    [J]. JOURNAL OF BUILDING ENGINEERING, 2020, 27
  • [6] Bergmann B., 1988, Multiple Hypothesenprufung/Multiple Hypotheses Testing, P100, DOI [DOI 10.1007/978-3-642-52307-68, 10.1007/978-3-642-52307-68, 10.1007/978-3-642-52307-6_8, DOI 10.1007/978-3-642-52307-6_8]
  • [7] Assessing the public health risk of microbial intrusion events in distribution systems: Conceptual model, available data, and challenges
    Besner, Marie-Claude
    Prevost, Michele
    Regli, Stig
    [J]. WATER RESEARCH, 2011, 45 (03) : 961 - 979
  • [8] Modelling the performance potential of forced and natural-draft biomass cookstoves using a hybrid Entropy-TOPSIS approach
    Boafo-Mensah, Gloria
    Neba, Fabrice Abunde
    Tornyeviadzi, Hoese M.
    Seidu, Razak
    Darkwa, Kwadwo Mensah
    Kemausuor, Francis
    [J]. BIOMASS & BIOENERGY, 2021, 150 (150)
  • [9] LOF: Identifying density-based local outliers
    Breunig, MM
    Kriegel, HP
    Ng, RT
    Sander, J
    [J]. SIGMOD RECORD, 2000, 29 (02) : 93 - 104
  • [10] Calvo B, 2016, R J, V8, P248