Long-Term Pipeline Failure Prediction Using Nonparametric Survival Analysis

被引:6
作者
Weeraddana, Dilusha [1 ]
MallawaArachchi, Sudaraka [2 ]
Warnakula, Tharindu [2 ]
Li, Zhidong [3 ]
Wang, Yang [3 ]
机构
[1] Data61 Commonwealth Sci & Ind Res Org CSIRO, Eveleigh, Australia
[2] Monash Univ, Melbourne, Vic, Australia
[3] Univ Technol Sydney, Sydney, NSW, Australia
来源
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: APPLIED DATA SCIENCE TRACK, ECML PKDD 2020, PT IV | 2021年 / 12460卷
关键词
Advanced assets management; Machine learning; Data mining; Nonparametric; Survival analysis; Random survival forest; MODELS;
D O I
10.1007/978-3-030-67667-4_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Australian water infrastructure is more than a hundred years old, thus has begun to show its age through water main failures. Our work concerns approximately half a million pipelines across major Australian cities that deliver water to houses and businesses, serving over five million customers. Failures on these buried assets cause damage to properties and water supply disruptions. We applied Machine Learning techniques to find a cost-effective solution to the pipe failure problem in these Australian cities, where on average 1500 of water main failures occur each year. To achieve this objective, we construct a detailed picture and understanding of the behaviour of the water pipe network by developing a Machine Learning model to assess and predict the failure likelihood of water main breaking using historical failure records, descriptors of pipes and other environmental factors. Our results indicate that our system incorporating a nonparametric survival analysis technique called 'Random Survival Forest' outperforms several popular algorithms and expert heuristics in long-term prediction. In addition, we construct a statistical inference technique to quantify the uncertainty associated with the long-term predictions.
引用
收藏
页码:139 / 156
页数:18
相关论文
共 26 条
  • [1] Asnaashari A., 2009, Water Science and Technology: Water Supply, V9, P9, DOI 10.2166/ws.2009.020
  • [2] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [3] Cox David Roxbee, 2018, ANAL SURVIVAL DATA, DOI DOI 10.1201/9781315137438
  • [4] Prediction of the failure pressure for complex corrosion defects
    Cronin, DS
    Pick, RJ
    [J]. INTERNATIONAL JOURNAL OF PRESSURE VESSELS AND PIPING, 2002, 79 (04) : 279 - 287
  • [5] Random Survival Forest in practice: a method for modelling complex metabolomics data in time to event analysis
    Dietrich, Stefan
    Floegel, Anna
    Troll, Martina
    Kuehn, Tilman
    Rathmann, Wolfgang
    Peters, Anette
    Sookthai, Disorn
    von Bergen, Martin
    Kaaks, Rudolf
    Adamski, Jerzy
    Prehn, Cornelia
    Boeing, Heiner
    Schulze, Matthias B.
    Illig, Thomas
    Pischon, Tobias
    Knueppel, Sven
    Wang-Sattler, Rui
    Drogan, Dagmar
    [J]. INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 2016, 45 (05) : 1406 - 1420
  • [6] Stochastic gradient boosting
    Friedman, JH
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2002, 38 (04) : 367 - 378
  • [7] Seasonal factors influencing the failure of buried water reticulation pipes
    Gould, S. J. F.
    Boulaire, F. A.
    Burn, S.
    Zhao, X. L.
    Kodikara, J. K.
    [J]. WATER SCIENCE AND TECHNOLOGY, 2011, 63 (11) : 2692 - 2699
  • [8] Ishwaran H., 2007, Random Survival Forests for R
  • [9] NONPARAMETRIC-ESTIMATION FROM INCOMPLETE OBSERVATIONS
    KAPLAN, EL
    MEIER, P
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1958, 53 (282) : 457 - 481
  • [10] Klein J.P., 2003, SURVIVAL ANAL TECHNI, P91, DOI [DOI 10.1007/B97377, 10.1007/b97377]