Estimation of a logistic regression model by a genetic algorithm to predict pipe failures in sewer networks

被引:18
|
作者
Robles-Velasco, Alicia [1 ,2 ]
Cortes, Pablo [1 ,2 ]
Munuzuri, Jesus [1 ]
Onieva, Luis [1 ]
机构
[1] Univ Seville, ETSI, Dept Org Ind & Gest Empresas, C Camino Descubrimientos S-N, Seville 41092, Spain
[2] Univ Seville, EMASESA, Catedra Agua, Seville, Spain
关键词
Logistic regression; Binary classifier; Pipe failures; Genetic algorithm; Sewer networks;
D O I
10.1007/s00291-020-00614-9
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
Sewer networks are mainly composed of pipelines which are in charge of transporting sewage and rainwater to wastewater treatment plants. A failure in a sewer pipe has many negative consequences, such as accidents, flooding, pollution or extra costs. Machine learning arises as a very powerful tool to predict these incidents when the amount of available data is large enough. In this study, a real-coded genetic algorithm is implemented to estimate the optimal weights of a logistic regression model whose objective is to forecast pipe failures in wastewater networks. The goal is to create an autonomous and independent predictive system able to support the decisions about pipe replacement plans of companies. From the data processing to the validation of the model, all stages for the implementation of the machine-learning system are explored and carefully explained. Moreover, the methodology is applied to a real sewer network of a Spanish city to check its performance. Results demonstrate that by annually replacing 4% of pipe segments, those whose estimated failure probability is higher than 0.75, almost 30% of unexpected pipe failures are prevented. Furthermore, the analysis of the estimated weights of the logistic regression model reveals some weaknesses of the network as well as the influence of the features in the pipe failures. For instance, the predisposition of vitrified clay pipes to fail and of that pipes with smaller diameters.
引用
收藏
页码:759 / 776
页数:18
相关论文
共 50 条
  • [21] A Model to Predict Breast Cancer Survivability Using Logistic Regression
    Nourelahi, Mehdi
    Zamani, Ali
    Talei, Abdolrasoul
    Tahmasebi, Sedigheh
    MIDDLE EAST JOURNAL OF CANCER, 2019, 10 (02) : 132 - 138
  • [22] Comparison of logistic regression and neural networks to predict rehospitalization in patients with stroke
    Ottenbacher, KJ
    Smith, PM
    Illig, SB
    Linn, RT
    Fiedler, RC
    Granger, CV
    JOURNAL OF CLINICAL EPIDEMIOLOGY, 2001, 54 (11) : 1159 - 1165
  • [23] A weighted logistic regression model for estimation of recurrence of adenomas
    Hsu, Chiu-Hsieh
    Green, Sylvan B.
    He, Yulei
    STATISTICS IN MEDICINE, 2007, 26 (07) : 1567 - 1578
  • [24] Convolutional Neural Networks Optimized by Logistic Regression Model
    Yang, Bo
    Zhao, Zuopeng
    Xu, Xinzheng
    INTELLIGENT INFORMATION PROCESSING VIII, 2016, 486 : 91 - 96
  • [25] Bayesian networks with a logistic regression model for the conditional probabilities
    Rijmen, Frank
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2008, 48 (02) : 659 - 666
  • [26] Logistic model parameter genetic algorithm solution
    Ma You-ping
    Feng Zhong-ke
    ICMS2010: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON MODELLING AND SIMULATION, VOL 4: MODELLING AND SIMULATION IN BIOLOGY, ECOLOGY & ENVIRONMENT, 2010, : 103 - 106
  • [27] Genetic algorithm with logistic regression for prediction of progression to Alzheimer's disease
    Piers Johnson
    Luke Vandewater
    William Wilson
    Paul Maruff
    Greg Savage
    Petra Graham
    Lance S Macaulay
    Kathryn A Ellis
    Cassandra Szoeke
    Ralph N Martins
    Christopher C Rowe
    Colin L Masters
    David Ames
    Ping Zhang
    BMC Bioinformatics, 15
  • [28] BAYESIAN ERROR ESTIMATION AND MODEL SELECTION IN SPARSE LOGISTIC REGRESSION
    Huttunen, Heikki
    Manninen, Tapio
    Tohka, Jussi
    2013 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2013,
  • [29] An algorithm based on logistic regression with data fusion in wireless sensor networks
    Longgeng Liu
    Guangchun Luo
    Ke Qin
    Xiping Zhang
    EURASIP Journal on Wireless Communications and Networking, 2017
  • [30] Genetic algorithm search for large logistic regression models with significant variables
    Stacey, A
    Kildea, D
    ITI 2000: PROCEEDINGS OF THE 22ND INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY INTERFACES, 2000, : 275 - 279