Ensemble data mining modeling in corrosion of concrete sewer: A comparative study of network-based (MLPNN & RBFNN) and tree-based (RF, CHAID, & CART) models

被引:78
作者
Zounemat-Kermani, Mohammad [1 ]
Stephan, Dietmar [2 ]
Barjenbruch, Matthias [3 ]
Hinkelmann, Reinhard [4 ]
机构
[1] Shahid Bahonar Univ Kerman, Dept Water Engn, Kerman, Iran
[2] Tech Univ Berlin, Inst Civil Engn, Chair Bldg Mat & Construct Chem, Berlin, Germany
[3] Tech Univ Berlin, Inst Civil Engn, Chair Urban Water Management, Berlin, Germany
[4] Tech Univ Berlin, Inst Civil Engn, Chair Water Resources Management & Modeling Hydro, Berlin, Germany
关键词
Concrete corrosion; Machine learning; Soft computing; Sewer systems; Artificial intelligence; ARTIFICIAL NEURAL-NETWORKS; CLASSIFICATION; DETERIORATION; PREDICTION; PIPES;
D O I
10.1016/j.aei.2019.101030
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This research aims to evaluate ensemble learning (bagging, boosting, and modified bagging) potential in predicting microbially induced concrete corrosion in sewer systems from the data mining (DM) perspective. Particular focus is laid on ensemble techniques for network-based DM methods, including multi-layer perceptron neural network (MLPNN) and radial basis function neural network (RBFNN) as well as tree-based DM methods, such as chi-square automatic interaction detector (CHAID), classification and regression tree (CART), and random forests (RF). Hence, an interdisciplinary approach is presented by combining findings from material sciences and hydrochemistry as well as data mining analyses to predict concrete corrosion. The effective factors on concrete corrosion such as time, gas temperature, gas-phase H2S concentration, relative humidity, pH, and exposure phase are considered as the models' inputs. All 433 datasets are randomly selected to construct an individual model and twenty component models of boosting, bagging, and modified bagging based on training, validating, and testing for each DM base learners. Considering some model performance indices, (e.g., Root mean square error, RMSE; mean absolute percentage error, MAPE; correlation coefficient, r) the best ensemble predictive models are selected. The results obtained indicate that the prediction ability of the random forests DM model is superior to the other ensemble learners, followed by the ensemble Bag-CHAID method. On average, the ensemble tree-based models acted better than the ensemble network-based models; nevertheless, it was also found that taking the advantages of ensemble learning would enhance the general performance of individual DM models by more than 10%.
引用
收藏
页数:12
相关论文
共 34 条
[1]   An evolutionary approach to modelling concrete degradation due to sulphuric acid attack [J].
Alani, Amir M. ;
Faramarzi, Asaad .
APPLIED SOFT COMPUTING, 2014, 24 :985-993
[2]   Model-Tree Ensembles for noise-tolerant system identification [J].
Aleksovski, Darko ;
Kocijan, Jus ;
Dzeroski, Saso .
ADVANCED ENGINEERING INFORMATICS, 2015, 29 (01) :1-15
[3]   A comparative assessment of bagging ensemble models for modeling concrete slump flow [J].
Aydogmus, Hacer Yumurtaci ;
Erdal, Halil Ibrahim ;
Karakurt, Onur ;
Namli, Ersin ;
Turkan, Yusuf S. ;
Erdal, Hamit .
COMPUTERS AND CONCRETE, 2015, 16 (05) :741-757
[4]   Hybrid integration of Multilayer Perceptron Neural Networks and machine learning ensembles for landslide susceptibility assessment at Himalayan area (India) using GIS [J].
Binh Thai Pham ;
Dieu Tien Bui ;
Prakash, Indra ;
Dholakia, M. B. .
CATENA, 2017, 149 :52-63
[5]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[6]  
Breiman L., 2017, Trees, DOI [DOI 10.1201/9781315139470, 10.1201/9781315139470-8, DOI 10.1201/9781315139470-8]
[7]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794
[8]   Machine learning in concrete strength simulations: Multi-nation data analytics [J].
Chou, Jui-Sheng ;
Tsai, Chih-Fong ;
Anh-Duc Pham ;
Lu, Yu-Hsin .
CONSTRUCTION AND BUILDING MATERIALS, 2014, 73 :771-780
[9]   Analysis of concrete from corroded sewer pipe [J].
Davis, JL ;
Nica, D ;
Shields, K ;
Roberts, DJ .
INTERNATIONAL BIODETERIORATION & BIODEGRADATION, 1998, 42 (01) :75-84
[10]   High performance concrete compressive strength forecasting using ensemble models based on discrete wavelet transform [J].
Erdal, Halil Ibrahim ;
Karakurt, Onur ;
Namli, Ersin .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2013, 26 (04) :1246-1254