Predicting Bulk Average Velocity with Rigid Vegetation in Open Channels Using Tree-Based Machine Learning: A Novel Approach Using Explainable Artificial Intelligence

被引:21
作者
Meddage, D. P. P. [1 ]
Ekanayake, I. U. [2 ]
Herath, Sumudu [1 ]
Gobirahavan, R. [3 ]
Muttil, Nitin [4 ,5 ]
Rathnayake, Upaka [6 ]
机构
[1] Univ Moratuwa, Dept Civil & Engn, Moratuwa 10400, Sri Lanka
[2] Univ Peradeniya, Dept Comp Engn, Galaha 20400, Sri Lanka
[3] Univ Ruhuna, Dept Civil & Environm Engn, Matara 81000, Sri Lanka
[4] Victoria Univ, Inst Sustainable Ind & Liveable Cities, POB 14428, Melbourne, Vic 8001, Australia
[5] Victoria Univ, Coll Engn & Sci, POB 14428, Melbourne, Vic 8001, Australia
[6] Sri Lanka Inst Informat Technol, Dept Civil Engn, Malabe 10115, Sri Lanka
关键词
bulk average velocity; explainable artificial intelligence; rigid vegetation; tree-based machine learning; RANDOM-FOREST; NEURAL-NETWORKS; MODEL; REGRESSION; STREAMFLOW; FLOW; ACCURACY; CANOPY; RIVER;
D O I
10.3390/s22124398
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Predicting the bulk-average velocity (U-B) in open channels with rigid vegetation is complicated due to the non-linear nature of the parameters. Despite their higher accuracy, existing regression models fail to highlight the feature importance or causality of the respective predictions. Therefore, we propose a method to predict U-B and the friction factor in the surface layer (f(S)) using tree-based machine learning (ML) models (decision tree, extra tree, and XGBoost). Further, Shapley Additive exPlanation (SHAP) was used to interpret the ML predictions. The comparison emphasized that the XGBoost model is superior in predicting U-B (R = 0.984) and f(S) (R = 0.92) relative to the existing regression models. SHAP revealed the underlying reasoning behind predictions, the dependence of predictions, and feature importance. Interestingly, SHAP adheres to what is generally observed in complex flow behavior, thus, improving trust in predictions.
引用
收藏
页数:29
相关论文
共 85 条
[1]  
Ahmad MA, 2018, ACM-BCB'18: PROCEEDINGS OF THE 2018 ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS, P559, DOI [10.1145/3233547.3233667, 10.1109/ICHI.2018.00095]
[2]   Predictive modelling for solar thermal energy systems: A comparison of support vector regression, random forest, extra trees and regression trees [J].
Ahmad, Muhammad Waseem ;
Reynolds, Jonathan ;
Rezgui, Yacine .
JOURNAL OF CLEANER PRODUCTION, 2018, 203 :810-821
[3]   Flash flood susceptibility modelling using functional tree and hybrid ensemble techniques [J].
Arabameri, Alireza ;
Saha, Sunil ;
Chen, Wei ;
Roy, Jagabandhu ;
Pradhan, Biswajeet ;
Bui, Dieu Tien .
JOURNAL OF HYDROLOGY, 2020, 587
[4]   Application of artificial neural network ensembles in probabilistic hydrological forecasting [J].
Araghinejad, Shahab ;
Azmi, Mohammad ;
Kholghi, Majid .
JOURNAL OF HYDROLOGY, 2011, 407 (1-4) :94-104
[5]  
Augustijn D.C.M., 2008, P INT C FLUV HYDR RI, P343
[6]   A tree-based intelligence ensemble approach for spatial prediction of potential groundwater [J].
Avand, Mohammadtaghi ;
Janizadeh, Saeid ;
Tien Bui, Dieu ;
Pham, Viet Hoa ;
Ngo, Phuong Thao T. ;
Nhu, Viet-Ha .
INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2020, 13 (12) :1408-1429
[7]   Blotch Detection in Archive Films Based on Visual Saliency Map [J].
Aydin, Yildiz ;
Dizdaroglu, Bekir .
COMPLEXITY, 2020, 2020
[8]   On inducing equations for vegetation resistance [J].
Baptist, M. J. ;
Babovic, V. ;
Uthurburu, J. Rodriguez ;
Keijzer, M. ;
Uittenbogaard, R. E. ;
Mynett, A. ;
Verwey, A. .
JOURNAL OF HYDRAULIC RESEARCH, 2007, 45 (04) :435-450
[9]  
Baptist M.J., 2005, Modelling floodplain biogeomorphology
[10]   Forecasting of groundwater level fluctuations using ensemble hybrid multi-wavelet neural network-based models [J].
Barzegar, Rahim ;
Fijani, Elham ;
Moghaddam, Asghar Asghari ;
Tziritis, Evangelos .
SCIENCE OF THE TOTAL ENVIRONMENT, 2017, 599 :20-31