Accuracy and interpretability of machine learning-based approaches for daily ETo estimation under semi-arid climate in the West African Sahel

Cited by: 2
Authors
Yonaba, Roland [1]
Kiema, Arsene [2]
Tazen, Fowe [1]
Belemtougri, Axel [1]
Cisse, Mansourou [1]
Mounirou, Lawani Adjadi [1]
Bodian, Ansoumana [3]
Koita, Mahamadou [1]
Karambiri, Harouna [1]
Affiliations
[1] Inst Int Ingn Eau & Environm 2iE, Lab Eaux Hydrosyst et Agr LEHSA, 01 BP 594, Ouagadougou, Burkina Faso
[2] Minist Environm Eau & Assainissement, Direct Regionale Eau & Assainissement Ctr Nord, Serv Reg Etud Stat & Sectorielles, 03 BP 7044, Ouagadougou, Burkina Faso
[3] Univ Gaston Berger UGB, Lab Leidi Dynam Terr & Dev, BP 234, St Louis, Senegal
Keywords
Burkina Faso; Interpretability; Machine learning; Reference evapotranspiration; SHAP; West African Sahel; REFERENCE EVAPOTRANSPIRATION; MODELS; PREDICTION; EQUATIONS; ALGORITHMS; INSIGHTS; TREND; SVM
DOI
10.1007/s12145-024-01591-1
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
This study evaluates the accuracy and interpretability of 12 selected machine learning (ML) models for estimating daily reference evapotranspiration (ETo) under semi-arid conditions in Burkina Faso, West African Sahel. Meteorological data (1988-2017) from 9 synoptic stations are used to evaluate model performance. The interpreted variable importance was assessed using SHapley Additive exPlanations (SHAP) values and compared against the reference FAO-56 Penman-Monteith model. Spatiotemporal patterns in meteorological variables influencing ETo are first analysed through trend and change point analyses. The ML models are then calibrated station-wise, using a fivefold cross-validation scheme. All ML models demonstrated strong predictive capabilities (R² = 0.93-1.00, RMSE = 0.05-0.20 mm day⁻¹, NRMSE = 0.40%-2.30%, KGE = 0.99-1.00 at the daily timescale). In terms of accuracy, Extreme Gradient Boosting (XGBoost) emerged as the top-performing model (lowest MAE = 0.03 mm day⁻¹). Tree-based ensemble methods and advanced neural networks consistently outperformed other ML approaches across multiple evaluation metrics. At the monthly and annual timescales, ML models accurately captured ETo patterns and interannual variability. The SHAP value analysis showed that Random Forest (RF), Support Vector Machine (SVM), and boosted models (GBoost and XGBoost) most accurately represented the variable importance hierarchies for ETo estimation, although most models overestimated wind speed contribution. This study highlights the potential of ML approaches for ETo estimation in semi-arid regions, while emphasizing the importance of model interpretability. The findings have significant implications for applications in irrigation planning, water resource management, and climate impact assessment in semi-arid regions.
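The goodness-of-fit scores quoted in the abstract (R², MAE, RMSE, NRMSE, KGE) can be illustrated with a minimal Python sketch. This is not the authors' code: the function name `eto_metrics` is hypothetical, R² is assumed to be the squared Pearson correlation, NRMSE is assumed to be RMSE divided by the observed mean (in percent), and KGE is assumed to follow the original 2009 Kling-Gupta formulation. The paper may use different conventions.

```python
import numpy as np


def eto_metrics(obs, sim):
    """Score simulated daily ETo (sim) against reference ETo (obs), both in mm/day.

    Assumptions (not confirmed by the source): R2 is the squared Pearson
    correlation, NRMSE is normalised by the observed mean, and KGE uses the
    2009 formulation with correlation, variability ratio and bias ratio.
    """
    obs = np.asarray(obs, dtype=float)
    sim = np.asarray(sim, dtype=float)
    err = sim - obs

    mae = np.mean(np.abs(err))             # mean absolute error
    rmse = np.sqrt(np.mean(err ** 2))      # root mean square error
    nrmse = 100.0 * rmse / np.mean(obs)    # RMSE as % of observed mean (assumed)

    r = np.corrcoef(obs, sim)[0, 1]        # Pearson correlation
    alpha = np.std(sim) / np.std(obs)      # variability ratio
    beta = np.mean(sim) / np.mean(obs)     # bias ratio
    kge = 1.0 - np.sqrt((r - 1) ** 2 + (alpha - 1) ** 2 + (beta - 1) ** 2)

    return {"R2": r ** 2, "MAE": mae, "RMSE": rmse, "NRMSE_pct": nrmse, "KGE": kge}


if __name__ == "__main__":
    # Illustrative values only, not data from the study.
    reference = [4.0, 5.0, 6.0, 7.0]
    estimate = [4.1, 5.1, 6.1, 7.1]
    print(eto_metrics(reference, estimate))
```

With a constant offset as in the usage example, the correlation and variability terms are perfect, so the KGE penalty comes only from the bias ratio.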
Pages: 24