Accuracy and interpretability of machine learning-based approaches for daily ETo estimation under semi-arid climate in the West African Sahel

Cited by: 2
Authors
Yonaba, Roland [1]
Kiema, Arsene [2]
Tazen, Fowe [1]
Belemtougri, Axel [1]
Cisse, Mansourou [1]
Mounirou, Lawani Adjadi [1]
Bodian, Ansoumana [3]
Koita, Mahamadou [1]
Karambiri, Harouna [1]
Affiliations
[1] Inst Int Ingn Eau & Environm 2iE, Lab Eaux Hydrosyst et Agr LEHSA, 01 BP 594, Ouagadougou, Burkina Faso
[2] Minist Environm Eau & Assainissement, Direct Regionale Eau & Assainissement Ctr Nord, Serv Reg Etud Stat & Sectorielles, 03 BP 7044, Ouagadougou, Burkina Faso
[3] Univ Gaston Berger UGB, Lab Leidi Dynam Terr & Dev, BP 234, St Louis, Senegal
Keywords
Burkina Faso; Interpretability; Machine learning; Reference evapotranspiration; SHAP; West African Sahel; REFERENCE EVAPOTRANSPIRATION; MODELS; PREDICTION; EQUATIONS; ALGORITHMS; INSIGHTS; TREND; SVM
DOI
10.1007/s12145-024-01591-1
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
This study evaluates the accuracy and interpretability of 12 selected machine learning (ML) models for estimating daily reference evapotranspiration (ETo) under semi-arid conditions in Burkina Faso, West African Sahel. Meteorological data (1988-2017) from 9 synoptic stations are used to evaluate model performance. The interpreted variable importance was assessed using SHapley Additive exPlanations (SHAP) values and compared against the reference FAO-56 Penman-Monteith model. Spatiotemporal patterns in meteorological variables influencing ETo are first analysed through trend and change point analyses. The ML models are then calibrated station-wise, using a fivefold cross-validation scheme. All ML models demonstrated strong predictive capabilities (R² = 0.93-1.00, RMSE = 0.05-0.20 mm day⁻¹, NRMSE = 0.40%-2.30%, KGE = 0.99-1.00 at the daily timescale). In terms of accuracy, Extreme Gradient Boosting (XGBoost) emerged as the top-performing model (lowest MAE = 0.03 mm day⁻¹). Tree-based ensemble methods and advanced neural networks consistently outperformed other ML approaches across multiple evaluation metrics. At the monthly and annual timescales, ML models accurately captured ETo patterns and interannual variability. The SHAP value analysis showed that Random Forest (RF), Support Vector Machine (SVM), and boosted models (GBoost and XGBoost) most accurately represented the variable importance hierarchies for ETo estimation, although most models overestimated wind speed contribution. This study highlights the potential of ML approaches for ETo estimation in semi-arid regions, while emphasizing the importance of model interpretability. The findings have significant implications for applications in irrigation planning, water resource management, and climate impact assessment in semi-arid regions.
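The goodness-of-fit metrics named in the abstract (MAE, RMSE, NRMSE, R², KGE) can be illustrated with a minimal Python sketch. This is not the authors' code. The function name `evaluate_eto` and the sample values are hypothetical, NRMSE is assumed to be RMSE divided by the observed mean (in percent), R² is assumed to be the squared Pearson correlation, and KGE (Kling-Gupta efficiency) is assumed to follow its original 2009 formulation; the paper may use different variants.

```python
import math
from statistics import fmean, pstdev


def evaluate_eto(obs, sim):
    """Return MAE, RMSE, NRMSE (%), R2 and KGE for paired daily ETo series (mm/day)."""
    if len(obs) != len(sim) or len(obs) < 2:
        raise ValueError("obs and sim must be paired series with at least 2 values")

    mean_obs, mean_sim = fmean(obs), fmean(sim)
    errors = [s - o for o, s in zip(obs, sim)]

    mae = fmean([abs(e) for e in errors])
    rmse = math.sqrt(fmean([e * e for e in errors]))
    # Assumption: NRMSE is RMSE normalised by the observed mean, in percent.
    nrmse = 100.0 * rmse / mean_obs

    # Pearson correlation (population covariance over population std devs).
    cov = fmean([(o - mean_obs) * (s - mean_sim) for o, s in zip(obs, sim)])
    sd_obs, sd_sim = pstdev(obs), pstdev(sim)
    r = cov / (sd_obs * sd_sim)  # fails if either series is constant

    # Assumption: KGE in its 2009 form (correlation, variability ratio, bias ratio).
    alpha = sd_sim / sd_obs
    beta = mean_sim / mean_obs
    kge = 1.0 - math.sqrt((r - 1.0) ** 2 + (alpha - 1.0) ** 2 + (beta - 1.0) ** 2)

    return {"MAE": mae, "RMSE": rmse, "NRMSE": nrmse, "R2": r * r, "KGE": kge}


if __name__ == "__main__":
    # Hypothetical values: FAO-56 Penman-Monteith reference vs. an ML estimate.
    reference = [5.1, 6.3, 7.0, 6.4, 5.8]
    estimate = [5.0, 6.4, 6.9, 6.5, 5.9]
    for name, value in evaluate_eto(reference, estimate).items():
        print(f"{name}: {value:.3f}")
```

In this sketch the FAO-56 Penman-Monteith series plays the role of `obs` and the ML output the role of `sim`; a KGE of 1 corresponds to a perfect match in correlation, variability and bias.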
Pages: 24