High performance machine learning approach for reference evapotranspiration estimation

被引:17
作者
Aly, Mohammed S. [1 ]
Darwish, Saad M. [2 ]
Aly, Ahmed A. [3 ]
机构
[1] Alexandria Sanit & Drainage Co, Alexandria 21526, Egypt
[2] Alexandria Univ, Inst Grad Studies & Res, Dept Informat Technol, Alexandria 21526, Egypt
[3] Alexandria Univ, Fac Agr, Dept Agr Engn, Alexandria 21526, Egypt
关键词
Reference evapotranspiration (ET0); Extra tree regressor (ETR); Support vector regressor (SVR); K-nearest neighbor (KNN); AdaBoost regression (ADA); Super learner; Ensemble learning; Cross-validation; LIMITED METEOROLOGICAL DATA; NEURO-FUZZY; PREDICTION; MODELS; ALGORITHMS; ENSEMBLES; SENSOR; SVM;
D O I
10.1007/s00477-023-02594-y
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Accurate reference evapotranspiration (ET0) estimation has an effective role in reducing water losses and raising the efficiency of irrigation water management. The complicated nature of the evapotranspiration process is illustrated in the amount of meteorological variables required to estimate ET0. Incomplete meteorological data is the most significant challenge that confronts ET0 estimation. For this reason, different machine learning techniques have been employed to predict ET0, but the complicated structures and architectures of many of them make ET0 estimation very difficult. For these challenges, ensemble learning techniques are frequently employed for estimating ET0, particularly when there is a shortage of meteorological data. This paper introduces a powerful super learner ensemble technique for ET0 estimation, where four machine learning models: Extra Tree Regressor, Support Vector Regressor, K-Nearest Neighbor and AdaBoost Regression represent the base learners and their outcomes used as training data for the meta learner. Overcoming the overfitting problem that affects most other ensemble methods is a significant advantage of this cross-validation theory-based approach. Super learner performances were compared with the base learners for their forecasting capabilities through different statistical standards, where the results revealed that the super learner has better accuracy than the base learners, where different combinations of variables have been used whereas Coefficient of Determination (R-2) ranged from 0.9279 to 0.9994 and Mean Squared Error (MSE) ranged from 0.0026 to 0.3289 mm/day but for the base learners R-2 ranged from 0.5592 to 0.9977, and MSE ranged from 0.0896 to 2.0118 mm/day therefore, super learner is highly recommended for ET0 prediction with limited meteorological data.
引用
收藏
页码:689 / 713
页数:25
相关论文
共 85 条
[1]   Reference evapotranspiration estimation in hyper-arid regions via D-vine copula based-quantile regression and comparison with empirical approaches and machine learning models [J].
Abdallah, Mohammed ;
Mohammadi, Babak ;
Zaroug, Modathir A. H. ;
Omer, Abubaker ;
Cheraghalizadeh, Majid ;
Eldow, Mohamed E. E. ;
Duan, Zheng .
JOURNAL OF HYDROLOGY-REGIONAL STUDIES, 2022, 44
[2]   A Comparative Study of Potential Evapotranspiration Estimation by Three Methods with FAO Penman-Monteith Method across Sri Lanka [J].
Abeysiriwardana, Himasha Dilshani ;
Muttil, Nitin ;
Rathnayake, Upaka .
HYDROLOGY, 2022, 9 (11)
[3]   Modern Techniques to Modeling Reference Evapotranspiration in a Semiarid Area Based on ANN and GEP Models [J].
Achite, Mohammed ;
Jehanzaib, Muhammad ;
Sattari, Mohammad Taghi ;
Toubal, Abderrezak Kamel ;
Elshaboury, Nehal ;
Walega, Andrzej ;
Krakauer, Nir ;
Yoo, Ji-Young ;
Kim, Tae-Woong .
WATER, 2022, 14 (08)
[4]  
Allen R.G., 2017, FAO IRRIGATION DRAIN
[5]  
[Anonymous], 2009, The Elements of Statistical Learning, V27, P83, DOI DOI 10.1007/B94608
[6]   Prediction of heat waves using meteorological variables in diverse regions of Iran with advanced machine learning models [J].
Asadollah, Seyed Babak Haji Seyed ;
Khan, Najeebullah ;
Sharafati, Ahmad ;
Shahid, Shamsuddin ;
Chung, Eun-Sung ;
Wang, Xiao-Jun .
STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT, 2022, 36 (07) :1959-1974
[7]   Estimation of daily reference evapotranspiration by hybrid singular spectrum analysis-based stochastic gradient boosting [J].
Basakin, Eyyup Ensar ;
Ekmekcioglu, Omer ;
Stoy, Paul C. ;
Ozger, Mehmet .
METHODSX, 2023, 10
[8]   A regional machine learning method to outperform temperature-based reference evapotranspiration estimations in Southern Spain [J].
Bellido-Jimenez, Juan A. ;
Estevez, Javier ;
Garcia-Marin, Amanda P. .
AGRICULTURAL WATER MANAGEMENT, 2022, 274
[9]  
Bembom O, 2007, STAT APPL GENET MOL, V6
[10]   Online cross-validation-based ensemble learning [J].
Benkeser, David ;
Ju, Cheng ;
Lendle, Sam ;
van der Laan, Mark .
STATISTICS IN MEDICINE, 2018, 37 (02) :249-260