Multiple Types of Missing Precipitation Data Filling Based on Ensemble Artificial Intelligence Models

被引:1
作者
Qiu, He [1 ,2 ]
Chen, Hao [1 ,2 ,3 ]
Xu, Bingjiao [1 ,2 ]
Liu, Gaozhan [1 ,2 ]
Huang, Saihua [1 ,2 ]
Nie, Hui [1 ,2 ]
Xie, Huawei [1 ,2 ]
机构
[1] Zhejiang Univ Water Resources & Elect Power, Sch Hydraul Engn, Hangzhou 310018, Peoples R China
[2] Zhejiang Univ Water Resources & Elect Power, Int Sci & Technol Cooperat Base Utilizat & Sustain, Hangzhou 310018, Peoples R China
[3] Zhejiang Univ Water Resources & Elect Power, Nanxun Innovat Inst, Hangzhou 310018, Peoples R China
关键词
precipitation data missingness; artificial intelligence; data imputation; ensemble simulation; Jiaojiang River basin; SVR MODEL; INTERPOLATION; PREDICTION; IMPUTATION;
D O I
10.3390/w16223192
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The completeness of precipitation observation data is a crucial foundation for hydrological simulation, water resource analysis, and environmental assessment. Traditional data imputation methods suffer from poor adaptability, lack of precision, and limited model diversity. Rapid and accurate imputation using available data is a key challenge in precipitation monitoring. This study selected precipitation data from the Jiaojiang River basin in the southeastern Zhejiang Province of China from 1991 to 2020. The data were categorized based on various missing rates and scenarios, namely MCR (Missing Completely Random), MR (Missing Random), and MNR (Missing Not Random). Imputation of precipitation data was conducted using three types of Artificial Intelligence (AI) methods (Backpropagation Neural Network (BPNN), Random Forest (RF), and Support Vector Regression (SVR)), along with a novel Multiple Linear Regression (MLR) imputation method built upon these algorithms. The results indicate that the constructed MLR imputation method achieves an average Pearson's correlation coefficient (PCC) of 0.9455, an average Nash-Sutcliffe Efficiency (NSE) of 0.8329, and an average Percent Bias (Pbias) of 10.5043% across different missing rates. MLR simulation results in higher NSE and lower Pbias than the other three single AI models, thus effectively improving the estimation performance. The proposed methods in this study can be applied to other river basins to improve the quality of precipitation data and support water resource management.
引用
收藏
页数:21
相关论文
共 73 条
  • [51] Multiple linear regression approach to predict tensile properties of Sn-Ag-Cu (SAC) alloys
    Schon, Aline Ferreira
    Castro, Nicolau Apoena
    Barros, Andre dos Santos
    Spinelli, Jose Eduardo
    Garcia, Amauri
    Cheung, Noe
    Silva, Bismarck Luiz
    [J]. MATERIALS LETTERS, 2021, 304
  • [52] Random forests-based error-correction of streamflow from a large-scale hydrological model: Using model state variables to estimate error terms
    Shen, Youchen
    Ruijsch, Jessica
    Lu, Meng
    Sutanudjaja, Edwin H.
    Karssenberg, Derek
    [J]. COMPUTERS & GEOSCIENCES, 2022, 159
  • [53] Prediction of multi-sectoral longitudinal water withdrawals using hierarchical machine learning models
    Shortridge, Julie
    [J]. JOURNAL OF HYDROINFORMATICS, 2023, 25 (06) : 2389 - 2405
  • [54] Machine learning methods for empirical streamflow simulation: a comparison of model accuracy, interpretability, and uncertainty in seasonal watersheds
    Shortridge, Julie E.
    Guikema, Seth D.
    Zaitchik, Benjamin F.
    [J]. HYDROLOGY AND EARTH SYSTEM SCIENCES, 2016, 20 (07) : 2611 - 2628
  • [55] Use of the data depth function to differentiate between case of interpolation and extrapolation in hydrological model prediction
    Singh, Shailesh Kumar
    McMillan, Hilary
    Bardossy, Andras
    [J]. JOURNAL OF HYDROLOGY, 2013, 477 : 213 - 228
  • [56] Incomplete information imputation in limited data environments with application to disaster response
    Sokat, Kezban Yagci
    Dolinskaya, Irina S.
    Smilowitz, Karen
    Bank, Ryan
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2018, 269 (02) : 466 - 485
  • [57] Ground visibility prediction using tree-based and random-forest machine learning algorithm: Comparative study based on atmospheric pollution and atmospheric boundary layer data
    Wang, Fuzeng
    Liu, Ruolan
    Yan, Hao
    Liu, Duanyang
    Han, Lin
    Yuan, Shujie
    [J]. ATMOSPHERIC POLLUTION RESEARCH, 2024, 15 (11)
  • [58] Hybrid random forest models optimized by Sparrow search algorithm (SSA) and Harris hawk optimization algorithm (HHO) for slope stability prediction
    Wang, Meng
    Zhao, Guoyan
    Wang, Shaofeng
    [J]. TRANSPORTATION GEOTECHNICS, 2024, 48
  • [59] Imputation of missing daily rainfall data; A comparison between artificial intelligence and statistical techniques
    Wangwongchai, Angkool
    Waqas, Muhammad
    Dechpichai, Porntip
    Hlaing, Phyo Thandar
    Ahmad, Shakeel
    Humphries, Usa Wannasingha
    [J]. METHODSX, 2023, 11
  • [60] SOH prediction of lithium battery based on IC curve feature and BP neural network
    Wen, Jianping
    Chen, Xing
    Li, Xianghe
    Li, Yikun
    [J]. ENERGY, 2022, 261