Examination and comparison of binary metaheuristic wrapper-based input variable selection for local and global climate information-driven one-step monthly streamflow forecasting

被引:22
作者
Ren, Kun [1 ]
Wang, Xue [2 ]
Shi, Xiaoyu [3 ]
Qu, Jihong [1 ]
Fang, Wei [3 ]
机构
[1] North China Univ Water Resources & Elect Power, Zhengzhou 450045, Peoples R China
[2] Zheng Zhou Sci & Technol Ind Sch, Zhengzhou 450045, Peoples R China
[3] Xian Univ Technol, Xian 710048, Peoples R China
关键词
Streamflow forecasting; Input variable selection; Binary metaheuristic algorithm; Wrapper method; Machine learning; ARTIFICIAL NEURAL-NETWORKS; SUPPORT VECTOR REGRESSION; MACHINE LEARNING-METHODS; SHORT-TERM; DATA SET; MODEL; WAVELET; TIME; ALGORITHMS; PREDICTION;
D O I
10.1016/j.jhydrol.2021.126152
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
The use of data-driven models to forecast streamflow has received substantial attention from scholars in recent years. However, systematic studies have not been performed to examine binary metaheuristic wrapper-based input variable selection (BMWIVS) in real-world streamflow forecasting. In this study, we explored binary metaheuristic-based shallow machine learning wrappers for one-step monthly streamflow forecasting using local weather information and global climate indices from three catchments with different hydroclimatic conditions. First, the maximal information coefficient (MIC) was employed to investigate the correlations among the forecasting target, streamflow and candidate input variables, which included both local and global climate information. Then, the BMWIVS models obtained by combining eight binary metaheuristic algorithms, five commonly used shallow machine learning algorithms, two combined filter-based input variable selection (FIVS) methods, and two forecasting methods were examined. Finally, the performance of each model was compared with the performance of typical benchmark models, including the univariate seasonal autoregressive integrated moving average model, five machine learning algorithms with no input variable selection, and five machine learning algorithms that use five different FIVS methods. The experimental results emphasized three significant findings. First, an appropriate input variable selection method should be selected in practice because several examined wrappers were inferior to the benchmark models. Second, the BMWIVS model that combined the regularized extreme learning machine method, binary gray wolf optimizer, FIVS results-based initialization method, and forecasted values averaged over multiple runs yielded the best performance in the three cases studied. Third, the correlations in terms of the MIC between the global climate indices and streamflow were lower than those between local weather information and streamflow, and the best wrapper and FIVS would select more local weather information variables than global climate index variables, which suggests that global climate information can be complementary to local weather information for one-step monthly streamflow forecasting. These findings have remarkable practical applications for forecasting monthly streamflow.
引用
收藏
页数:20
相关论文
共 117 条
  • [1] The CAMELS data set: catchment attributes and meteorology for large-sample studies
    Addor, Nans
    Newman, Andrew J.
    Mizukami, Naoki
    Clark, Martyn P.
    [J]. HYDROLOGY AND EARTH SYSTEM SCIENCES, 2017, 21 (10) : 5293 - 5313
  • [2] Least square support vector machine and multivariate adaptive regression splines for streamflow prediction in mountainous basin using hydro-meteorological data as inputs
    Adnan, Rana Muhammad
    Liang, Zhongmin
    Heddam, Salim
    Zounemat-Kermani, Mohammad
    Kisi, Ozgur
    Li, Binquan
    [J]. JOURNAL OF HYDROLOGY, 2020, 586 (586)
  • [3] Input attributes optimization using the feasibility of genetic nature inspired algorithm: Application of river flow forecasting
    Afan, Haitham Abdulmohsin
    Allawi, Mohammed Falah
    El-Shafie, Amr
    Yaseen, Zaher Mundher
    Ahmed, Ali Najah
    Malek, Marlinda Abdul
    Koting, Suhana Binti
    Salih, Sinan Q.
    Mohtar, Wan Hanna Melini Wan
    Lai, Sai Hin
    Sefelnasr, Ahmed
    Sherif, Mohsen
    El-Shafie, Ahmed
    [J]. SCIENTIFIC REPORTS, 2020, 10 (01)
  • [4] Performance Assessment of the Linear, Nonlinear and Nonparametric Data Driven Models in River Flow Forecasting
    Ahani, Ali
    Shourian, Mojtaba
    Rad, Peiman Rahimi
    [J]. WATER RESOURCES MANAGEMENT, 2018, 32 (02) : 383 - 399
  • [5] A stepwise model to predict monthly streamflow
    Al-Juboori, Anas Mahmood
    Guven, Aytac
    [J]. JOURNAL OF HYDROLOGY, 2016, 543 : 283 - 292
  • [6] minerva and minepy: a C engine for the MINE suite and its R, Python']Python and MATLAB wrappers
    Albanese, Davide
    Filosi, Michele
    Visintainer, Roberto
    Riccadonna, Samantha
    Jurman, Giuseppe
    Furlanello, Cesare
    [J]. BIOINFORMATICS, 2013, 29 (03) : 407 - 408
  • [7] Two hybrid wrapper-filter feature selection algorithms applied to high-dimensional microarray experiments
    Apolloni, Javier
    Leguizamon, Guillermo
    Alba, Enrique
    [J]. APPLIED SOFT COMPUTING, 2016, 38 : 922 - 932
  • [8] Sequential assimilation of soil moisture and streamflow data in a conceptual rainfall-runoff model
    Aubert, D
    Loumagne, C
    Oudin, L
    [J]. JOURNAL OF HYDROLOGY, 2003, 280 (1-4) : 145 - 161
  • [9] Generalization performance of support vector machines and neural networks in runoff modeling
    Behzad, Mohsen
    Asghari, Keyvan
    Eazi, Morten
    Palhang, Maziar
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (04) : 7624 - 7629
  • [10] Selection of relevant features and examples in machine learning
    Blum, AL
    Langley, P
    [J]. ARTIFICIAL INTELLIGENCE, 1997, 97 (1-2) : 245 - 271