Examination and comparison of binary metaheuristic wrapper-based input variable selection for local and global climate information-driven one-step monthly streamflow forecasting

被引:27
作者
Ren, Kun [1 ]
Wang, Xue [2 ]
Shi, Xiaoyu [3 ]
Qu, Jihong [1 ]
Fang, Wei [3 ]
机构
[1] North China Univ Water Resources & Elect Power, Zhengzhou 450045, Peoples R China
[2] Zheng Zhou Sci & Technol Ind Sch, Zhengzhou 450045, Peoples R China
[3] Xian Univ Technol, Xian 710048, Peoples R China
关键词
Streamflow forecasting; Input variable selection; Binary metaheuristic algorithm; Wrapper method; Machine learning; ARTIFICIAL NEURAL-NETWORKS; SUPPORT VECTOR REGRESSION; MACHINE LEARNING-METHODS; SHORT-TERM; DATA SET; MODEL; WAVELET; TIME; ALGORITHMS; PREDICTION;
D O I
10.1016/j.jhydrol.2021.126152
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
The use of data-driven models to forecast streamflow has received substantial attention from scholars in recent years. However, systematic studies have not been performed to examine binary metaheuristic wrapper-based input variable selection (BMWIVS) in real-world streamflow forecasting. In this study, we explored binary metaheuristic-based shallow machine learning wrappers for one-step monthly streamflow forecasting using local weather information and global climate indices from three catchments with different hydroclimatic conditions. First, the maximal information coefficient (MIC) was employed to investigate the correlations among the forecasting target, streamflow and candidate input variables, which included both local and global climate information. Then, the BMWIVS models obtained by combining eight binary metaheuristic algorithms, five commonly used shallow machine learning algorithms, two combined filter-based input variable selection (FIVS) methods, and two forecasting methods were examined. Finally, the performance of each model was compared with the performance of typical benchmark models, including the univariate seasonal autoregressive integrated moving average model, five machine learning algorithms with no input variable selection, and five machine learning algorithms that use five different FIVS methods. The experimental results emphasized three significant findings. First, an appropriate input variable selection method should be selected in practice because several examined wrappers were inferior to the benchmark models. Second, the BMWIVS model that combined the regularized extreme learning machine method, binary gray wolf optimizer, FIVS results-based initialization method, and forecasted values averaged over multiple runs yielded the best performance in the three cases studied. Third, the correlations in terms of the MIC between the global climate indices and streamflow were lower than those between local weather information and streamflow, and the best wrapper and FIVS would select more local weather information variables than global climate index variables, which suggests that global climate information can be complementary to local weather information for one-step monthly streamflow forecasting. These findings have remarkable practical applications for forecasting monthly streamflow.
引用
收藏
页数:20
相关论文
共 117 条
[1]   The CAMELS data set: catchment attributes and meteorology for large-sample studies [J].
Addor, Nans ;
Newman, Andrew J. ;
Mizukami, Naoki ;
Clark, Martyn P. .
HYDROLOGY AND EARTH SYSTEM SCIENCES, 2017, 21 (10) :5293-5313
[2]   Least square support vector machine and multivariate adaptive regression splines for streamflow prediction in mountainous basin using hydro-meteorological data as inputs [J].
Adnan, Rana Muhammad ;
Liang, Zhongmin ;
Heddam, Salim ;
Zounemat-Kermani, Mohammad ;
Kisi, Ozgur ;
Li, Binquan .
JOURNAL OF HYDROLOGY, 2020, 586 (586)
[3]   Input attributes optimization using the feasibility of genetic nature inspired algorithm: Application of river flow forecasting [J].
Afan, Haitham Abdulmohsin ;
Allawi, Mohammed Falah ;
El-Shafie, Amr ;
Yaseen, Zaher Mundher ;
Ahmed, Ali Najah ;
Malek, Marlinda Abdul ;
Koting, Suhana Binti ;
Salih, Sinan Q. ;
Mohtar, Wan Hanna Melini Wan ;
Lai, Sai Hin ;
Sefelnasr, Ahmed ;
Sherif, Mohsen ;
El-Shafie, Ahmed .
SCIENTIFIC REPORTS, 2020, 10 (01)
[4]   Performance Assessment of the Linear, Nonlinear and Nonparametric Data Driven Models in River Flow Forecasting [J].
Ahani, Ali ;
Shourian, Mojtaba ;
Rad, Peiman Rahimi .
WATER RESOURCES MANAGEMENT, 2018, 32 (02) :383-399
[5]   A stepwise model to predict monthly streamflow [J].
Al-Juboori, Anas Mahmood ;
Guven, Aytac .
JOURNAL OF HYDROLOGY, 2016, 543 :283-292
[6]   minerva and minepy: a C engine for the MINE suite and its R, Python']Python and MATLAB wrappers [J].
Albanese, Davide ;
Filosi, Michele ;
Visintainer, Roberto ;
Riccadonna, Samantha ;
Jurman, Giuseppe ;
Furlanello, Cesare .
BIOINFORMATICS, 2013, 29 (03) :407-408
[7]  
[Anonymous], 2011, Acm T. Intel. Syst. Tec., DOI DOI 10.1145/1961189.1961199
[8]   Two hybrid wrapper-filter feature selection algorithms applied to high-dimensional microarray experiments [J].
Apolloni, Javier ;
Leguizamon, Guillermo ;
Alba, Enrique .
APPLIED SOFT COMPUTING, 2016, 38 :922-932
[9]   Sequential assimilation of soil moisture and streamflow data in a conceptual rainfall-runoff model [J].
Aubert, D ;
Loumagne, C ;
Oudin, L .
JOURNAL OF HYDROLOGY, 2003, 280 (1-4) :145-161
[10]   Generalization performance of support vector machines and neural networks in runoff modeling [J].
Behzad, Mohsen ;
Asghari, Keyvan ;
Eazi, Morten ;
Palhang, Maziar .
EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (04) :7624-7629