Complete ensemble empirical mode decomposition hybridized with random forest and kernel ridge regression model for monthly rainfall forecasts

被引:144
作者
Ali, Mumtaz [1 ]
Prasad, Ramendra [2 ]
Xiang, Yong [1 ]
Yaseen, Zaher Mundher [3 ]
机构
[1] Deakin Univ, Deakin SWU Joint Res Ctr Big Data, Sch Informat Technol, Burwood, Vic 3125, Australia
[2] Univ Fiji, Sch Sci & Technol, Dept Sci, Saweni, Lautoka, Fiji
[3] Duy Tan Univ, Inst Res & Dev, Da Nang 550000, Vietnam
关键词
Rainfall; Water resources; Drought; CEEMD; Random forest; KRR; ARTIFICIAL NEURAL-NETWORK; METEOROLOGICAL DATA; INPUT SELECTION; RIVER FLOW; PRECIPITATION; RUNOFF; MACHINE; OPTIMIZATION; PERFORMANCE; STREAMFLOW;
D O I
10.1016/j.jhydrol.2020.124647
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Persistent risks of extreme weather events including droughts and floods due to climate change require precise and timely rainfall forecasting. Yet, the naturally occurring non-stationarity entrenched within the rainfall time series lowers the model performances and is an ongoing research endeavour for practicing hydrologists and drought-risk evaluators. In this paper, an attempt is made to resolve the non-stationarity challenges faced by rainfall forecasting models via a complete ensemble empirical mode decomposition (CEEMD) combined with Random Forest (RF) and Kernel Ridge Regression (KRR) algorithms in designing a hybrid CEEMD-RF-KRR model in forecasting rainfall at Gilgit, Muzaffarabad, and Parachinar in Pakistan at monthly time scale. The rainfall time-series data are simultaneously factorized into respective intrinsic mode functions (IMFs) and a residual element using CEEMD. Once the significant lags of each IMF and the residual are identified, both are forecasted using the RF algorithm. Finally, the KRR model is adopted where the forecasted IMFs and the residual components are combined to generate the final forecasted rainfall. The CEEMD-RF-KRR model shows the best performances at all three sites, in comparison to the comparative models, with maximum values of correlation coefficient (0.97-0.99), Willmott's index (0.94-0.97), Nash-Sutcliffe coefficient (0.94-0.97) and Legates-McCabe's index (0.74-0.81). Furthermore, the CEEMD-RF-KRR model generated the most accurate results for Gilgit station considering the Legate-McCabe's index as base assessment criteria in addition to obtaining the lowest magnitudes of RMSE = 2.52 mm and MAE = 1.98 mm. The proposed hybrid CEEMD-RF-KRR model attained better rainfall forecasting accuracy which is imperative for agriculture, water resource management, and early drought/flooding warning systems.
引用
收藏
页数:15
相关论文
共 71 条
[1]   Input selection and optimisation for monthly rainfall forecasting in Queensland, Australia, using artificial neural networks [J].
Abbot, John ;
Marohasy, Jennifer .
ATMOSPHERIC RESEARCH, 2014, 138 :166-178
[2]  
Akhtar I.U.H., 2012, SCI DEV NET
[3]  
Alaoui A, 2015, ADV NEURAL INFORM PR, V28, P775
[4]   Multi-stage hybridized online sequential extreme learning machine integrated with Markov Chain Monte Carlo copula-Bat algorithm for rainfall forecasting [J].
Ali, Mumtaz ;
Deo, Ravinesh C. ;
Downs, Nathan J. ;
Maraseni, Tek .
ATMOSPHERIC RESEARCH, 2018, 213 :450-464
[5]   An ensemble-ANFIS based uncertainty assessment model for forecasting multi-scalar standardized precipitation index [J].
Ali, Mumtaz ;
Deo, Ravinesh C. ;
Downs, Nathan J. ;
Maraseni, Tek .
ATMOSPHERIC RESEARCH, 2018, 207 :155-180
[6]  
Alvanitopoulos P.-F., 2014, IFIP ADV INFORM COMM, P447, DOI [10.1007/978-3-662-44654-644.IFIP, DOI 10.1007/978-3-662-44654-644.IFIP]
[7]  
Anderson M.G., 1985, HYDROLOGICAL FORECAS, P372
[8]  
[Anonymous], ATMOS OCEAN SCI LETT
[9]  
[Anonymous], 2003, PRACTICAL GUIDE SUPP
[10]   Using meteorological data to forecast seasonal runoff on the River Jhelum, Pakistan [J].
Archer, D. R. ;
Fowler, H. J. .
JOURNAL OF HYDROLOGY, 2008, 361 (1-2) :10-23