Daily Streamflow Forecasting in Mountainous Catchment Using XGBoost, LightGBM and CatBoost

Cited by: 74
Authors
Szczepanek, Robert [1]
Affiliations
[1] Jagiellonian Univ, Inst Geol Sci, Fac Geog & Geol, PL-30387 Krakow, Poland
Keywords
streamflow forecasting; machine learning; XGBoost; random forest; multiple linear regression; Polish Carpathian Mountains; precipitation; model; prediction; regression; river
DOI
10.3390/hydrology9120226
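Chinese Library Classification number
TV21 [Water resources survey and water conservancy planning]
Discipline classification code
081501
Abstract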
Streamflow forecasting in mountainous catchments is, and will continue to be, one of the important hydrological tasks. In recent years, machine learning models have been increasingly used for such forecasts. Our main contribution is a direct comparison of three gradient boosting models (XGBoost, LightGBM and CatBoost) for forecasting daily streamflow in a mountainous catchment. As predictors we use daily precipitation, runoff at an upstream gauge station, and observations from the two preceding days. All three algorithms are simple to implement in Python, fast and robust. Compared to deep learning models (such as LSTM), they allow easy interpretation of the significance of predictors. All tested models achieved a Nash-Sutcliffe model efficiency (NSE) in the range of 0.85-0.89 and an RMSE in the range of 6.8-7.8 m³ s⁻¹. A training data series of at least 12 years is required for such a result. XGBoost, although the most popular of the three, did not turn out to be the best model for the daily streamflow forecast. With default model parameters, the best results were obtained with CatBoost. With optimized hyperparameters, the best forecast results were obtained with LightGBM. The differences between the models are much smaller than the differences within a single model when suboptimal hyperparameters are used.
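The two evaluation metrics named in the abstract, and the idea of using observations from preceding days as predictors, can be illustrated with a short NumPy sketch. This is not the author's code: the function names, the argument names and the exact column layout of the feature matrix are assumptions made for illustration, and the abstract does not specify how the lagged predictors were arranged.

```python
import numpy as np


def nse(observed, simulated):
    """Nash-Sutcliffe efficiency: 1 is a perfect fit, 0 matches the mean of the observations."""
    observed = np.asarray(observed, dtype=float)
    simulated = np.asarray(simulated, dtype=float)
    residual = np.sum((observed - simulated) ** 2)
    variance = np.sum((observed - observed.mean()) ** 2)
    return float(1.0 - residual / variance)


def rmse(observed, simulated):
    """Root mean square error, in the units of the input series."""
    observed = np.asarray(observed, dtype=float)
    simulated = np.asarray(simulated, dtype=float)
    return float(np.sqrt(np.mean((observed - simulated) ** 2)))


def make_lagged_features(precip, upstream, target, n_lags=2):
    """Build a feature matrix from same-day and lagged predictors.

    Assumed (hypothetical) layout per row for day t:
    [P(t), Qup(t), P(t-1), Qup(t-1), ..., Q(t-1), ..., Q(t-n_lags)]
    with Q(t) as the value to forecast.
    """
    precip = np.asarray(precip, dtype=float)
    upstream = np.asarray(upstream, dtype=float)
    target = np.asarray(target, dtype=float)
    n = len(target)
    columns = []
    # Precipitation and upstream runoff for the same day and each preceding day.
    for lag in range(0, n_lags + 1):
        columns.append(precip[n_lags - lag : n - lag])
        columns.append(upstream[n_lags - lag : n - lag])
    # Streamflow at the forecast gauge for the preceding days only.
    for lag in range(1, n_lags + 1):
        columns.append(target[n_lags - lag : n - lag])
    X = np.column_stack(columns)
    y = target[n_lags:]
    return X, y
```

The resulting `X` and `y` arrays could then be passed to any regressor with a scikit-learn style `fit`/`predict` interface, and the forecasts scored with `nse` and `rmse` on a held-out period.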
Pages: 18