Daily Streamflow Forecasting in Mountainous Catchment Using XGBoost, LightGBM and CatBoost

被引:59
作者
Szczepanek, Robert [1 ]
机构
[1] Jagiellonian Univ, Inst Geol Sci, Fac Geog & Geol, PL-30387 Krakow, Poland
关键词
streamflow forecasting; machine learning; XGBoost; random forest; multiple linear regression; POLISH CARPATHIAN MOUNTAINS; PRECIPITATION; MODEL; PREDICTION; REGRESSION; RIVER;
D O I
10.3390/hydrology9120226
中图分类号
TV21 [水资源调查与水利规划];
学科分类号
081501 ;
摘要
Streamflow forecasting in mountainous catchments is and will continue to be one of the important hydrological tasks. In recent years machine learning models are increasingly used for such forecasts. A direct comparison of the use of the three gradient boosting models (XGBoost, LightGBM and CatBoost) to forecast daily streamflow in mountainous catchment is our main contribution. As predictors we use daily precipitation, runoff at upstream gauge station and two-day preceding observations. All three algorithms are simple to implement in Python, fast and robust. Compared to deep machine learning models (like LSTM), they allow for easy interpretation of the significance of predictors. All tested models achieved Nash-Sutcliffe model efficiency (NSE) in the range of 0.85-0.89 and RMSE in the range of 6.8-7.8 m3s-1. A minimum of 12 years of training data series is required for such a result. The XGBoost did not turn out to be the best model for the daily streamflow forecast, although it is the most popular model. Using default model parameters, the best results were obtained with CatBoost. By optimizing the hyperparameters, the best forecast results were obtained by LightGBM. The differences between the model results are much smaller than the differences within the models themselves when suboptimal hyperparameters are used.
引用
收藏
页数:18
相关论文
共 56 条
  • [1] Modelling monthly pan evaporation utilising Random Forest and deep learning algorithms
    Abed, Mustafa
    Imteaz, Monzur Alam
    Ahmed, Ali Najah
    Huang, Yuk Feng
    [J]. SCIENTIFIC REPORTS, 2022, 12 (01) : 13132
  • [2] Least square support vector machine and multivariate adaptive regression splines for streamflow prediction in mountainous basin using hydro-meteorological data as inputs
    Adnan, Rana Muhammad
    Liang, Zhongmin
    Heddam, Salim
    Zounemat-Kermani, Mohammad
    Kisi, Ozgur
    Li, Binquan
    [J]. JOURNAL OF HYDROLOGY, 2020, 586 (586)
  • [3] Afshari M., 2022, THESIS UTRECHT U UTR
  • [4] EMD-Based Predictive Deep Belief Network for Time Series Prediction: An Application to Drought Forecasting
    Agana, Norbert A.
    Homaifar, Abdollah
    [J]. HYDROLOGY, 2018, 5 (01)
  • [5] The Risk of Extreme Streamflow Drought in the Polish Carpathians-A Two-Dimensional Approach
    Baran-Gurgul, Katarzyna
    [J]. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2022, 19 (21)
  • [6] A Review on Interpretable and Explainable Artificial Intelligence in Hydroclimatic Applications
    Basagaoglu, Hakan
    Chakraborty, Debaditya
    Do Lago, Cesar
    Gutierrez, Lilianna
    Sahinli, Mehmet Arif
    Giacomoni, Marcio
    Furl, Chad
    Mirchi, Ali
    Moriasi, Daniel
    Sengor, Sema Sevinc
    [J]. WATER, 2022, 14 (08)
  • [7] Application of Machine Learning and Process-Based Models for Rainfall-Runoff Simulation in DuPage River Basin, Illinois
    Bhusal, Amrit
    Parajuli, Utsav
    Regmi, Sushmita
    Kalra, Ajay
    [J]. HYDROLOGY, 2022, 9 (07)
  • [8] Drought Trends in the Polish Carpathian Mts. in the Years 1991-2020
    Bokwa, Anita
    Klimek, Mariusz
    Krzaklewski, Pawel
    Kukulka, Wojciech
    [J]. ATMOSPHERE, 2021, 12 (10)
  • [9] Research on soil moisture prediction model based on deep learning
    Cai, Yu
    Zheng, Wengang
    Zhang, Xin
    Zhangzhong, Lili
    Xue, Xuzhang
    [J]. PLOS ONE, 2019, 14 (04):
  • [10] XGBoost: A Scalable Tree Boosting System
    Chen, Tianqi
    Guestrin, Carlos
    [J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 785 - 794