Development of an Extreme Gradient Boosting Model Integrated With Evolutionary Algorithms for Hourly Water Level Prediction

Cited by: 34
Authors
Nguyen, Duc Hai [1, 2]
Hien Le, Xuan [2]
Heo, Jae-Yeong [1]
Bae, Deg-Hyo [1]
Affiliations
[1] Sejong Univ, Dept Civil & Environm Engn, Seoul 143747, South Korea
[2] Thuyloi Univ, Fac Water Resources Engn, Hanoi 116705, Vietnam
Source
IEEE ACCESS | 2021 / Vol. 9
Keywords
Predictive models; Radio frequency; Floods; Machine learning algorithms; Genetic algorithms; Urban areas; Prediction algorithms; Extreme gradient boosting; evolutionary algorithms; water level prediction; tree-based model; urban floods; DIFFERENTIAL-EVOLUTION; GENETIC ALGORITHMS; NEURAL-NETWORK; OPTIMIZATION; WAVELET; ANFIS; CAPACITY;
DOI
10.1109/ACCESS.2021.3111287
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The establishment of reliable water level prediction models is vital for urban flood control and planning. In this paper, we develop hybrid models (GA-XGBoost and DE-XGBoost) that couple two evolutionary algorithms, a genetic algorithm (GA) and a differential evolution (DE) algorithm, with the extreme gradient boosting (XGBoost) model for hourly water level prediction. The Jungrang urban basin located on the Han River, South Korea, was selected as a case study for the proposed models. Hourly rainfall and water level data were collected between 2003 and 2020 to construct and evaluate the performance of the selected models. To compare the prediction efficiency, two other tree-based models were chosen: classification and regression tree (CART) and random forest (RF) models. A comparison of the results showed that the two hybrid models, GA-XGBoost and DE-XGBoost, outperformed RF and CART in the multistep-ahead prediction of water level: the relative errors of the hybrid models ranged from 2.18% to 9.21%, compared with 3.76% to 10.41% for RF and 2.99% to 11.88% for CART. Reliable performance was also supported by other measures. In general, the GA-XGBoost and DE-XGBoost models displayed relatively similar performance, with only small differences between them. The CART model was not preferable for multistep-ahead water level predictions, even though it yielded the lowest Akaike information criterion (AIC) value. This study verifies that, despite some drawbacks in long step-ahead prediction and model complexity, hybrid XGBoost models might be superior to many existing models for hourly water level prediction.
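The coupling described in the abstract, an evolutionary search wrapped around a tree-boosting model, can be illustrated with a minimal differential evolution (DE/rand/1/bin) loop. This is a sketch under stated assumptions and not the authors' implementation: the function names, the two tuned hyperparameters (`learning_rate`, `max_depth`), their bounds, and the DE settings are all hypothetical, and a simple analytic function stands in for the validation error that a trained XGBoost model would return.

```python
import random


def differential_evolution(objective, bounds, pop_size=20, generations=60,
                           f=0.8, cr=0.9, seed=0):
    """Minimise `objective` inside box `bounds` using DE/rand/1/bin."""
    rng = random.Random(seed)
    dim = len(bounds)
    # Random initial population inside the search box.
    pop = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(pop_size)]
    scores = [objective(ind) for ind in pop]
    for _ in range(generations):
        for i in range(pop_size):
            # Three distinct donors, none of them the target individual i.
            a, b, c = rng.sample([j for j in range(pop_size) if j != i], 3)
            j_rand = rng.randrange(dim)  # guarantees at least one mutated gene
            trial = []
            for d in range(dim):
                if rng.random() < cr or d == j_rand:
                    value = pop[a][d] + f * (pop[b][d] - pop[c][d])
                    lo, hi = bounds[d]
                    value = min(max(value, lo), hi)  # clip to the bounds
                else:
                    value = pop[i][d]
                trial.append(value)
            trial_score = objective(trial)
            # Greedy selection: keep the trial only if it is no worse.
            if trial_score <= scores[i]:
                pop[i], scores[i] = trial, trial_score
    best = min(range(pop_size), key=scores.__getitem__)
    return pop[best], scores[best]


def stand_in_validation_error(params):
    """ASSUMPTION: placeholder for 'train the model, return validation error'.

    In a real hybrid model this would fit a boosted-tree regressor with the
    candidate hyperparameters and score it on held-out water level data.
    """
    learning_rate, max_depth = params
    return (learning_rate - 0.1) ** 2 + ((max_depth - 6.0) / 10.0) ** 2


# Hypothetical search ranges for (learning_rate, max_depth).
BOUNDS = [(0.01, 0.5), (2.0, 12.0)]
best_params, best_error = differential_evolution(stand_in_validation_error, BOUNDS)
```

Replacing `stand_in_validation_error` with a function that trains and scores an actual model would turn this into a hyperparameter search of the general kind the abstract describes. A genetic algorithm variant would swap the mutation and crossover step for selection, crossover, and mutation operators on the same population.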
Pages: 125853-125867
Number of pages: 15