共 1 条
Predicting flood stages in watersheds with different scales using hourly rainfall dataset: A high-volume rainfall features empowered machine learning approach
被引:3
|作者:
Qiao, Lei
[1
]
Livsey, Daniel
[2
]
Wise, Jarrett
[2
]
Kadavy, Kem
[2
]
Hunt, Sherry
[2
]
Wagner, Kevin
[1
,3
]
机构:
[1] Oklahoma State Univ, Oklahoma Water Resources Ctr, Stillwater, OK 74078 USA
[2] US DA, Agroclimate & Hydraul Res Unit, Agr Res Unit, Stillwater, OK 74075 USA
[3] Oklahoma State Univ, Dept Plant & Soil Sci, Stillwater, OK 74078 USA
关键词:
Lake level prediction;
Surface runoff;
Streamflow;
Machine learning;
Feature engineering;
Hydrological modeling;
UNCERTAINTY;
MODEL;
D O I:
10.1016/j.scitotenv.2024.175231
中图分类号:
X [环境科学、安全科学];
学科分类号:
08 ;
0830 ;
摘要:
Accurate prediction of instantaneous high lake water levels and flood flows (flood stages) from micro-catchments to big river basins are critical for flood forecasting. Lake Carl Blackwell, a small-watershed reservoir in the southcentral USA, served as a primary case study due to its rich historical dataset. Bearing knowledge that both current and previous rainfall contributes to the reservoirs' water body, a series of hourly rainfall features were created to maximize predicting power, which include total rainfall amounts in the current hour, the past 2 h, 3 h, ..., 600 h in addition to previous-day lake levels. Notedly, the rainfall features are the accumulated rainfall amounts from present to previous hours rather than the rainfall amount in any specific hour. Random Forest Regression (RFR) was used to score the features' importance and predict the flood stages along with Neural Network - Multi-layer Perceptron Regression (NN-MLP), Support Vector Regression (SVR), Extreme Gradient Boosting (XGBoost), and the ordinary multi-variant linear regression (MLR) together with dimension reduced linear models of Principal Component Regression (PCR) and Partial Least Square Regression (PLSR). The prediction accuracy for the lake flood stages can be as high as 0.95 in R2, 2 , 0.11 ft. in mean absolute error (MAE), and 0.21 ft. in root mean square error (RMSE) for the testing dataset by the RFR (NN-MLP performed equally well), with small accuracy decreases by the other two non-linear algorithms of XGBoost and SVR. The linear regressions with dimension reductions had the lowest accuracy. Furthermore, our approach demonstrated high accuracy and broad applicability for surface runoff and streamflow predictions across three different-sized watersheds from micro-catchment to big river basins in the region, with increases of predicting power from earlier rainfall for larger watersheds and vice versa.
引用
收藏
页数:11
相关论文