Analysis of Data Splitting on Streamflow Prediction using Random Forest

被引:3
作者
Puri, Diksha [1 ]
Sihag, Parveen [2 ]
Thakur, Mohindra Singh [3 ]
Jameel, Mohammed [4 ]
Chadee, Aaron Anil [5 ]
Hazi, Mohammad Azamathulla [5 ]
机构
[1] Shoolini Univ, Sch Environm Sci, Solan 173229, Himachal Prades, India
[2] Chandigarh Univ, Dept Civil Engn, Himachal Pradesh 140301, Punjab, India
[3] Shoolini Univ, Dept Civil Engn, Solan 173229, Himachal Prades, India
[4] King Khalid Univ, Dept Civil Engn, Abha, Saudi Arabia
[5] Univ West Indies, Dept Civil & Environm Engn, St Augustine, Trinidad Tobago
关键词
streamflow prediction; Kesinga basin; data separation ratios; random forest; 60/40; COMPUTER-AIDED DIAGNOSIS; WATER-RESOURCES; TIME-SERIES; NETWORK; WAVELET;
D O I
10.3934/environsci.2024029
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
This study is focused on the use of random forest (RF) to forecast the streamflow in the Kesinga River basin. A total of 169 data points were gathered monthly for the years 1991-2004 to create a model for streamflow prediction. The dataset was allotted into training and testing stages using various ratios, such as 50/50, 60/40, 70/30, and 80/20. The produced models were evaluated using three statistical indices: the root mean square error (RMSE), the mean absolute error (MAE), and the correlation coefficient (CC). The analysis of the models' performances revealed that the training and testing ratios had a substantial impact on the RF model's predictive abilities; models performed best when the ratio was 60/40. The findings demonstrated the right dataset ratios for precise streamflow prediction, which will be beneficial for hydraulic engineers during the water-related design and engineering stages of water projects.
引用
收藏
页码:593 / 609
页数:17
相关论文
共 51 条
  • [1] Development of a short-term river flood forecasting method for snowmelt driven floods based on wavelet and cross-wavelet analysis
    Adarnowski, Jan F.
    [J]. JOURNAL OF HYDROLOGY, 2008, 353 (3-4) : 247 - 266
  • [2] A new method for prediction of air pollution based on intelligent computation
    Al-Janabi, Samaher
    Mohammad, Mustafa
    Al-Sultan, Ali
    [J]. SOFT COMPUTING, 2020, 24 (01) : 661 - 680
  • [3] The Influence of Data Length on the Performance of Artificial Intelligence Models in Predicting Air Pollution
    AlOmar, Mohamed Khalid
    Khaleel, Faidhalrahman
    AlSaadi, Abdulwahab Abdulrazaaq
    Hameed, Mohammed Majeed
    AlSaadi, Mohammed Abdulhakim
    Al-Ansari, Nadhir
    [J]. ADVANCES IN METEOROLOGY, 2022, 2022
  • [4] Continuous streamflow prediction in ungauged basins: long short-term memory neural networks clearly outperform traditional hydrological models
    Arsenault, Richard
    Martel, Jean-Luc
    Brunet, Frederic
    Brissette, Francois
    Mai, Juliane
    [J]. HYDROLOGY AND EARTH SYSTEM SCIENCES, 2023, 27 (01) : 139 - 157
  • [5] Asteris P. G., 2020, Metaheuristic Comput. Appl, V1, P63, DOI [10.12989/mca.2020.1.1.063, DOI 10.12989/MCA.2020.1.1.063]
  • [6] Non-renewable groundwater use and groundwater depletion: a review
    Bierkens, Marc F. P.
    Wada, Yoshihide
    [J]. ENVIRONMENTAL RESEARCH LETTERS, 2019, 14 (06)
  • [7] Landslide susceptibility assesssment in the Uttarakhand area (India) using GIS: a comparison study of prediction capability of na⟨ve bayes, multilayer perceptron neural networks, and functional trees methods
    Binh Thai Pham
    Dieu Tien Bui
    Pourghasemi, Hamid Reza
    Indra, Prakash
    Dholakia, M. B.
    [J]. THEORETICAL AND APPLIED CLIMATOLOGY, 2017, 128 (1-2) : 255 - 273
  • [8] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [9] Wavelet analysis of variability in annual Canadian streamflows
    Coulibaly, P
    Burn, DH
    [J]. WATER RESOURCES RESEARCH, 2004, 40 (03) : W031051 - W0310514
  • [10] Comparison of tree-based machine learning algorithms for predicting liquefaction potential using canonical correlation forest, rotation forest, and random forest based on CPT data
    Demir, Selcuk
    Sahin, Emrehan Kutlug
    [J]. SOIL DYNAMICS AND EARTHQUAKE ENGINEERING, 2022, 154