Harnessing ensemble Machine learning models for improved salinity prediction in large river basin scales

Cited by: 0
Authors
Mahmoud, Mohamed F. [1]
Arabi, Mazdak [1]
Pallickara, Shrideep [2]
Affiliations
[1] Colorado State Univ, Dept Civil & Environm Engn, 1372 Campus Delivery, Ft Collins, CO 80523 USA
[2] Colorado State Univ, Dept Comp Sci, Ft Collins, CO USA
Funding
U.S. National Science Foundation;
Keywords
Machine learning; Bayesian model averaging; Spatial prediction; Stacked ensembles; XGBoost; Colorado River Basin; Salinity prediction; NEURAL-NETWORKS; COLORADO RIVER; REGRESSION; CLASSIFICATION; TREES;
DOI
10.1016/j.jhydrol.2025.132691
Chinese Library Classification (CLC) number
TU [Building Science];
Discipline classification code
0813;
Abstract
This study develops a robust ensemble machine learning methodology for predicting average annual salinity by combining multiple machine learning algorithms. Salt concentration is a crucial water quality indicator, and salinity issues cost $300 million annually in the U.S. Irrigated agricultural lands in the Upper Colorado River Basin contribute disproportionately to dissolved solid loads despite covering less than 2% of the basin area. The economic impact and the complex relationship between irrigation practices, groundwater dynamics, and salinity levels necessitate improved predictive capabilities at river basin scales. Using twenty years of data from 150 watersheds, eleven machine learning algorithms were evaluated through both random and spatial cross-validation approaches, with Extreme Gradient Boosting, Gradient Boosting, and Random Forest emerging as top performers. Bayesian Model Averaging (BMA) and stacked generalization were employed to create ensemble models, demonstrating enhanced performance validity. The BMA ensemble achieved better spatial generalization than the individual models while requiring significantly fewer computational resources than stacking. Model uncertainty analysis revealed that BMA provided the most stable predictions among all approaches. Soil electrical conductivity and calcium carbonate content emerged as the most important predictors, followed by river flow. The resulting spatially distributed predictions revealed distinct patterns in sulfate loads and concentrations across sub-basins, providing insights for targeted salinity management. This study demonstrates the effectiveness of ensemble machine learning approaches for robust salinity prediction while highlighting the importance of comprehensive uncertainty assessment and spatial validation in environmental modeling applications.
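The abstract does not say how the BMA weights were fitted, so the following is only a minimal sketch of the general idea, not the authors' procedure. It assumes equal prior model probabilities and a Gaussian error model whose variance is set to each model's held-out mean squared error; a full BMA fit would typically estimate weights and variances jointly. The model labels (`xgb`, `gbm`, `rf`) and all numbers are made-up illustrative values, not data from the study.

```python
import math

def gaussian_log_likelihood(obs, pred):
    """Log-likelihood of obs under N(pred, sigma^2), with sigma^2 set to the MSE."""
    n = len(obs)
    mse = sum((o - p) ** 2 for o, p in zip(obs, pred)) / n
    mse = max(mse, 1e-12)  # guard against log(0) for a perfect fit
    return -0.5 * n * (math.log(2.0 * math.pi * mse) + 1.0)

def bma_weights(obs, preds_by_model):
    """Posterior model weights under equal priors: w_k proportional to exp(loglik_k)."""
    logliks = {m: gaussian_log_likelihood(obs, p) for m, p in preds_by_model.items()}
    top = max(logliks.values())  # subtract the maximum for numerical stability
    unnorm = {m: math.exp(ll - top) for m, ll in logliks.items()}
    total = sum(unnorm.values())
    return {m: u / total for m, u in unnorm.items()}

def bma_predict(weights, preds_by_model):
    """Weighted average of the member predictions at each location."""
    n = len(next(iter(preds_by_model.values())))
    return [sum(weights[m] * preds_by_model[m][i] for m in weights) for i in range(n)]

# Hypothetical held-out observations and member predictions (illustrative only).
obs = [410.0, 520.0, 630.0, 480.0, 555.0]
preds_by_model = {
    "xgb": [405.0, 530.0, 620.0, 485.0, 550.0],
    "gbm": [420.0, 505.0, 645.0, 470.0, 570.0],
    "rf":  [390.0, 540.0, 610.0, 500.0, 535.0],
}

weights = bma_weights(obs, preds_by_model)
blended = bma_predict(weights, preds_by_model)

for name, w in weights.items():
    print(f"{name}: weight = {w:.3f}")
print("blended predictions:", [round(v, 1) for v in blended])
```

In this sketch the member with the smallest held-out error receives the largest weight, and each blended value stays within the range spanned by the member predictions. For a spatial assessment like the one the abstract describes, the held-out observations would come from watersheds excluded from training rather than from a random split.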
Pages: 15