Harnessing ensemble Machine learning models for improved salinity prediction in large river basin scales

被引:0
|
作者
Mahmoud, Mohamed F. [1 ]
Arabi, Mazdak [1 ]
Pallickara, Shrideep [2 ]
机构
[1] Colorado State Univ, Dept Civil & Environm Engn, 1372 Campus Delivery, Ft Collins, CO 80523 USA
[2] Colorado State Univ, Dept Comp Sci, Ft Collins, CO USA
基金
美国国家科学基金会;
关键词
Machine learning; Bayesian model averaging; Spatial prediction; Stacked ensembles; XGBoost; Colorado River Basin; Salinity prediction; NEURAL-NETWORKS; COLORADO RIVER; REGRESSION; CLASSIFICATION; TREES;
D O I
10.1016/j.jhydrol.2025.132691
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
This study develops a robust ensemble machine learning methodology for predicting average annual salinity by combining multiple machine learning algorithms. Salt concentration is a crucial water quality indicator, and salinity issues cost $300 million annually in the U.S. Irrigated agricultural lands in the Upper Colorado River Basin contribute excessively to dissolved solid loads despite covering less than 2% of the basin area. The economic impact and complex relationship between irrigation practices, groundwater dynamics, and salinity levels necessitate improved predictive capabilities at river basin scales. Using twenty years of data from 150 watersheds, eleven machine learning algorithms were evaluated through both random and spatial cross-validation approaches, with Extreme Gradient Boosting, Gradient Boosting, and Random Forest emerging as top performers. Bayesian Model Averaging and stacked generalization were employed to create ensemble models, demonstrating enhanced performance validity. The BMA ensemble achieved better spatial generalization compared to individual models while requiring significantly less computational resources than stacking. Model uncertainty analysis revealed that BMA provided the most stable predictions among all approaches. Soil electrical conductivity and calcium carbonate content emerged as the most important predictors, followed by river flow. The resulting spatially distributed predictions revealed distinct patterns in sulfate loads and concentrations across sub-basins, providing insights for targeted salinity management. This study demonstrates the effectiveness of ensemble machine learning approaches for robust salinity prediction while highlighting the importance of comprehensive uncertainty assessment and spatial validation in environmental modeling applications.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Sediment load prediction in Johor river: deep learning versus machine learning models
    Latif, Sarmad Dashti
    Chong, K. L.
    Ahmed, Ali Najah
    Huang, Y. F.
    Sherif, Mohsen
    El-Shafie, Ahmed
    APPLIED WATER SCIENCE, 2023, 13 (03)
  • [42] Sediment load prediction in Johor river: deep learning versus machine learning models
    Sarmad Dashti Latif
    K. L. Chong
    Ali Najah Ahmed
    Y. F. Huang
    Mohsen Sherif
    Ahmed El-Shafie
    Applied Water Science, 2023, 13
  • [43] Enhancing software defect prediction: a framework with improved feature selection and ensemble machine learning
    Ali, Misbah
    Mazhar, Tehseen
    Al-Rasheed, Amal
    Shahzad, Tariq
    Ghadi, Yazeed Yasin
    Khan, Muhammad Amir
    PEERJ COMPUTER SCIENCE, 2024, 10
  • [44] Prediction of soil salinity in the Upputeru river estuary catchment, India, using machine learning techniques
    Mantena, Sireesha
    Mahammood, Vazeer
    Rao, Kunjam Nageswara
    ENVIRONMENTAL MONITORING AND ASSESSMENT, 2023, 195 (08)
  • [45] Prediction of soil salinity in the Upputeru river estuary catchment, India, using machine learning techniques
    Sireesha Mantena
    Vazeer Mahammood
    Kunjam Nageswara Rao
    Environmental Monitoring and Assessment, 2023, 195
  • [46] Improved flood risk assessment using multi-model ensemble machine-learning techniques in a tropical river basin of Southern India
    Achu, A. L.
    Aju, C. D.
    Raicy, M. C.
    Bhadran, Arun
    George, Amal
    Surendran, U.
    Girishbai, Drishya
    Ajayakumar, P.
    Gopinath, Girish
    Pradhan, Biswajeet
    PHYSICAL GEOGRAPHY, 2025,
  • [47] Prediction of Water Infiltration of Three Types of Soil with Machine Learning in the Sahuayo River Basin
    Lupian-Machuca, Mitzi R.
    Cruz-Cardenas, Gustavo
    Flores-Magallon, Rebeca
    Silva-Garcia, Jose T.
    Ochoa-Estrada, Salvador
    Martinez-Trinidad, Sergio
    APPLIED AND ENVIRONMENTAL SOIL SCIENCE, 2024, 2024
  • [48] Prediction of manifest refraction using machine learning ensemble models on wavefront aberrometry data
    Hernandez, Carlos S.
    Gil, Andrea
    Casares, Ignacio
    Poderoso, Jesus
    Wehse, Alec
    Dave, Shivang R.
    Lim, Daryl
    Sanchez-Montanes, Manuel
    Lage, Eduardo
    JOURNAL OF OPTOMETRY, 2022, 15 : S22 - S31
  • [49] Prediction of diabetes disease using an ensemble of machine learning multi-classifier models
    Abnoosian, Karlo
    Farnoosh, Rahman
    Behzadi, Mohammad Hassan
    BMC BIOINFORMATICS, 2023, 24 (01)
  • [50] Feature importance analysis of solar flares and prediction research with ensemble machine learning models
    Yang, Yun
    FRONTIERS IN ASTRONOMY AND SPACE SCIENCES, 2025, 11