Harnessing ensemble Machine learning models for improved salinity prediction in large river basin scales

被引:0
|
作者
Mahmoud, Mohamed F. [1 ]
Arabi, Mazdak [1 ]
Pallickara, Shrideep [2 ]
机构
[1] Colorado State Univ, Dept Civil & Environm Engn, 1372 Campus Delivery, Ft Collins, CO 80523 USA
[2] Colorado State Univ, Dept Comp Sci, Ft Collins, CO USA
基金
美国国家科学基金会;
关键词
Machine learning; Bayesian model averaging; Spatial prediction; Stacked ensembles; XGBoost; Colorado River Basin; Salinity prediction; NEURAL-NETWORKS; COLORADO RIVER; REGRESSION; CLASSIFICATION; TREES;
D O I
10.1016/j.jhydrol.2025.132691
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
This study develops a robust ensemble machine learning methodology for predicting average annual salinity by combining multiple machine learning algorithms. Salt concentration is a crucial water quality indicator, and salinity issues cost $300 million annually in the U.S. Irrigated agricultural lands in the Upper Colorado River Basin contribute excessively to dissolved solid loads despite covering less than 2% of the basin area. The economic impact and complex relationship between irrigation practices, groundwater dynamics, and salinity levels necessitate improved predictive capabilities at river basin scales. Using twenty years of data from 150 watersheds, eleven machine learning algorithms were evaluated through both random and spatial cross-validation approaches, with Extreme Gradient Boosting, Gradient Boosting, and Random Forest emerging as top performers. Bayesian Model Averaging and stacked generalization were employed to create ensemble models, demonstrating enhanced performance validity. The BMA ensemble achieved better spatial generalization compared to individual models while requiring significantly less computational resources than stacking. Model uncertainty analysis revealed that BMA provided the most stable predictions among all approaches. Soil electrical conductivity and calcium carbonate content emerged as the most important predictors, followed by river flow. The resulting spatially distributed predictions revealed distinct patterns in sulfate loads and concentrations across sub-basins, providing insights for targeted salinity management. This study demonstrates the effectiveness of ensemble machine learning approaches for robust salinity prediction while highlighting the importance of comprehensive uncertainty assessment and spatial validation in environmental modeling applications.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] River Water Salinity Prediction Using Hybrid Machine Learning Models
    Melesse, Assefa M.
    Khosravi, Khabat
    Tiefenbacher, John P.
    Heddam, Salim
    Kim, Sungwon
    Mosavi, Amir
    Pham, Binh Thai
    WATER, 2020, 12 (10) : 1 - 21
  • [2] Improved SVR machine learning models for agricultural drought prediction at downstream of Langat River Basin, Malaysia
    Fung, Kit Fai
    Huang, Yuk Feng
    Koo, Chai Hoon
    Mirzaei, Majid
    JOURNAL OF WATER AND CLIMATE CHANGE, 2020, 11 (04) : 1383 - 1398
  • [3] Decomposing streamflow for improved river flow prediction accuracy of machine learning models
    Elkurdy, Mostafa
    Binns, Andrew
    Gharabaghi, Bahram
    INTERNATIONAL JOURNAL OF RIVER BASIN MANAGEMENT, 2025,
  • [4] Application of machine learning ensemble models for rainfall prediction
    Hasan Ahmadi
    Babak Aminnejad
    Hojat Sabatsany
    Acta Geophysica, 2023, 71 : 1775 - 1786
  • [5] Application of machine learning ensemble models for rainfall prediction
    Ahmadi, Hasan
    Aminnejad, Babak
    Sabatsany, Hojat
    ACTA GEOPHYSICA, 2023, 71 (04) : 1775 - 1786
  • [6] Harnessing Ensemble in Machine Learning for Accurate Early Prediction and Prevention of Heart Disease
    Husain, Mohammad
    Kumar, Pankaj
    Ahmed, Mohammad Nadeem
    Ali, Arshad
    Rasool, Mohammad Ashiquee
    Hussain, Mohammad Rashid
    Dildar, Muhammad Shahid
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (10) : 182 - 195
  • [7] Ensemble of Machine Learning Models for an Improved Facial Emotion Recognition
    Pulido-Castro, Sergio
    Palacios-Quecan, Nubia
    Ballen-Cardenas, Michelle P.
    Cancino-Suarez, Sandra
    Rizo-Arevalo, Alejandra
    Lopez Lopez, Juan M.
    2021 IEEE URUCON, 2021, : 512 - 516
  • [8] Ensemble machine learning models for aviation incident risk prediction
    Zhang, Xiaoge
    Mahadevan, Sankaran
    DECISION SUPPORT SYSTEMS, 2019, 116 : 48 - 63
  • [9] Enhancing Machine Learning based QoE Prediction by Ensemble Models
    Casas, Pedro
    Seufert, Michael
    Wehner, Nikolas
    Schwind, Anika
    Wamser, Florian
    2018 IEEE 38TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS), 2018, : 1642 - 1647
  • [10] Early Prediction of Diabetes Using an Ensemble of Machine Learning Models
    Dutta, Aishwariya
    Hasan, Md Kamrul
    Ahmad, Mohiuddin
    Awal, Md Abdul
    Islam, Md Akhtarul
    Masud, Mehedi
    Meshref, Hossam
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2022, 19 (19)