Harnessing ensemble Machine learning models for improved salinity prediction in large river basin scales

Authors
Mahmoud, Mohamed F. [1]
Arabi, Mazdak [1]
Pallickara, Shrideep [2]
Affiliations
[1] Colorado State Univ, Dept Civil & Environm Engn, 1372 Campus Delivery, Ft Collins, CO 80523 USA
[2] Colorado State Univ, Dept Comp Sci, Ft Collins, CO USA
Funding
U.S. National Science Foundation
Keywords
Machine learning; Bayesian model averaging; Spatial prediction; Stacked ensembles; XGBoost; Colorado River Basin; Salinity prediction; NEURAL-NETWORKS; COLORADO RIVER; REGRESSION; CLASSIFICATION; TREES
DOI
10.1016/j.jhydrol.2025.132691
Chinese Library Classification number
TU [Building Science]
Discipline classification code
0813
Abstract
This study develops a robust ensemble machine learning methodology for predicting average annual salinity by combining multiple machine learning algorithms. Salt concentration is a crucial water quality indicator, and salinity issues cost $300 million annually in the U.S. Irrigated agricultural lands in the Upper Colorado River Basin contribute disproportionately to dissolved-solids loads despite covering less than 2% of the basin area. The economic impact and the complex relationship between irrigation practices, groundwater dynamics, and salinity levels call for better predictive capabilities at the river-basin scale. Using twenty years of data from 150 watersheds, eleven machine learning algorithms were evaluated with both random and spatial cross-validation; Extreme Gradient Boosting, Gradient Boosting, and Random Forest emerged as the top performers. Bayesian Model Averaging (BMA) and stacked generalization were then used to build ensemble models, which showed improved performance. The BMA ensemble generalized better spatially than the individual models while requiring significantly fewer computational resources than stacking. Model uncertainty analysis showed that BMA provided the most stable predictions of all approaches. Soil electrical conductivity and calcium carbonate content were the most important predictors, followed by river flow. The resulting spatially distributed predictions revealed distinct patterns in sulfate loads and concentrations across sub-basins, providing insights for targeted salinity management. This study demonstrates the effectiveness of ensemble machine learning approaches for robust salinity prediction and highlights the importance of comprehensive uncertainty assessment and spatial validation in environmental modeling applications.
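The abstract does not say how the BMA weights were computed. As a rough illustration only, the sketch below shows one common approximation: each model's weight is proportional to its Gaussian likelihood on held-out predictions, with equal prior model probabilities. The function names, the model labels, and all numbers are hypothetical and are not taken from the paper.

```python
import math


def bma_weights(preds_by_model, observed):
    """Approximate BMA weights from held-out predictions.

    Assumptions (not from the paper): Gaussian errors, a per-model error
    variance estimated from the residuals, and equal prior probabilities.
    """
    n = len(observed)
    log_liks = {}
    for name, preds in preds_by_model.items():
        sse = sum((p - y) ** 2 for p, y in zip(preds, observed))
        sigma2 = max(sse / n, 1e-12)  # MLE of the error variance, floored
        # Gaussian log-likelihood evaluated at the MLE variance
        log_liks[name] = -0.5 * n * (math.log(2 * math.pi * sigma2) + 1)
    # Subtract the maximum before exponentiating for numerical stability
    top = max(log_liks.values())
    unnorm = {k: math.exp(v - top) for k, v in log_liks.items()}
    total = sum(unnorm.values())
    return {k: v / total for k, v in unnorm.items()}


def bma_predict(weights, new_preds_by_model):
    """Weighted average of the base-model predictions for new sites."""
    n = len(next(iter(new_preds_by_model.values())))
    return [
        sum(weights[m] * new_preds_by_model[m][i] for m in weights)
        for i in range(n)
    ]


# Hypothetical held-out salinity values (mg/L) and base-model predictions
observed = [500.0, 620.0, 710.0, 450.0]
held_out = {
    "xgboost": [510.0, 600.0, 700.0, 460.0],
    "random_forest": [480.0, 650.0, 690.0, 470.0],
    "gradient_boosting": [530.0, 640.0, 750.0, 420.0],
}
weights = bma_weights(held_out, observed)

# Hypothetical predictions for two watersheds that were not used for weighting
new_sites = {
    "xgboost": [560.0, 480.0],
    "random_forest": [590.0, 470.0],
    "gradient_boosting": [540.0, 500.0],
}
combined = bma_predict(weights, new_sites)
print(weights)
print(combined)
```

Unlike stacking, this weighting needs no second-level model to be trained, which is consistent with the abstract's remark that BMA was cheaper computationally. For a spatial assessment, the held-out predictions would come from folds that keep whole watersheds or regions together.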
Pages: 15