Harnessing ensemble machine learning models for improved salinity prediction in large river basin scales

Times cited: 0
Authors
Mahmoud, Mohamed F. [1]
Arabi, Mazdak [1]
Pallickara, Shrideep [2]
Affiliations
[1] Colorado State Univ, Dept Civil & Environm Engn, 1372 Campus Delivery, Ft Collins, CO 80523 USA
[2] Colorado State Univ, Dept Comp Sci, Ft Collins, CO USA
Funding
U.S. National Science Foundation
Keywords
Machine learning; Bayesian model averaging; Spatial prediction; Stacked ensembles; XGBoost; Colorado River Basin; Salinity prediction; Neural networks; Colorado River; Regression; Classification; Trees
DOI
10.1016/j.jhydrol.2025.132691
Chinese Library Classification (CLC) number
TU [Building Science]
Discipline classification code
0813
Abstract
This study develops a robust ensemble machine learning methodology for predicting average annual salinity by combining multiple machine learning algorithms. Salt concentration is a crucial water quality indicator, and salinity issues cost $300 million annually in the U.S. Irrigated agricultural lands in the Upper Colorado River Basin contribute disproportionately to dissolved solid loads despite covering less than 2% of the basin area. The economic impact and the complex relationship between irrigation practices, groundwater dynamics, and salinity levels necessitate improved predictive capabilities at river basin scales. Using twenty years of data from 150 watersheds, eleven machine learning algorithms were evaluated through both random and spatial cross-validation approaches, with Extreme Gradient Boosting, Gradient Boosting, and Random Forest emerging as top performers. Bayesian Model Averaging (BMA) and stacked generalization were employed to create ensemble models, demonstrating enhanced performance validity. The BMA ensemble achieved better spatial generalization than individual models while requiring significantly fewer computational resources than stacking. Model uncertainty analysis revealed that BMA provided the most stable predictions among all approaches. Soil electrical conductivity and calcium carbonate content emerged as the most important predictors, followed by river flow. The resulting spatially distributed predictions revealed distinct patterns in sulfate loads and concentrations across sub-basins, providing insights for targeted salinity management. This study demonstrates the effectiveness of ensemble machine learning approaches for robust salinity prediction while highlighting the importance of comprehensive uncertainty assessment and spatial validation in environmental modeling applications.
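
The abstract names Bayesian Model Averaging as one of the two ensembling strategies but does not describe how the weights were computed. The following is a minimal sketch of one common simplification, not the authors' procedure: each model's weight is taken proportional to its Gaussian likelihood on held-out data, with equal prior weights. The function names, model labels, and all numbers are hypothetical and chosen only for illustration.

```python
import math

def gaussian_log_likelihood(y_true, y_pred):
    """Log-likelihood of the residuals under a zero-mean Gaussian
    whose variance is set to its maximum-likelihood estimate."""
    n = len(y_true)
    sse = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    sigma2 = max(sse / n, 1e-12)  # guard against a perfect fit
    return -0.5 * n * (math.log(2 * math.pi * sigma2) + 1.0)

def bma_weights(y_true, predictions):
    """Posterior model weights under equal priors: w_k proportional to exp(loglik_k)."""
    logliks = {name: gaussian_log_likelihood(y_true, pred)
               for name, pred in predictions.items()}
    top = max(logliks.values())  # subtract the maximum for numerical stability
    unnorm = {name: math.exp(ll - top) for name, ll in logliks.items()}
    total = sum(unnorm.values())
    return {name: u / total for name, u in unnorm.items()}

def bma_predict(weights, predictions):
    """Weighted average of the member predictions, one value per sample."""
    n = len(next(iter(predictions.values())))
    return [sum(weights[m] * predictions[m][i] for m in weights)
            for i in range(n)]

# Hypothetical held-out salinity values and member predictions
# (made-up numbers, not data from the study).
y_val = [410.0, 520.0, 300.0, 615.0, 480.0]
val_preds = {
    "xgboost":           [405.0, 528.0, 310.0, 600.0, 470.0],
    "gradient_boosting": [430.0, 500.0, 280.0, 640.0, 500.0],
    "random_forest":     [380.0, 560.0, 340.0, 570.0, 440.0],
}

weights = bma_weights(y_val, val_preds)
blended = bma_predict(weights, val_preds)
```

Because the weights are fixed once estimated, producing an ensemble prediction costs only a weighted sum. Stacking, by contrast, trains an additional meta-learner on the member predictions, which may be part of why the abstract reports lower computational cost for BMA. To mirror the spatial validation the abstract mentions, the held-out set here would need to come from watersheds not used in training.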
Pages: 15