Hyperparameter tuning of supervised bagging ensemble machine learning model using Bayesian optimization for estimating stormwater quality

被引:1
|
作者
Moeini, Mohammadreza [1 ]
机构
[1] Univ Illinois, Dept Civil Mat &Environm Engn, Chicago, IL 60607 USA
基金
美国国家科学基金会;
关键词
Bayesian optimization; Machine learning; Ensemble modeling; Stormwater quality; Urban watershed; TOTAL SUSPENDED-SOLIDS; RANDOM FOREST; WATER-QUALITY; REGRESSION TREE; LAND-USE; NETWORKS; VARIABILITY; PREDICTION; CATCHMENTS; LOAD;
D O I
10.1007/s40899-024-01064-9
中图分类号
TV21 [水资源调查与水利规划];
学科分类号
081501 ;
摘要
Physically based models (PBMs), including stormwater management model (SWMM), require a significant amount of in situ data and expertise to predict water quality in urban watersheds. In recent years, data-driven models have been increasingly used as an alternative for the prediction of pollutant concentrations. Supervised machine learning (ML) models have been used for estimating stormwater quality parameters. However, optimizing the structure of such ML models has rarely been considered. This study aims to comprehensively evaluate the optimization of the supervised ensemble bagging ML model for forecasting stormwater quality using an ML-based optimization method called Bayesian optimization (BO). To that end, a bagging ensemble model, namely random forest (RF), was first developed for estimating total suspended solids (TSS) concentration in urban watersheds. Eleven factors, including drainage area, land-use types, impervious area, rainfall depth, the volume of runoff, and antecedent dry days, were implemented as predictive features in the model, and their data were acquired from the National Stormwater Quality Database (NSQD). Values for the number of basic estimators, the number of basic selected features for developing basic estimators, subsamples, and the maximum depth of basic learners were optimized using BO. A sensitivity analysis was done on the ML model and the BO parameters, including acquisition function, number of initial points, and realizations. Results indicated that the accuracy of the RF model depends on all mentioned RF parameters. The performance of the best-developed RF model was satisfactory in both the training and the testing steps. This model obtained the R2 values of 0.955 and 0.915 for the training and testing step, respectively. The study demonstrated the potential of a combination of the RF models and BO for accurately predicting stormwater quality parameters.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Hyperparameter tuning of supervised bagging ensemble machine learning model using Bayesian optimization for estimating stormwater quality
    Mohammadreza Moeini
    Sustainable Water Resources Management, 2024, 10
  • [2] Bayesian Hyperparameter Optimization and Ensemble Learning for Machine Learning Models on Software Effort Estimation
    Marco, Robert
    Ahmad, Sakinah Sharifah Syed
    Ahmad, Sabrina
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (03) : 419 - 429
  • [3] Hyperparameter optimization for machine learning models based on Bayesian optimization
    Wu J.
    Chen X.-Y.
    Zhang H.
    Xiong L.-D.
    Lei H.
    Deng S.-H.
    Journal of Electronic Science and Technology, 2019, 17 (01) : 26 - 40
  • [4] Hyperparameter Optimization for Machine Learning Models Based on Bayesian Optimization
    Jia Wu
    Xiu-Yun Chen
    Hao Zhang
    Li-Dong Xiong
    Hang Lei
    Si-Hao Deng
    Journal of Electronic Science and Technology, 2019, (01) : 26 - 40
  • [5] Classification of buildings' potential for seismic damage using a machine learning model with auto hyperparameter tuning
    Kostinakis, Konstantinos
    Morfidis, Konstantinos
    Demertzis, Konstantinos
    Iliadis, Lazaros
    ENGINEERING STRUCTURES, 2023, 290
  • [6] Efficient Deep Learning Hyperparameter Tuning using Cloud Infrastructure Intelligent Distributed Hyperparameter tuning with Bayesian Optimization in the Cloud
    Ranjit, Mercy Prasanna
    Ganapathy, Gopinath
    Sridhar, Kalaivani
    Arumugham, Vikram
    2019 IEEE 12TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (IEEE CLOUD 2019), 2019, : 520 - 522
  • [7] Automated Hyperparameter Tuning and Ensemble Machine Learning Approach for Network Traffic Classification
    Chen, Liwei
    Sun, Xiu
    Li, Yuchan
    Jaseemuddin, Muhammad
    Kazi, Baha Uddin
    19TH IEEE INTERNATIONAL SYMPOSIUM ON BROADBAND MULTIMEDIA SYSTEMS AND BROADCASTING, BMSB 2024, 2024, : 690 - 695
  • [8] Deep Learning on Active Sonar Data Using Bayesian Optimization for Hyperparameter Tuning
    Berg, Henrik
    Hjelmervik, Karl Thomas
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 6546 - 6553
  • [9] Machine Learning-Based Boosted Regression Ensemble Combined with Hyperparameter Tuning for Optimal Adaptive Learning
    Isabona, Joseph
    Imoize, Agbotiname Lucky
    Kim, Yongsung
    SENSORS, 2022, 22 (10)
  • [10] Fast hyperparameter tuning using Bayesian optimization with directional derivatives
    Joy, Tinu Theckel
    Rana, Santu
    Gupta, Sunil
    Venkatesh, Svetha
    KNOWLEDGE-BASED SYSTEMS, 2020, 205