Prediction of 5-day biochemical oxygen demand in the Buriganga River of Bangladesh using novel hybrid machine learning algorithms

被引:12
作者
Nafsin, Nabila [1 ]
Li, Jin [1 ]
机构
[1] Univ Wisconsin, Dept Civil & Environm Engn, Milwaukee, WI 53211 USA
关键词
ANN-SVM; BOD5; prediction; Buriganga river; hybrid machine learning; RF-SVM; WATER-QUALITY PARAMETERS; TREATMENT-PLANT; EVENT DETECTION; POLLUTION;
D O I
10.1002/wer.10718
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Biochemical oxygen demand (BOD) is one of the most important variables indicating stream pollution with a severe condition of organic loading and maintaining aquatic life in ecosystems. Advanced monitoring techniques such as machine learning (ML) methods have been developed for an accurate, reliable, and cost-effective prediction of BOD. This study investigated the effectiveness of four stand-alone ML algorithms, namely, artificial neural network (ANN), support vector machine (SVM), random forest (RF), and gradient boosting machine (GBM), and six novel hybrid algorithms, namely, RF-SVM, ANN-SVM, GBM-SVM, RF-ANN, GBM-ANN, and RF-GBM, in predicting BOD of the Buriganga river system of Bangladesh. The feature importance analysis of RF algorithm indicated that chemical oxygen demand (COD), total dissolved solids (TDS), conductivity, total solids (TS), suspended solids (SS), and turbidity are the most influential parameters for predicting BOD5. The significance of this study is the application of the novel hybrid models that resulted in higher prediction success; RF-SVM with the highest R-2 value (0.908). The employed novel hybrid ML models can be particularly useful for efficient and systematic data management, water pollution control, and prevention in developing countries such as Bangladesh. Practitioner Points Investigated the efficiency of four stand-alone and six novel hybrid ML models for predicting BOD in a river of Bangladesh. The significance of this study is the application of the six novel hybrid models that resulted in higher prediction success. The best three prediction models were RF-SVM, ANN-SVM, and GBM-SVM with a prediction success of 91%, 89.6%, and 88.8% respectively. ML models indicated COD, conductivity, TDS, TS, SS, and turbidity as the most influential variables for predicting BOD. The novel hybrid models can be useful for developing countries for efficient systematic data management, pollution control, and prevention strategies.
引用
收藏
页数:17
相关论文
共 50 条
[1]  
Ahammed S S., 2016, International Journal of Scientific Technology Research, V5, P36
[2]  
Ahmad MK, 2010, INT J ENVIRON RES, V4, P321
[3]   Efficient Water Quality Prediction Using Supervised Machine Learning [J].
Ahmed, Umair ;
Mumtaz, Rafia ;
Anwar, Hirra ;
Shah, Asad A. ;
Irfan, Rabia ;
Garcia-Nieto, Jose .
WATER, 2019, 11 (11)
[4]   Comparison of the performance of decision tree (DT) algorithms and extreme learning machine (ELM) model in the prediction of water quality of the Upper Green River watershed [J].
Anmala, Jagadeesh ;
Turuganti, Venkateswarlu .
WATER ENVIRONMENT RESEARCH, 2021, 93 (11) :2360-2373
[5]  
Ashiqur RahmanMd., 2010, IRANICA J OFENERGY E, V1, P81, DOI DOI 10.1016/0012-821X(95)00123-T
[6]   Predicting river water quality index using data mining techniques [J].
Babbar, Richa ;
Babbar, Sakshi .
ENVIRONMENTAL EARTH SCIENCES, 2017, 76 (14)
[7]   Biochemical oxygen demand prediction in wastewater treatment plant by using different regression analysis models [J].
Baki, Osman Tugrul ;
Aras, Egemen ;
Akdemir, Ummukulsum Ozel ;
Yilmaz, Banu .
DESALINATION AND WATER TREATMENT, 2019, 157 :79-89
[8]   Source apportionment and pollution evaluation of heavy metals in water and sediments of Buriganga River, Bangladesh, using multivariate analysis and pollution evaluation indices [J].
Bhuiyan, Mohammad Amir Hossain ;
Dampare, Samuel B. ;
Suzuki, M. A. Islam Shigeyuki .
ENVIRONMENTAL MONITORING AND ASSESSMENT, 2015, 187 (01)
[9]   Machine learning methods for imbalanced data set for prediction of faecal contamination in beach waters [J].
Bourel, Mathias ;
Segura, Angel M. ;
Crisci, Carolina ;
Lopez, Guzman ;
Sampognaro, Lia ;
Vidal, Victoria ;
Kruk, Carla ;
Piccini, Claudia ;
Perera, Gonzalo .
WATER RESEARCH, 2021, 202
[10]   Comparative analysis of surface water quality prediction performance and identification of key water parameters using different machine learning models based on big data [J].
Chen, Kangyang ;
Chen, Hexia ;
Zhou, Chuanlong ;
Huang, Yichao ;
Qi, Xiangyang ;
Shen, Ruqin ;
Liu, Fengrui ;
Zuo, Min ;
Zou, Xinyi ;
Wang, Jinfeng ;
Zhang, Yan ;
Chen, Da ;
Chen, Xingguo ;
Deng, Yongfeng ;
Ren, Hongqiang .
WATER RESEARCH, 2020, 171 (171)