Comparison of machine learning algorithms to predict dissolved oxygen in an urban stream

被引:15
作者
Bolick, Madeleine M. [1 ]
Post, Christopher J. [1 ]
Naser, Mohannad-Zeyad [2 ]
Mikhailova, Elena A. [1 ]
机构
[1] Clemson Univ, Dept Forestry & Environm Conservat, Clemson, SC 29634 USA
[2] Clemson Univ, Dept Civil & Environm Engn & Earth Sci, Clemson, SC 29634 USA
基金
美国食品与农业研究所;
关键词
Artificial intelligence; Machine learning; Regression; Water quality; Water resources; Watershed; ARTIFICIAL NEURAL-NETWORK; WATER-QUALITY; CORRELATION-COEFFICIENTS; HYDROLOGICAL MODELS; INTELLIGENCE; ENVIRONMENT; CATCHMENT; GRADIENT; CLIMATE; SEWAGE;
D O I
10.1007/s11356-023-27481-5
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Water quality monitoring for urban watersheds is critical to identify the negative urbanization impacts. This study sought to identify a successful predictive machine learning model with minimal parameters from easy-to-deploy, low-cost sensors to create a monitoring system for the urban stream network, Hunnicutt Creek, in Clemson, SC, USA. A multiple linear regression model was compared to machine learning algorithms k-nearest neighbor, decision tree, random forest, and gradient boosting. These algorithms were evaluated to understand which best predicted dissolved oxygen (DO) from water temperature, conductivity, turbidity, and water level change at four locations along the urban stream. The random forest algorithm had the highest performance in predicting DO for all four sites, with Nash-Sutcliffe model efficiency coefficient (NSE) scores > 0.9 at three sites and > 0.598 at the fourth site. The random forest model was further examined using explainable artificial intelligence (XAI) and found that temperature influenced the DO predictions for three of the four sites, but there were different water quality interactions depending on site location. Calculating the land cover type in each site's sub-watershed revealed that different amounts of impervious surface and vegetation influenced water quality and the resulting DO predictions. Overall, machine learning combined with land cover data helps decision-makers better understand the nuances of urban watersheds and the relationships between urban land cover and water quality.
引用
收藏
页码:78075 / 78096
页数:22
相关论文
共 99 条
[1]   Implementation of data intelligence models coupled with ensemble machine learning for prediction of water quality index [J].
Abba, Sani Isah ;
Pham, Quoc Bao ;
Saini, Gaurav ;
Linh, Nguyen Thi Thuy ;
Ahmed, Ali Najah ;
Mohajane, Meriame ;
Khaledian, Mohammadreza ;
Abdulkadir, Rabiu Aliyu ;
Bach, Quang-Vu .
ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH, 2020, 27 (33) :41524-41539
[2]   Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI) [J].
Adadi, Amina ;
Berrada, Mohammed .
IEEE ACCESS, 2018, 6 :52138-52160
[3]   Dissolved oxygen concentration predictions for running waters with different land use land cover using a quantile regression forest machine learning technique [J].
Ahmed, Mohammad Hafez ;
Lin, Lian-Shin .
JOURNAL OF HYDROLOGY, 2021, 597
[4]   Efficient Water Quality Prediction Using Supervised Machine Learning [J].
Ahmed, Umair ;
Mumtaz, Rafia ;
Anwar, Hirra ;
Shah, Asad A. ;
Irfan, Rabia ;
Garcia-Nieto, Jose .
WATER, 2019, 11 (11)
[5]   RETRACTED: Water Quality Prediction Using Artificial Intelligence Algorithms (Retracted Article) [J].
Aldhyani, Theyazn H. H. ;
Al-Yaari, Mohammed ;
Alkahtani, Hasan ;
Maashi, Mashael .
APPLIED BIONICS AND BIOMECHANICS, 2020, 2020
[6]   Stream water quality prediction using boosted regression tree and random forest models [J].
Alnahit, Ali O. ;
Mishra, Ashok K. ;
Khan, Abdul A. .
STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT, 2022, 36 (09) :2661-2680
[7]   Untangling hybrid hydrological models with explainable artificial intelligence [J].
Althoff, Daniel ;
Bazame, Helizani Couto ;
Nascimento, Jessica Garcia .
H2OPEN JOURNAL, 2021, 4 (01) :13-28
[8]  
Anguita D., 2012, InESANN, P441
[9]  
[Anonymous], TECHNICAL REPORT
[10]  
Belghazi MI, 2018, PR MACH LEARN RES, V80