Importance of land use factors in the prediction of water quality of the Upper Green River watershed, Kentucky, USA, using random forest

被引:12
作者
Venkateswarlu, Turuganti [1 ]
Anmala, Jagadeesh [2 ]
机构
[1] Natl Inst Technol NIT, Adhoc Fac, Dept Civil Engn, Tadepalligudem 534101, Andhra Prades, India
[2] Birla Inst Technol & Sci, Dept Civil Engn, Hyderabad Campus, Hyderabad 500078, Telangana, India
关键词
Random forest; Artificial neural network; Fecal coliform; Turbidity; pH; Conductivity; FECAL-COLIFORM; PATTERNS; COVER; CONTAMINATION; VARIABLES; PIEDMONT; IMPACTS;
D O I
10.1007/s10668-023-03630-1
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Surface waters are essential for meeting the needs of the world. In many regions, stream water quality is a major concern due to contamination from multiple sources. Stream water is also susceptible to climatic events and land-use practices influencing its catchment. Understanding the impact of such events on stream water quality is crucial for managing and protecting aquatic ecosystems and providing safe drinking water to communities that rely on these streams. Hence, monitoring and evaluating stream water quality holds significance in identifying potential hazards and implementing suitable management strategies. In this paper, a novel effort was made to determine the relative feature importance of a set of watershed characteristics (precipitation, temperature, urban land use, agricultural land use, and forest land-use factors) on four important water quality parameters (WQPs): fecal coliforms (FC), turbidity, pH, and conductivity of the Upper Green River watershed, Kentucky, USA. Random forest (RF), an ensemble learning method, was used to predict the WQPs from the causal parameters and determine the feature importance characteristics of the four WQPs previously mentioned. This model demonstrated that precipitation and temperature are the most influential factors on FC, turbidity, and pH. Forest land use and temperature are the two most important factors for conductivity. The novel feature importance factors of the RF model have likewise been confirmed for each WQP. In modeling stream WQPs, the developed the RF model outperformed the artificial neural network (ANN) model. Using the RF model, we obtain regression coefficients of (0.93, 0.74, and 0.94) for pH in training, testing, and overall. We obtain regression coefficients of (0.60, 0.64, and 0.61) using the ANN model. ⁠⁠⁠⁠⁠⁠⁠Overall, the RF model was more effective than the ANN model in modeling stream WQPs. The model identified precipitation and temperature as the most influential factors on FC, turbidity, and pH, while forest land use and temperature were the most important factors in determining conductivity. It is also found that land use factors are important to improve the accuracy of WQPs predictions from climate variables. The results of this study can be used by authorities to better understand and control pollution at the watershed scale.
引用
收藏
页码:23961 / 23984
页数:24
相关论文
共 46 条
[1]   Stream water quality prediction using boosted regression tree and random forest models [J].
Alnahit, Ali O. ;
Mishra, Ashok K. ;
Khan, Abdul A. .
STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT, 2022, 36 (09) :2661-2680
[2]  
Anmala, 2019, 24 HYDRO2019 INT C H, P2913
[3]   Comparison of the performance of decision tree (DT) algorithms and extreme learning machine (ELM) model in the prediction of water quality of the Upper Green River watershed [J].
Anmala, Jagadeesh ;
Turuganti, Venkateswarlu .
WATER ENVIRONMENT RESEARCH, 2021, 93 (11) :2360-2373
[4]   Statistical assessment and neural network modeling of stream water quality observations of Green River watershed, KY, USA [J].
Anmala, Jagadeesh ;
Venkateshwarlu, Turuganti .
WATER SUPPLY, 2019, 19 (06) :1831-1840
[5]   GIS and Artificial Neural Network-Based Water Quality Model for a Stream Network in the Upper Green River Basin, Kentucky, USA [J].
Anmala, Jagadeesh ;
Meier, Ouida W. ;
Meier, Albert J. ;
Grubbs, Scott .
JOURNAL OF ENVIRONMENTAL ENGINEERING, 2015, 141 (05)
[6]  
[Anonymous], about us
[7]   Cumulative impacts of landuse on water quality in a southern Appalachian watershed [J].
Bolstad, PV ;
Swank, WT .
JOURNAL OF THE AMERICAN WATER RESOURCES ASSOCIATION, 1997, 33 (03) :519-533
[8]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[9]   Relationships between land use patterns and water quality in the Taizi River basin, China [J].
Bu, Hongmei ;
Meng, Wei ;
Zhang, Yuan ;
Wan, Jun .
ECOLOGICAL INDICATORS, 2014, 41 :187-197
[10]   Effects of Land Use, Topography and Socio-Economic Factors on River Water Quality in a Mountainous Watershed with Intensive Agricultural Production in East China [J].
Chen, Jiabo ;
Lu, Jun .
PLOS ONE, 2014, 9 (08)