Importance of land use factors in the prediction of water quality of the Upper Green River watershed, Kentucky, USA, using random forest

Cited by: 12
Authors
Venkateswarlu, Turuganti [1]
Anmala, Jagadeesh [2]
Affiliations
[1] Natl Inst Technol NIT, Adhoc Fac, Dept Civil Engn, Tadepalligudem 534101, Andhra Prades, India
[2] Birla Inst Technol & Sci, Dept Civil Engn, Hyderabad Campus, Hyderabad 500078, Telangana, India
Keywords
Random forest; Artificial neural network; Fecal coliform; Turbidity; pH; Conductivity; FECAL-COLIFORM; PATTERNS; COVER; CONTAMINATION; VARIABLES; PIEDMONT; IMPACTS;
DOI
10.1007/s10668-023-03630-1
Chinese Library Classification number
X [Environmental Science, Safety Science];
Discipline classification code
08 ; 0830 ;
Abstract
Surface waters are essential for meeting the needs of the world. In many regions, stream water quality is a major concern due to contamination from multiple sources. Stream water is also susceptible to climatic events and land-use practices influencing its catchment. Understanding the impact of such events on stream water quality is crucial for managing and protecting aquatic ecosystems and providing safe drinking water to communities that rely on these streams. Hence, monitoring and evaluating stream water quality is significant for identifying potential hazards and implementing suitable management strategies. In this paper, a novel effort was made to determine the relative feature importance of a set of watershed characteristics (precipitation, temperature, urban land use, agricultural land use, and forest land-use factors) on four important water quality parameters (WQPs): fecal coliforms (FC), turbidity, pH, and conductivity of the Upper Green River watershed, Kentucky, USA. Random forest (RF), an ensemble learning method, was used to predict the WQPs from the causal parameters and to determine the feature importance characteristics of these four WQPs. This model demonstrated that precipitation and temperature are the most influential factors on FC, turbidity, and pH. Forest land use and temperature are the two most important factors for conductivity. The novel feature importance factors of the RF model have likewise been confirmed for each WQP. In modeling stream WQPs, the developed RF model outperformed the artificial neural network (ANN) model. Using the RF model, we obtain regression coefficients of 0.93, 0.74, and 0.94 for pH in training, testing, and overall, respectively. Using the ANN model, we obtain regression coefficients of 0.60, 0.64, and 0.61. Overall, the RF model was more effective than the ANN model in modeling stream WQPs.
The model identified precipitation and temperature as the most influential factors on FC, turbidity, and pH, while forest land use and temperature were the most important factors in determining conductivity. Land use factors were also found to be important for improving the accuracy of WQP predictions from climate variables. The results of this study can be used by authorities to better understand and control pollution at the watershed scale.
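The workflow summarized in the abstract (fit a random forest, score it on held-out data, rank the predictors) can be illustrated with a minimal sketch. This is not the authors' pipeline: it assumes scikit-learn's `RandomForestRegressor`, uses synthetic data in place of the Upper Green River measurements, and treats the reported "regression coefficient" as the Pearson correlation between observed and predicted values, which the abstract does not specify. The feature names, the 70/30 split, and the helper `rank_feature_importance` are hypothetical.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import train_test_split

# Hypothetical predictor names mirroring the watershed characteristics in the abstract.
FEATURES = ["precipitation", "temperature", "urban", "agriculture", "forest"]


def rank_feature_importance(X, y, seed=0):
    """Fit a random forest; return (feature, importance) pairs, highest first,
    plus the correlation between observed and predicted values on the test split."""
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.3, random_state=seed
    )
    model = RandomForestRegressor(n_estimators=200, random_state=seed)
    model.fit(X_train, y_train)
    # Assumed metric: Pearson correlation of observed vs. predicted values.
    r_test = np.corrcoef(y_test, model.predict(X_test))[0, 1]
    # Impurity-based importances; scikit-learn normalizes them to sum to 1.
    ranked = sorted(
        zip(FEATURES, model.feature_importances_),
        key=lambda pair: pair[1],
        reverse=True,
    )
    return ranked, r_test


# Synthetic stand-in data (NOT the study's measurements): the target depends
# strongly on the first column and weakly on the second, plus small noise.
rng = np.random.default_rng(0)
X = rng.uniform(0.0, 1.0, size=(300, len(FEATURES)))
y = 3.0 * X[:, 0] + 1.0 * X[:, 1] + rng.normal(0.0, 0.1, size=300)

ranked, r_test = rank_feature_importance(X, y)
for name, score in ranked:
    print(f"{name:14s} {score:.3f}")
print(f"test R = {r_test:.2f}")
```

Impurity-based importances can favor continuous or high-cardinality predictors, so a permutation-based importance computed on the held-out split is a common cross-check.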
Pages: 23961-23984
Page count: 24