A tree-based statistical classification algorithm (CHAID) for identifying variables responsible for the occurrence of faecal indicator bacteria during waterworks operations

Cited by: 6
Authors
Bichler, Andrea [1]
Neumaier, Arnold [2]
Hofmann, Thilo [1]
Affiliations
[1] Univ Vienna, Dept Environm Geosci, A-1090 Vienna, Austria
[2] Univ Vienna, Dept Math, A-1090 Vienna, Austria
Keywords
Groundwater quality; Drinking water; Faecal indicator bacteria; Total coliforms; Classification tree; CHAID; GROUNDWATER; POLLUTION; RAINFALL; EVENTS
DOI
10.1016/j.jhydrol.2014.08.013
Chinese Library Classification
TU [Building Science]
Discipline classification code
0813
Abstract
Microbial contamination of groundwater used for drinking water can affect public health and is of major concern to local water authorities and water suppliers. Potential hazards need to be identified in order to protect raw water resources. We propose a non-parametric data mining technique for exploring the presence of total coliforms (TC) in a groundwater abstraction well and its relationship to readily available, continuous time series of hydrometric monitoring parameters (seven-year records of precipitation, river water levels, and groundwater heads). The original monitoring parameters were used to create an extensive generic dataset of explanatory variables by considering different accumulation or averaging periods, as well as temporal offsets of the explanatory variables. A classification tree based on the Chi-Squared Automatic Interaction Detection (CHAID) recursive partitioning algorithm revealed statistically significant relationships between precipitation and the presence of TC in both a production well and a nearby monitoring well. Different secondary explanatory variables were identified for the two wells. Elevated water levels and short-term water table fluctuations in the nearby river were found to be associated with TC in the observation well. The presence of TC in the production well was found to relate to elevated groundwater heads and fluctuations in groundwater levels. The generic variables created proved useful for increasing significance levels. The tree-based model was used to predict the occurrence of TC on the basis of hydrometric variables. © 2014 Elsevier B.V. All rights reserved.
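The two ideas in the abstract, deriving generic explanatory variables from one monitoring series and scoring them with a chi-squared test, can be illustrated with a minimal sketch. This is not the authors' implementation. The function names, the window and offset choices, the single fixed threshold, and the toy data are all assumptions made for illustration. Full CHAID also merges categories and adjusts p-values, which this sketch omits.

```python
from itertools import product


def rolling_sum(series, window):
    """Accumulate each value with the previous window-1 values (None until enough history)."""
    return [
        None if i + 1 < window else sum(series[i + 1 - window : i + 1])
        for i in range(len(series))
    ]


def shift(series, offset):
    """Delay a series by `offset` steps, so position i holds the value from i - offset."""
    if offset == 0:
        return list(series)
    return [None] * offset + list(series[:-offset])


def generic_variables(series, windows, offsets):
    """One derived variable per (accumulation window, temporal offset) pair."""
    return {(w, k): shift(rolling_sum(series, w), k) for w, k in product(windows, offsets)}


def chi_squared_2x2(flags, outcomes):
    """Pearson chi-squared statistic for two binary sequences, without continuity correction."""
    counts = [[0, 0], [0, 0]]
    for f, y in zip(flags, outcomes):
        counts[int(f)][int(y)] += 1
    n = sum(map(sum, counts))
    if n == 0:
        return 0.0
    stat = 0.0
    for i, j in product(range(2), range(2)):
        expected = sum(counts[i]) * (counts[0][j] + counts[1][j]) / n
        if expected > 0:
            stat += (counts[i][j] - expected) ** 2 / expected
    return stat


def score(values, outcomes, threshold):
    """Chi-squared association between (value > threshold) and the outcome, skipping gaps."""
    pairs = [(v > threshold, y) for v, y in zip(values, outcomes) if v is not None]
    return chi_squared_2x2([f for f, _ in pairs], [y for _, y in pairs])


if __name__ == "__main__":
    # Toy data, invented for illustration only: daily precipitation (mm) and TC presence (0/1).
    precipitation = [0, 12, 0, 0, 30, 5, 0, 0, 0, 18, 0, 0]
    tc_present = [0, 0, 1, 0, 0, 1, 1, 0, 0, 0, 1, 0]

    variables = generic_variables(precipitation, windows=[1, 2, 3], offsets=[0, 1, 2])
    # Hypothetical threshold of 10 mm; a real analysis would search for split points.
    ranked = sorted(
        ((score(v, tc_present, threshold=10), key) for key, v in variables.items()),
        reverse=True,
    )
    for stat, (window, offset) in ranked[:3]:
        print(f"window={window} d, offset={offset} d, chi2={stat:.2f}")
```

Ranking the derived variables by their chi-squared statistic mirrors, in a simplified form, how a CHAID-style tree picks the most significant predictor at each split.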
Pages: 909-917
Page count: 9