Developing robust arsenic awareness prediction models using machine learning algorithms

被引:38
作者
Singh, Sushant K. [1 ,5 ]
Taylor, Robert W. [1 ]
Rahman, Mohammad Mahmudur [2 ]
Pradhan, Biswajeet [3 ,4 ]
机构
[1] Montclair State Univ, Dept Earth & Environm Studies, 1 Normal Ave, Montclair, NJ 07043 USA
[2] Univ Newcastle UON, Fac Sci, GCER, Callaghan, NSW, Australia
[3] Univ Technol Sydney, Sch Syst Management & Leadership, Fac Engn & IT, Bldg 11,Level 06,81 Broadway, Ultimo, NSW 2007, Australia
[4] Sejong Univ, Dept Energy & Mineral Resources Engn, 209 Neungdong Ro, Seoul 05006, South Korea
[5] Virtusa Corp, Irvington, NJ 07111 USA
关键词
Arsenic; Awareness; Socioeconomic; Demographics; Sociobehavioral; Machine learning algorithms; SVM; RF; GIS; India; MIDDLE GANGA PLAIN; DRINKING-WATER; GROUNDWATER CONTAMINATION; HEALTH; BIHAR; BANGLADESH; INDIA; PERFORMANCE; DISTRICT; LESSONS;
D O I
10.1016/j.jenvman.2018.01.044
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Arsenic awareness plays a vital role in ensuring the sustainability of arsenic mitigation technologies. Thus far, however, few studies have dealt with the sustainability of such technologies and its associated socioeconomic dimensions. As a result, arsenic awareness prediction has not yet been fully conceptualized. Accordingly, this study evaluated arsenic awareness among arsenic-affected communities in rural India, using a structured questionnaire to record socioeconomic, demographic, and other sociobehavioral factors with an eye to assessing their association with and influence on arsenic awareness. First a logistic regression model was applied and its results compared with those produced by six state-of-the-art machine-learning algorithms (Support Vector Machine [SVM], Kernel-SVM, Decision Tree [DT], k-Nearest Neighbor [k-NN], Nave Bayes [NB], and Random Forests [RF]) as measured by their accuracy at predicting arsenic awareness. Most (63%) of the surveyed population was found to be arsenic-aware. Significant arsenic awareness predictors were divided into three types: (1) socioeconomic factors: caste, education level, and occupation; (2) water and sanitation behavior factors: number of family members involved in water collection, distance traveled and time spent for water collection, places for defecation, and materials used for handwashing after defecation; and (3) social capital and trust factors: presence of anganwadi and people's trust in other community members, NGOs, and private agencies. Moreover, individuals' having higher social network positively contributed to arsenic awareness in the communities. Results indicated that both the SVM and the RF algorithms outperformed at overall prediction of arsenic awareness a nonlinear classification problem. Lower-caste, less educated, and unemployed members of the population were found to be the most vulnerable, requiring immediate arsenic mitigation. To this end, local social institutions and NGOs could play a crucial role in arsenic awareness and outreach programs. Use of SVM or RF or a combination of the two, together with use of a larger sample size, could enhance the accuracy of arsenic awareness prediction. (C) 2018 Elsevier Ltd. All rights reserved.
引用
收藏
页码:125 / 137
页数:13
相关论文
共 63 条
[1]   Arsenic groundwater contamination and its health effects in the state of Uttar Pradesh (UP) in upper and middle Ganga plain, India: A severe danger [J].
Ahamed, Sad ;
Sengupta, Mrinal Kumar ;
Mukherjee, Amitava ;
Hossain, M. Amir ;
Das, Bhaskar ;
Nayak, Bishwajit ;
Pal, Arup ;
Mukhejee, Subhas Chandra ;
Pati, Shyamapada ;
Dutta, Rathindra Nath ;
Chattejee, Garga ;
Mukhejee, Adreesh ;
Srivastava, Rishiji ;
Chakraborti, Dipankar .
SCIENCE OF THE TOTAL ENVIRONMENT, 2006, 370 (2-3) :310-322
[2]   Value of arsenic-free drinking water to rural households in Bangladesh [J].
Ahmad, J ;
Goldar, B ;
Misra, S .
JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2005, 74 (02) :173-185
[3]  
[Anonymous], 2018, Integrated Risk Information System (IRIS) [WWW Document]
[4]  
[Anonymous], DRINK WAT QUAL S E A
[5]  
[Anonymous], SUSTAINABLE USE ARSE
[6]  
[Anonymous], INT J SUSTAIN BUILT
[7]  
[Anonymous], INFOCHANGE NEWS FEAT
[8]  
[Anonymous], 1973, NOMICS
[9]  
[Anonymous], EX SUMM ARS CONT WOR
[10]  
[Anonymous], 2016, Facts about Global Groundwater Usage