Comparison of Machine Learning Approaches with a General Linear Model To Predict Personal Exposure to Benzene

被引:15
作者
Aquilina, Noel J. [1 ,2 ]
Delgado-Saborit, Juana Maria [1 ,5 ]
Bugelli, Stefano [3 ]
Ginies, Jason Padovani [3 ]
Harrison, Roy M. [1 ,4 ]
机构
[1] Univ Birmingham, Sch Geog Earth & Environm Sci, Div Environm Hlth & Risk Management, Birmingham B15 2TT, W Midlands, England
[2] Univ Malta, Fac Sci, Dept Geosci, Msida 2080, Msd, Malta
[3] Univ Malta, Fac Sci, Dept Phys, Msida 2080, Msd, Malta
[4] King Abdulaziz Univ, Ctr Excellence Environm Studies, Dept Environm Sci, POB 80203, Jeddah 21589, Saudi Arabia
[5] Barcelona Biomed Res Pk PRBB, Barcelona Inst Global Hlth, ISGlobal, Campus MAR,Doctor Aiguader 88, Barcelona, Spain
基金
美国国家环境保护局;
关键词
VOLATILE ORGANIC-COMPOUNDS; CLASSIFIER ENSEMBLE CONSTRUCTION; PM10; CONCENTRATIONS; AIR; OUTDOOR; APPORTIONMENT; VENTILATION; DIAGNOSIS; SELECTION; AVERAGE;
D O I
10.1021/acs.est.8b03328
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Machine learning techniques (MLTs) offer great power in analyzing complex data sets and have not previously been applied to nonoccupational pollutant exposure. MLT models that can predict personal exposure to benzene have been developed and compared with a standard model using a linear regression approach (GLM). The models were tested against independent data sets obtained from three personal exposure measurement campaigns. A correlation-based feature subset (CFS) selection algorithm identified a reduced attribute set, with common attributes grouped under the use of paints in homes, upholstery materials, space heating, and environmental tobacco smoke as the attributes suitable to predict the personal exposure to benzene. Personal exposure was categorized as low, medium, and high, and for big data sets, both the GLM and MLTs show high variability in performance to correctly classify greater than 90 percentile concentrations, but the MLT models have a higher score when accounting for divergence of incorrectly classified cases. Overall, the MLTs perform at least as well as the GLM and avoid the need to input microenvironment concentrations.
引用
收藏
页码:11215 / 11222
页数:8
相关论文
共 44 条
  • [1] [Anonymous], 2009, SIGKDD Explorations, DOI DOI 10.1145/1656274.1656278
  • [2] [Anonymous], 143 HEI
  • [3] [Anonymous], 2011, IARC MONOGRAPHS EVAL, V1-100
  • [4] Comparative Modeling Approaches for Personal Exposure to Particle-Associated PAH
    Aquilina, Noel J.
    Delgado-Saborit, Juana Mari
    Gauci, Adam P.
    Baker, Stephen
    Meddings, Claire
    Harrison, Roy M.
    [J]. ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2010, 44 (24) : 9370 - 9376
  • [5] Characterization of volatile organic compounds in smoke at experimental fires
    Austin, CC
    Wang, D
    Ecobichon, DJ
    Dussault, G
    [J]. JOURNAL OF TOXICOLOGY AND ENVIRONMENTAL HEALTH-PART A, 2001, 63 (03) : 191 - 206
  • [6] Simultaneous measurement of ventilation using tracer gas techniques and VOC concentrations in homes, garages and vehicles
    Batterman, S
    Jia, CR
    Hatzivasilis, G
    Godwin, C
    [J]. JOURNAL OF ENVIRONMENTAL MONITORING, 2006, 8 (02): : 249 - 256
  • [7] Long duration tests of room air filters in cigarette smokers' homes
    Batterman, S
    Godwin, C
    Jia, CR
    [J]. ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2005, 39 (18) : 7260 - 7268
  • [8] THE MAGNITUDE OF COMPOUNDING CONSERVATISMS IN SUPERFUND RISK ASSESSMENTS
    BURMASTER, DE
    HARRIS, RH
    [J]. RISK ANALYSIS, 1993, 13 (02) : 131 - 134
  • [9] The effects of smoking status and ventilation on environmental tobacco smoke concentrations in public areas of UK pubs and bars
    Carrington, J
    Watson, AFR
    Gee, IL
    [J]. ATMOSPHERIC ENVIRONMENT, 2003, 37 (23) : 3255 - 3266
  • [10] The Effect of VOCs Exposure During Pregnancy on Newborn's Birth Weight in Mothers and Children's Environmental Health (MOCEH) Study
    Chang, Moon-Hee
    Ha, Eun-Hee
    Park, Hyesook
    Ha, Mina
    Kim, Young Ju
    Hong, Yun-Chul
    Kim, Yangho
    Roh, Young-Man
    Lee, Bo-Eun
    Seo, Ju-Hee
    Kim, Byung-Mi
    [J]. EPIDEMIOLOGY, 2011, 22 (01) : S162 - S163