Comparing different methods for statistical modeling of particulate matter in Tehran, Iran

被引:0
作者
Vahid Mehdipour
David S. Stevenson
Mahsa Memarianfard
Parveen Sihag
机构
[1] K N Toosi University of Technology,Department of Civil and Environment Engineering
[2] The University of Edinburgh,School of GeoSciences
[3] National Institute of Technology,undefined
来源
Air Quality, Atmosphere & Health | 2018年 / 11卷
关键词
Air pollution; Bayesian network; Decision tree; Support vector machine; Particulate matter;
D O I
暂无
中图分类号
学科分类号
摘要
Particulate matter has major impacts on human health in urban regions, and Tehran is one of the most polluted metropolitan cities in the world, struggling to control this pollutant more than any other contaminant. PM2.5 concentrations were predicted by three statistical modeling methods: (i) decision tree (DT), (ii) Bayesian network (BN), and (iii) support vector machine (SVM). Collected data for three consecutive years (January 2013 to January 2016) were used to develop the models. Data from the initial 2 years were employed as the training data, and measurements from the last year were used for testing the models. Twelve parameters, covering meteorological variables and concentrations of several chemical species, were explored as potential predictors of PM2.5. According to the sensitivity analysis of PM2.5 by SVM and derived explicit equations from BN and DT, PM10, NO2, SO2, and O3 are the most important predictors. Furthermore, the impacts of the predictors on the PM2.5 were assessed which the chemical precursors’ influences indicated more in comparison with meteorological parameters. Capabilities of the models were compared to each other and the support vector machine was found to be the best performing, based on evaluation criteria. Nonetheless, the decision tree and Bayesian network methods also provided acceptable results. We suggest more studies using the SVM and other methods as hybrids would lead to improved models.
引用
收藏
页码:1155 / 1165
页数:10
相关论文
共 188 条
  • [1] Aguilera PA(2011)Bayesian networks in environmental modelling Environ Model Softw 26 1376-1388
  • [2] Fernández A(2009)Theory of reproducing kernels Am Math Soc 68 337-404
  • [3] Fernández R(2014)Epidemiological time series studies of PM2.5 and daily mortality and hospital admissions: a systematic review and meta-analysis Thorax 69 660-665
  • [4] Rumí R(2014)Geomorphology evaluation of relative tectonic activity in the Tehran basin, central Alborz, northern Iran Geomorphology 29 135-145
  • [5] Salmerón A(1763)An essay towards solving a problem in the doctrine of chances. By the Late Rev. Mr. Bayes, F. R. S. Communicated by Mr. Price, in a Letter to John Canton, A. M. F. R. S Philos Trans R Soc Lond 53 370-418
  • [6] Aronszajn N(1998)Mortality and ambient fine particles in southwest Mexico City, 1993-1995 Environ Health Perspect 106 849-855
  • [7] Atkinson RW(2013)Evolution of PM2.5 measurements and standards in the U.S. and future perspectives for China Aerosol Air Qual Res 13 1197-1211
  • [8] Kang S(2014)Root mean square error (RMSE) or mean absolute error (MAE)? Arguments against avoiding RMSE in the literature Geosci Model Dev 7 1247-1250
  • [9] Anderson HR(1995)Support vector networks Mach Learn 20 273-297
  • [10] Mills IC(2005)Airborne particulate matter and human health: a review Aerosol Sci Technol 39 737-749