A novel regression imputation framework for Tehran air pollution monitoring network using outputs from WRF and CAMx models

Cited by: 38
Authors
Shahbazi, Hossein [1]
Karimi, Sajjad [2]
Hosseini, Vahid [1]
Yazgi, Daniel [3]
Torbatian, Sara [4]
Affiliations
[1] Sharif Univ Technol, Dept Mech Engn, Tehran, Iran
[2] Sharif Univ Technol, Dept Elect Engn, Tehran, Iran
[3] Univ Tehran, Inst Geophys, Tehran, Iran
[4] Tehran Air Qual Control Co, Tehran, Iran
Keywords
Urban areas; Regression imputation; Elastic networks; ANFIS; Air pollutants; Concentration retrieval; Tehran air pollution forecasting system; MISSING VALUES; POLLUTANTS; SELECTION
DOI
10.1016/j.atmosenv.2018.05.055
Chinese Library Classification number
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
Missing or incomplete data over short or long intervals is a common problem in air pollution measurement. Missing data can cause severe problems for time-series prediction schemes and for mean analysis. This study aimed to develop a new regression imputation framework to impute missing values in the hourly air quality data set of Tehran and to enhance the applicability of the Tehran Air Pollution Forecasting System (TAPFS). The proposed framework was designed around three types of features: measurements from other stations, and outputs of the WRF and CAMx physical models. In this framework, elastic net and neuro-fuzzy networks were combined in a two-layer structure. The framework was applied to Tehran's air pollution monitoring network. The hourly imputation results of the proposed method were superior to those of existing methods according to statistical criteria such as RMSE, MAE and R-values. Average R-values of 0.88, 0.73, 0.76 and 0.79 were obtained for O3, NO, PM2.5 and PM10, respectively. The measurements from other stations carried most of the predictive power, and the two physical models added a modest increase. The benefit of the models was somewhat higher for stations on the boundaries of the monitoring network. In addition, central stations performed better than boundary stations, with an average R-value approximately 0.05 higher.
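The regression imputation idea in the abstract can be illustrated with a minimal sketch. It assumes only that a target station's missing hourly values are predicted from simultaneous readings at other stations using an elastic net; the second (neuro-fuzzy) layer and the WRF and CAMx features are omitted, and the abstract does not say how the layers are connected. All function names, the penalty settings and the sample numbers are hypothetical, and the elastic net is a plain coordinate-descent implementation rather than the authors' code.

```python
import math

def soft_threshold(value, threshold):
    # Shrink value toward zero by threshold (the L1 part of the elastic net).
    if value > threshold:
        return value - threshold
    if value < -threshold:
        return value + threshold
    return 0.0

def fit_elastic_net(X, y, alpha=0.1, l1_ratio=0.5, n_iter=500):
    # Coordinate descent on (1/2n)*||y - Xw - b||^2
    #   + alpha*l1_ratio*|w|_1 + 0.5*alpha*(1 - l1_ratio)*|w|_2^2.
    n, p = len(X), len(X[0])
    x_mean = [sum(row[j] for row in X) / n for j in range(p)]
    y_mean = sum(y) / n
    Xc = [[row[j] - x_mean[j] for j in range(p)] for row in X]
    residual = [yi - y_mean for yi in y]  # weights start at zero
    w = [0.0] * p
    z = [sum(row[j] ** 2 for row in Xc) / n for j in range(p)]
    for _ in range(n_iter):
        for j in range(p):
            if z[j] == 0.0:
                continue  # constant feature carries no information
            rho = sum(Xc[i][j] * (residual[i] + Xc[i][j] * w[j]) for i in range(n)) / n
            new_wj = soft_threshold(rho, alpha * l1_ratio) / (z[j] + alpha * (1 - l1_ratio))
            delta = new_wj - w[j]
            if delta != 0.0:
                for i in range(n):
                    residual[i] -= Xc[i][j] * delta
                w[j] = new_wj
    intercept = y_mean - sum(w[j] * x_mean[j] for j in range(p))
    return w, intercept

def impute(target, X, w, intercept):
    # Replace None entries with the regression prediction; keep observed values.
    return [intercept + sum(wj * xj for wj, xj in zip(w, row)) if t is None else t
            for t, row in zip(target, X)]

def rmse(obs, pred):
    return math.sqrt(sum((o - p) ** 2 for o, p in zip(obs, pred)) / len(obs))

def mae(obs, pred):
    return sum(abs(o - p) for o, p in zip(obs, pred)) / len(obs)

def pearson_r(obs, pred):
    mo, mp = sum(obs) / len(obs), sum(pred) / len(pred)
    cov = sum((o - mo) * (p - mp) for o, p in zip(obs, pred))
    var_o = sum((o - mo) ** 2 for o in obs)
    var_p = sum((p - mp) ** 2 for p in pred)
    return cov / math.sqrt(var_o * var_p)

# Hypothetical hourly readings: two neighbouring stations and a target with gaps.
neighbours = [[30, 40], [35, 42], [50, 60], [45, 52], [60, 70], [55, 61]]
target = [34, 38, None, 47, 63, None]

observed = [(row, t) for row, t in zip(neighbours, target) if t is not None]
weights, bias = fit_elastic_net([r for r, _ in observed], [t for _, t in observed])
filled = impute(target, neighbours, weights, bias)
print(filled)
```

In practice the RMSE, MAE and R helpers would be applied to values that were deliberately hidden and then imputed, so that predictions can be compared against known measurements.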
Pages: 24-33
Page count: 10