Comparison of several linear statistical models to predict tropospheric ozone concentrations

被引:7
作者
Pires, J. C. M. [1 ]
Alvim-Ferraz, M. C. M. [1 ]
Pereira, M. C. [1 ]
Martins, F. G. [1 ]
机构
[1] Univ Porto, Fac Engn, Dept Engn Quim, LEPAE, P-4200465 Oporto, Portugal
关键词
air pollution; tropospheric ozone; statistical models; concentration-level prediction; PRINCIPAL COMPONENT; CLUSTER-ANALYSIS; REGRESSION; MANAGEMENT; NO2;
D O I
10.1080/00949655.2011.623233
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This study aims to evaluate the performance of five linear statistical models in the prediction of the next-day hourly average ozone concentrations. The selected models are as follows: (i) multiple linear regression, (ii) principal component regression, (iii) independent component regression (ICR), (iv) quantile regression (QR) and (v) partial least squares regression (PLSR). As far as it has been known, no study comparing the performance of these five linear models for predicting tropospheric ozone concentrations has been presented. Moreover, it is the first time that ICR is applied with this aim. The considered ozone predictors are meteorological data (hourly averages of temperature, relative humidity and wind speed) and environmental data (hourly average concentrations of sulphur dioxide, carbon monoxide, nitrogen oxide, nitrogen dioxide and ozone) of the previous day collected at an urban site with traffic influences. The analysed periods were May and June 2003. The QR model, which tries to model the entire distribution of the O-3 concentrations, presents a better performance in the training step, because it tries to model the entire distribution of the O-3 concentrations. However, it presents worst predictions in the test step. This means that a new procedure that is better than the one applied (k-nearest neighbours algorithm) and can estimate the percentiles of the output variable in the test data set with more precision should be found. From the five statistical models tested in this study, the PLSR model presents the best predictions of the tropospheric ozone concentrations.
引用
收藏
页码:183 / 192
页数:10
相关论文
共 20 条
[1]   Regression and multilayer perceptron-based models to forecast hourly O3 and NO2 levels in the Bilbao area [J].
Agirre-Basurko, E ;
Ibarra-Berastegi, G ;
Madariaga, I .
ENVIRONMENTAL MODELLING & SOFTWARE, 2006, 21 (04) :430-446
[2]  
[Anonymous], 1962, Modern factor analysis
[3]  
[Anonymous], 2003, Encycl. for res. methods for the soc, sci.
[4]   Use of principal component scores in multiple linear regression models for prediction of Chlorophyll-a in reservoirs [J].
Çamdevyren, H ;
Demyr, N ;
Kanik, A ;
Keskyn, S .
ECOLOGICAL MODELLING, 2005, 181 (04) :581-589
[5]   Assessment of ozone variations and meteorological effects in an urban area in the Mediterranean Coast [J].
Dueñas, C ;
Fernández, MC ;
Cañete, S ;
Carretero, J ;
Liger, E .
SCIENCE OF THE TOTAL ENVIRONMENT, 2002, 299 (1-3) :97-113
[6]  
Eberly Lynn E., 2007, V404, P165, DOI 10.1007/978-1-59745-530-5_9
[7]   Statistical surface ozone models: an improved methodology to account for non-linear behaviour [J].
Gardner, MW ;
Dorling, SR .
ATMOSPHERIC ENVIRONMENT, 2000, 34 (01) :21-34
[8]   Study on the formation and transport of ozone in relation to the air quality management and vegetation protection in Tenerife (Canary Islands) [J].
Guerra, JC ;
Rodríguez, S ;
Arencibia, MT ;
García, MD .
CHEMOSPHERE, 2004, 56 (11) :1157-1167
[9]  
Hayter A.J., 2005, STAT METHODOLOGY, V3, P186
[10]   Independent component analysis:: algorithms and applications [J].
Hyvärinen, A ;
Oja, E .
NEURAL NETWORKS, 2000, 13 (4-5) :411-430