A new method of missing data imputation applied to time series of PM10 concentration

被引:0
作者
Nogarotto, Danilo Covaes [1 ]
Rissi, Nathalia Morgana [1 ]
Pozza, Simone Andrea [1 ]
机构
[1] Univ Estadual Campinas, Campinas, SP, Brazil
来源
REVISTA TECNOLOGIA E SOCIEDADE | 2019年 / 15卷 / 37期
关键词
Missing data; Box-Jenkins Methodology; Particulate Matter; Forecast; PARTICULATE MATTER; POLLUTION DATA; AIR-POLLUTION; VALUES; SINGLE; MODEL; CITY;
D O I
10.3895/rts.v15n37.8594
中图分类号
C [社会科学总论];
学科分类号
03 ; 0303 ;
摘要
The study of atmospheric pollution, with the emphasis of inhalable particulate matter (PM10), is necessary; given the damage was done for population health, besides other losses. Historical series, used for forecasting data, often have gaps due to several factors, which can detract from the quality of the forecast. The aim of this study was purposed a new method of missing data imputation, and after this, to use a time series model to forecast PM10 concentration. It was obtained the PM10 daily concentration in QUALAR system of CETESB, related to cities of Campinas, Jundiai and Paulinia, all in Sao Paulo State. The method of imputation of missing data, purpose in this study, was called TDEM (Time-Dependent Effect Method). The TDEM method was compared to others two methods ("Mean during month" and "Mean during year") of imputation of missing data, and it presented better results related to correlation coefficient, mean square error and mean absolute deviation. After imputation data of series, the data were analysed in order to forecast future PM10 concentrations. It was used ARIMA and SARIMA for time series models. The more satisfactory results were obtained for SARIMA models, which real data remained within the 95% forecast limits.
引用
收藏
页码:275 / 296
页数:22
相关论文
共 21 条
[1]   Atmospheric aerosols: Historical perspective, sources, chemical formation processes and organic composition. [J].
Alves, C .
QUIMICA NOVA, 2005, 28 (05) :859-870
[2]  
[Anonymous], 2017, R LANG ENV STAT COMP
[3]  
[Anonymous], FUNDAMENTALS AIR POL
[4]   Time-series studies of particulate matter [J].
Bell, ML ;
Samet, JM ;
Dominici, F .
ANNUAL REVIEW OF PUBLIC HEALTH, 2004, 25 :247-280
[5]  
DAMELIO M. T. S., 2017, ENSAIOS USF, V1, P80
[6]  
Ehlers R S., 2009, Analise de Series Temporais
[7]   A practical comparison of single and multiple imputation methods to handle complex missing data in air quality datasets [J].
Gomez-Carracedo, M. P. ;
Andrade, J. M. ;
Lopez-Mahia, P. ;
Muniategui, S. ;
Prada, D. .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2014, 134 :23-33
[8]   Statistical models for the prediction of respirable suspended particulate matter in urban cities [J].
Goyal, P ;
Chan, AT ;
Jaiswal, N .
ATMOSPHERIC ENVIRONMENT, 2006, 40 (11) :2068-2077
[9]  
Gujarati D. N, 2012, ECONOMETRIA BASICA
[10]   Imputation of missing data in time series for air pollutants [J].
Junger, W. L. ;
de Leon, A. Ponce .
ATMOSPHERIC ENVIRONMENT, 2015, 102 :96-104