A hybrid air quality early-warning framework: An hourly forecasting model with online sequential extreme learning machines and empirical mode decomposition algorithms

被引:71
作者
Sharma, Ekta [1 ]
Deo, Ravinesh C. [1 ]
Prasad, Ramendra [2 ]
Parisi, Alfio, V [1 ]
机构
[1] Univ Southern Queensland, Sch Sci, Adv Data Analyt Environm Modelling & Simulat Grp, Springfield, Qld 4300, Australia
[2] Univ Fiji, Sch Sci & Technol, Dept Sci, Lautoka, Fiji
关键词
Real-time air quality forecasts; Particulate matter (PM2.5; PM10); Visibility; Artificial intelligence; ICEEMDAN; SUPPORT VECTOR REGRESSION; MULTIPLE LINEAR-REGRESSION; ARTIFICIAL NEURAL-NETWORKS; PARTICULATE MATTER; SOURCE APPORTIONMENT; SOLAR-RADIATION; MISSING DATA; TIME-SERIES; POLLUTION; ENSEMBLE;
D O I
10.1016/j.scitotenv.2019.135934
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Modelling air quality with a practical tool that produces real-time forecasts to mitigate risk to public health continues to face significant challenges considering the chaotic, non-linear and high dimensional nature of air quality predictor variables. The novelty of this research is to propose a hybrid early-warning artificial intelligence (AI) framework that can emulate hourly air quality variables (i.e., Particulate Matter 2.5, PM2.5; Particulate Matter 10, PM10 and lower atmospheric visibility, VIS), the atmospheric variables associated with increased respiratory induced mortality and recurrent health-care cost. Firstly, hourly air quality data series (January-2015 to December-2017) are demarcated into their respective intrinsic mode functions (IMFs) and a residual subseries that reveal patterns and resolve data complexity characteristics, followed by partial autocorrelation function applied to each IMF and residual sub-series to unveil historical changes in air quality. To design the prescribed hybrid model, the data is partitioned into training (70%), validation (15%) and testing (15%) sub-sets. The online sequential-extreme learning machine (OS-ELM) algorithm integrated with improved complete ensemble empirical mode decomposition with adaptive noise (ICEEMDAN) is designed as a data pre-processing system to robustly extract predictive patterns and fine-tune the model generalization to a near-optimal global solution, which represents modelled air quality at hourly forecast horizons. The resulting early warning AI-based framework denoted as ICEEMDAN-OS-ELM model, is individually constructed by forecasting each IMF and residual sub-series, with hourly PM2.5, PM10, and VIS obtained by the aggregated sum of forecasted IMFs and residual sub-series. The results are benchmarked with many competing predictive approaches; e.g., hybrid ICEEMDAN-multiple-linear regression (MLR), ICEEMDAN-M5 model tree and standalone versions: OS-ELM, MLR, M5 model tree. Statistical metrics including the root-mean-square error (RMSE), mean absolute error (MAE), Willmott's Index (WI), Legates & McCabe's Index (E-LM) and Nash-Sutcliffe coefficients (E-NS) are used to evaluate the model's accuracy. Both visual and statistical results show that the proposed ICEEMDAN-OS-ELM model registers superior results, outperforming alternative comparison approaches. For instance, for PM2.5, E-LM values ranged from 0.65-0.82 vs. 0.59-0.77 for ICEEMDAN-M5 tree, 0.59-0.74 for ICEEMDAN-MLR, 0.28-0.54 for OS-ELM, 0.27-0.54 for M5 tree and 0.25-0.53 for the MLR model. For remaining air quality variables (i.e., PM10 & VIS), the objective model (ICEEMDAN-OS-ELM) outperformed the comparative models. In particular, ICEEMDAN-OS-ELM registered relatively low RMSE/MAE, ranging from approximately 0.7-1.03 mu g/m(3) (MAE), 1.01-1.47 mu g/m(3) (RMSE) for PM2.5 whereas for PM10, these metrics registered a value of 1.29-3.84 mu g/m(3) (MAE), 3.01-7.04 mu g/m(3) (RMSE) and for Visibility, they were 0.01-3.72 mu g/m(3) (MAE (Mm(-1))), 0.04-5.98 mu g/m(3) (RMSE (Mm(-1))). Visual analysis of forecasted and observed air quality through a Taylor diagram illustrates the objective model's preciseness, confirming the versatility of early warning AI-model in generating air quality forecasts. The excellent performance ascertains the hybrid model's potential utility for air quality monitoring and subsequent public health risk mitigation. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页数:23
相关论文
共 123 条
[1]   Input selection and optimisation for monthly rainfall forecasting in Queensland, Australia, using artificial neural networks [J].
Abbot, John ;
Marohasy, Jennifer .
ATMOSPHERIC RESEARCH, 2014, 138 :166-178
[2]   Application of artificial neural networks to rainfall forecasting in Queensland, Australia [J].
Abbot, John ;
Marohasy, Jennifer .
ADVANCES IN ATMOSPHERIC SCIENCES, 2012, 29 (04) :717-730
[3]   Particulate Matter Air Pollution and Cardiovascular Disease An Update to the Scientific Statement From the American Heart Association [J].
Brook, Robert D. ;
Rajagopalan, Sanjay ;
Pope, C. Arden, III ;
Brook, Jeffrey R. ;
Bhatnagar, Aruni ;
Diez-Roux, Ana V. ;
Holguin, Fernando ;
Hong, Yuling ;
Luepker, Russell V. ;
Mittleman, Murray A. ;
Peters, Annette ;
Siscovick, David ;
Smith, Sidney C., Jr. ;
Whitsel, Laurie ;
Kaufman, Joel D. .
CIRCULATION, 2010, 121 (21) :2331-2378
[4]  
AIHW B.S., 2007, BURDEN DIS INJURY AU, P337
[5]   Two-phase particle swarm optimized-support vector regression hybrid model integrated with improved empirical mode decomposition with adaptive noise for multiple-horizon electricity demand forecasting [J].
AL-Musaylh, Mohanad S. ;
Deo, Ravinesh C. ;
Li, Yan ;
Adamowski, Jan F. .
APPLIED ENERGY, 2018, 217 :422-439
[6]   Significant wave height forecasting via an extreme learning machine model integrated with improved complete ensemble empirical mode decomposition [J].
Ali, Mumtaz ;
Prasad, Ramendra .
RENEWABLE & SUSTAINABLE ENERGY REVIEWS, 2019, 104 :281-295
[7]   Multi-stage hybridized online sequential extreme learning machine integrated with Markov Chain Monte Carlo copula-Bat algorithm for rainfall forecasting [J].
Ali, Mumtaz ;
Deo, Ravinesh C. ;
Downs, Nathan J. ;
Maraseni, Tek .
ATMOSPHERIC RESEARCH, 2018, 213 :450-464
[8]   Evaluation of the TEOM(R) method for measurement of ambient particulate mass in urban areas [J].
Allen, G ;
Sioutas, C ;
Koutrakis, P ;
Reiss, R ;
Lurmann, FW ;
Roberts, PT .
JOURNAL OF THE AIR & WASTE MANAGEMENT ASSOCIATION, 1997, 47 (06) :682-689
[9]   Generalisation for neural networks through data sampling and training procedures, with applications to streamflow predictions [J].
Anctil, F ;
Lauzon, N .
HYDROLOGY AND EARTH SYSTEM SCIENCES, 2004, 8 (05) :940-958
[10]  
[Anonymous], 2000, AUSTR RAINFORESTS IS, DOI [10.1017/CBO9780511583490, DOI 10.1017/CBO9780511583490]