Evaluating hourly air quality forecasting in Canada with nonlinear updatable machine learning methods

被引:50
作者
Peng, Huiping [1 ]
Lima, Aranildo R. [1 ]
Teakles, Andrew [2 ]
Jin, Jian [3 ]
Cannon, Alex J. [4 ]
Hsieh, William W. [1 ]
机构
[1] Univ British Columbia, Dept Earth Ocean & Atmospher Sci, 2207 Main Mall, Vancouver, BC V6T 1Z4, Canada
[2] Environm & Climate Change Canada, Meteorol Serv Canada, Queens Sq,45 Alderney Dr, Dartmouth, NS B2Y 2N6, Canada
[3] China Normal Univ, Dept Comp Sci & Technol, Shanghai 200241, Peoples R China
[4] Environm & Climate Change Canada, Div Climate Res, POB 1700 STN CSC, Victoria, BC V8W 2Y2, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Air quality; Forecast; Machine learning; Extreme learning machine; Artificial neural network; Ozone; PM2.5; NO2; STATISTICS UMOS SYSTEM; NEURAL-NETWORK; OZONE CONCENTRATIONS; PREDICTION; POLLUTION; MODELS; REGRESSION; SANTIAGO; ENSEMBLE; WEATHER;
D O I
10.1007/s11869-016-0414-3
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Air quality data (observational and numerical) were used to produce hourly spot concentration forecasts of ozone (O-3), particulate matter 2.5 mu m (PM2.5), and nitrogen dioxide (NO2), up to 48 h for six stations across Canada-Vancouver, Edmonton, Winnipeg, Toronto, Montreal, and Halifax. Using numerical data from an air quality model (GEM-MACH15) as predictors, forecast models for pollutant concentrations were built using multiple linear regression (MLR) and multi-layer perceptron neural networks (MLPNN). A relatively new method, the extreme learning machine (ELM), was also used to overcome the limitation of linear methods as well as the large computational demand of MLPNN. In operational forecasting, the continual arrival of new data necessitates frequent model updating. This type of learning (online sequential learning) is straightforward for MLR and ELM but not for MLPNN. Forecast performance of the online sequential MLR (OSMLR) and online sequential ELM (OSELM), together with stepwise MLR, all updated daily, were compared with MLPNN updated seasonally and the benchmark climatology model. OSELM, combining relatively inexpensive frequent model updating with nonlinear modeling capability, tended to outperform the other models in mean absolute error and correlation. Compared to the linear models, the nonlinear models (OSELM and MLPNN) often had worse bias (mean error) and more severe underprediction of extreme events.
引用
收藏
页码:195 / 211
页数:17
相关论文
共 50 条
[1]  
Alemayehu D., 2015, American Journal of Environmental Sciences, V11, P440, DOI [DOI 10.3844/AJESSP.2015.440.449, 10.3844/ajessp.2015.440.449]
[2]  
[Anonymous], 2009, MACHINE LEARNING MET, DOI [DOI 10.1017/CBO9780511627217, 10.1017/CBO9780511627217]
[3]  
Anselmo D., 2010, 12 C ATM CHEM AM MET, P6
[4]  
Antonopoulos S, 2012, NATO SCI PEACE SEC C
[5]   Ambient Air Pollution and Adverse Health Effects [J].
Autrup, Herman .
HARMONY OF CIVILIZATION AND PROSPERITY FOR ALL, 2010, 2 (05) :7333-7338
[6]  
Banerjee T, 2009, ENVIRON MONIT ASSESS, V172, P547
[7]   Bagging predictors [J].
Breiman, L .
MACHINE LEARNING, 1996, 24 (02) :123-140
[8]   Air Pollution and Human Health: From Local to Global Issues [J].
Brunekreef, Bert .
HARMONY OF CIVILIZATION AND PROSPERITY FOR ALL, 2010, 2 (05) :6661-6669
[9]  
Cannon A. J., 2012, MONMLP MONOTONE MULT
[10]   Forecasting summertime surface-level ozone concentrations in the Lower Fraser Valley of British Columbia: An ensemble neural network approach [J].
Cannon, AJ ;
Lord, ER .
JOURNAL OF THE AIR & WASTE MANAGEMENT ASSOCIATION, 2000, 50 (03) :322-339