Combining machine learning models through multiple data division methods for PM2.5 forecasting in Northern Xinjiang, China

被引:0
作者
Miaomiao Ren
Wei Sun
Shu Chen
机构
[1] Sun Yat-Sen University,School of Geography and Planning
[2] Xinjiang University,School of Resources and Environmental Science
[3] Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai),undefined
来源
Environmental Monitoring and Assessment | 2021年 / 193卷
关键词
Artificial neural network; Air quality forecasting; Cross-validation; Combining model; PM; concentration;
D O I
暂无
中图分类号
学科分类号
摘要
In this study, daily average PM2.5 forecasting models were developed and applied in the Northern Xinjiang, China, through combining the back propagation artificial neural network (BPANN) and multiple linear regression (MLR) with another BPANN model. The meteorological (daily average precipitation, pressure, relative humidity, temperature, and wind speed, daily maximum wind speed and sunshine hours on the same day) and air pollutant data (daily PM2.5, PM10, SO2, CO, NO2, and O3 concentrations on the previous day) in January and August of each year from 2015 to 2019 were used as candidate inputs. The optimal member and combining models were evaluated through the leave-one-out cross-validation (LOOCV), fivefold cross-validation, and hold-out methods. Twelve member models with optimal or sub-optimal performance were further used to develop the combining models. The performances of the BPANN and MLR member models were different using three data division methods. The models were evaluated more comprehensively through the LOOCV. The performances of the combining models were generally better than the member models. For both member and combining models, the PM2.5 forecasting model performance in August was generally better than in January. The correlation coefficient (R) for the validation set of the optimal combination model was about 0.87 in January and 0.946 in August. These results showed that combining linear and nonlinear models through multiple data division methods would be an effective tool to forecast PM2.5 concentrations.
引用
收藏
相关论文
共 48 条
[1]  
Chelani AB(2006)Air quality forecasting using a hybrid autoregressive and nonlinear model Atmospheric Environment 40 1774-1780
[2]  
Devotta S(2008)A hybrid ARIMA and artificial neural networks model to forecast particulate matter in urban areas: The case of Temuco Chile. Atmospheric Environment 42 8331-8340
[3]  
Diaz-Robles LA(2015)Artificial neural networks forecasting of PM2.5 pollution using air mass trajectory based geographic model and wavelet transformation Atmospheric Environment 107 118-128
[4]  
Ortega JC(1974)A predictive approach to the random effect model Biometrika 61 101-107
[5]  
Fu JS(2018)Space-time PM2.5 mapping in the severe haze region of Jing-Jin-Ji (China) using a synthetic approach [Article] Environmental Pollution 240 319-329
[6]  
Reed GD(2009)Forecasting peak air pollution levels using NARX models Engineering Applications of Artificial Intelligence 22 593-602
[7]  
Chow JC(2019)A novel spatiotemporal convolutional long short-term neural network for air pollution prediction Science of the Total Environment 654 1091-1099
[8]  
Watson JG(2018)Exposure to ambient particulate matter is associated with accelerated functional decline in idiopathic pulmonary fibrosis [Article] Chest 153 1221-1228
[9]  
Moncada-Herrera JA(2018)Development of a stacked ensemble model for forecasting and analyzing daily average PM2.5 concentrations in Beijing, China [Article] Science of the Total Environment 635 644-658
[10]  
Feng X(2014)A hybrid model for PM2.5 forecasting based on ensemble empirical mode decomposition and a general regression neural network [Article] Science of the Total Environment 496 264-274