An ensemble learning based hybrid model and framework for air pollution forecasting

被引:0
作者
Yue-Shan Chang
Satheesh Abimannan
Hsin-Ta Chiao
Chi-Yeh Lin
Yo-Ping Huang
机构
[1] National Taipei University,Department of Computer Science and Information Engineering
[2] Galgotias University,undefined
[3] Tunghai University,undefined
[4] National Taipei University of Technology,undefined
来源
Environmental Science and Pollution Research | 2020年 / 27卷
关键词
Air pollution forecasting; Ensemble learning; LSTM; Pearson correlation coefficient; PM2.5; SVR; GBTR;
D O I
暂无
中图分类号
学科分类号
摘要
As advance of economy and industry, the impact of air pollution has gradually gained attention. In order to predict air quality, there were many studies that exploited various machine learning techniques to build predictive model for pollutant concentration or air quality prediction. However, enhancing the prediction performance always is the common problem of existing studies. Traditional templates based on machine learning and deep learning methods, such as GBTR (gradient boosted tree regression), SVR (support vector machine-based regression), and LSTM (long short-term memory), are most promising approaches to address these problems. Some previous researches showed that ensemble learning technology can improve predictive performance of other domains. In order to improve the accuracy of forecasting, in this paper, we propose a hybrid model and framework to improve the forecasting accuracy of air pollution. We not only exploit stacking-based ensemble learning scheme with Pearson correlation coefficient to calculate the correlation between different machine learning models to integrate various forecasting models together, but also construct a framework based on Spark+Hadoop machine learning and TensorFlow deep learning framework to physically integrate these models to demonstrate the next 1 to 8 h’ air pollution forecasting. We also conduct experiments and compare the result with GBTR, SVR, LSTM, and LSTM2 (version 2) models to demonstrate the proposed hybrid model’s predictive performance. The experimental results show that the hybrid model is superior to the existing models used for predicting air pollution.
引用
收藏
页码:38155 / 38168
页数:13
相关论文
共 110 条
[1]  
Akima H(1970)A new method of interpolation and smooth curve fitting based on local procedures J ACM 17 589-602
[2]  
Bai L(2018)Air pollution forecasts: an overview Int J Environ Res Public Health 15 780-36
[3]  
Wang J(2016)Ensemble based hybrid machine learning approach for sentiment classification-a review Int J Comput Appl 146 31-140
[4]  
Ma X(1996)Bagging predictors Mach Learn 24 123-1463
[5]  
Lu H(2020)An LSTM-based aggregated model for air pollution forecasting Atmos Pollut Res 11 1451-225
[6]  
Behera RN(2019)Air quality prediction using a deep neural network model J Korean Soc Atmos Environ 35 214-529
[7]  
Roy MD(2005)Air quality prediction in Milan: feed-forward neural networks, pruned neural networks and lazy learning Ecol Model 185 513-109
[8]  
Breiman L(2019)Novel method for improving air pollution prediction based on machine learning approaches: a case study applied to the capital city of Tehran Int J Geo-Inf 8 89-820
[9]  
Chang Y-S(2019)The MR-CA models for analysis of pollution sources and prediction of PM2.5 IEEE Trans Syst Man Cybernet Syst 49 814-116
[10]  
Chiao H-T(2014)Complex time series analysis of PM10 and PM2.5 for a coastal site using artificial neural network modelling and k-means clustering Atmos Environ 94 106-885