An LSTM-based aggregated model for air pollution forecasting

被引:206
作者
Chang, Yue-Shan [1 ]
Chiao, Hsin-Ta [2 ]
Abimannan, Satheesh [3 ]
Huang, Yo-Ping [4 ]
Tsai, Yi-Ting [1 ]
Lin, Kuan-Ming [1 ]
机构
[1] Natl Taipei Univ, New Taipei 237, Taiwan
[2] Tunghai Univ, Taiwan Blvd, Taichung 407, Taiwan
[3] Galgotias Univ, Greater Noida, Uttar Pradesh, India
[4] Natl Taipei Univ Technol, Taipei, Taiwan
关键词
PM2.5; Air pollution; ALSTM; LSTM; ANN; SHORT-TERM-MEMORY; NEURAL-NETWORK; QUALITY PREDICTION; PM2.5; CONCENTRATIONS; INTERPOLATION; DECOMPOSITION; EXPOSURE; PM10;
D O I
10.1016/j.apr.2020.05.015
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
During the past few years, severe air-pollution problem has garnered worldwide attention due to its effect on health and wellbeing of individuals. As a result, the analysis and prediction of air pollution has attracted a good deal of interest among researchers. The research areas include traditional machine learning, neural networks and deep learning. How to effectively and accurately predict air pollution becomes an important issue. In this paper, we propose an Aggregated LSTM (Long Short-Term Memory) model (ALSTM) based on the LSTM deep learning method. In this new model, we combine local air quality monitoring station, the station in nearby industrial areas, and the stations for external pollution sources. To improve prediction accuracy, we aggregate three LSTM models into a predictive model for early predictions based on external sources of pollution and information from nearby industrial air quality stations. We exploited the data with 17 attributes collected by Taiwan Environmental Protection Agency from 2012 to 2017 as the training data to build the ALSTM forecasting model, and we tested the model using the data collected in 2018. We conducted some experiments to compare our new ALSTM model with SVR (Support Vector Machine based Regression), GBTR (Gradient Boosted Tree Regression), LSTM, etc., in the prediction of PM2.5 for 1-8 h, and evaluated them using various assessment techniques, such as MAE, RMSE, and MAPE. The results reveal that the proposed aggregated model can effectively improve the accuracy of prediction.
引用
收藏
页码:1451 / 1463
页数:13
相关论文
共 50 条
[2]  
Athira V., 2018, Procedia Computer Science, V132, P1394, DOI 10.1016/j.procs.2018.05.068
[3]  
Barai SV, 2007, ADV SOFT COMP, V39, P290
[4]  
Baralis E, 2016, 2016 39TH INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), P1464, DOI 10.1109/MIPRO.2016.7522370
[5]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[6]   Analysis of air pollution data at a mixed source location using boosted regression trees [J].
Carslaw, David C. ;
Taylor, Paul J. .
ATMOSPHERIC ENVIRONMENT, 2009, 43 (22-23) :3563-3570
[7]   China tackles the health effects of air pollution [J].
Chen, Zhu ;
Wang, Jin-Nan ;
Ma, Guo-Xia ;
Zhang, Yan-Shen .
LANCET, 2013, 382 (9909) :1959-1960
[8]  
Cho K, 2019, J KOREAN SOC ATMOS E, V35, P214
[9]   Air quality prediction in Milan: feed-forward neural networks, pruned neural networks and lazy learning [J].
Corani, G .
ECOLOGICAL MODELLING, 2005, 185 (2-4) :513-529
[10]   A Novel Method for Improving Air Pollution Prediction Based on Machine Learning Approaches: A Case Study Applied to the Capital City of Tehran [J].
Delavar, Mahmoud Reza ;
Gholami, Amin ;
Shiran, Gholam Reza ;
Rashidi, Yousef ;
Nakhaeizadeh, Gholam Reza ;
Fedra, Kurt ;
Afshar, Smaeil Hatefi .
ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2019, 8 (02)