Applying PCA to Deep Learning Forecasting Models for Predicting PM2.5

Cited by: 29
Authors
Choi, Sang Won [1]
Kim, Brian H. S. [1, 2]
Affiliations
[1] Seoul Natl Univ, Dept Agr Econ & Rural Dev, Gwanak Ro 1, Seoul 08826, South Korea
[2] Seoul Natl Univ, Res Inst Agr & Life Sci, Program Agr & Forest Meteorol, 1 Gwanangno, Seoul 08826, South Korea
Funding
National Research Foundation, Singapore;
Keywords
principal components analysis (PCA); PM2.5; recurrent neural network (RNN); long short-term memory (LSTM); bidirectional LSTM (BiLSTM); deep learning; SHORT-TERM-MEMORY; AIR-POLLUTION; LSTM; NETWORK; CITIES;
DOI
10.3390/su13073726
Chinese Library Classification number
X [Environmental Science, Safety Science];
Discipline classification code
08; 0830;
Abstract
Fine particulate matter (PM2.5) is one of the main air pollution problems in major cities around the world. A country's PM2.5 can be affected not only by domestic factors but also by the air quality of neighboring countries. Forecasting PM2.5, which is necessary for policies and plans, therefore requires collecting data from outside the country as well as from within. A data set with many variables and a relatively small number of observations can cause a dimensionality problem and limit the performance of a deep learning model. This study used five years of daily data to predict PM2.5 concentrations in eight Korean cities with deep learning models. PM2.5 data from China were collected as input variables, and principal components analysis (PCA) was applied to address the dimensionality problem. The deep learning models used were a recurrent neural network (RNN), long short-term memory (LSTM), and bidirectional LSTM (BiLSTM). The performance of the models with and without PCA was compared using root-mean-square error (RMSE) and mean absolute error (MAE). Applying PCA improved performance in LSTM and BiLSTM, but not in the RNN, with decreases of up to 16.6% and 33.3% in RMSE and MAE, respectively. The results indicate that applying PCA in deep learning time series prediction can contribute to practical performance improvements, even with a small number of observations. They also provide a more accurate basis for establishing PM2.5 reduction policy in the country.
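The comparison described in the abstract rests on two error metrics and a relative decrease between them. The following is a minimal sketch of that arithmetic in plain Python. The function names and the sample numbers are hypothetical and chosen only for illustration. They are not the study's data or code.

```python
import math


def rmse(actual, predicted):
    """Root-mean-square error between two equal-length sequences."""
    if len(actual) != len(predicted) or len(actual) == 0:
        raise ValueError("sequences must be non-empty and of equal length")
    squared = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(sum(squared) / len(squared))


def mae(actual, predicted):
    """Mean absolute error between two equal-length sequences."""
    if len(actual) != len(predicted) or len(actual) == 0:
        raise ValueError("sequences must be non-empty and of equal length")
    absolute = [abs(a - p) for a, p in zip(actual, predicted)]
    return sum(absolute) / len(absolute)


def percent_decrease(baseline, improved):
    """Relative reduction of an error metric, in percent of the baseline."""
    return 100.0 * (baseline - improved) / baseline


# Hypothetical daily PM2.5 values (not taken from the paper).
observed = [20.0, 25.0, 30.0, 35.0]
pred_without_pca = [23.0, 21.0, 33.0, 39.0]
pred_with_pca = [22.0, 23.0, 31.0, 37.0]

rmse_drop = percent_decrease(
    rmse(observed, pred_without_pca), rmse(observed, pred_with_pca)
)
mae_drop = percent_decrease(
    mae(observed, pred_without_pca), mae(observed, pred_with_pca)
)
print(f"RMSE decrease: {rmse_drop:.1f}%")
print(f"MAE decrease: {mae_drop:.1f}%")
```

RMSE penalizes large individual errors more heavily than MAE, so the two metrics can show different percentage decreases for the same pair of models, as the abstract reports.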
Pages: 30
Related papers
32 in total
[1]  
[Anonymous], ARXIV161002583
[2]   Does lockdown reduce air pollution? Evidence from 44 cities in northern China [J].
Bao, Rui ;
Zhang, Acheng .
SCIENCE OF THE TOTAL ENVIRONMENT, 2020, 731
[3]   Mapping of background air pollution at a fine spatial scale across the European Union [J].
Beelen, Rob ;
Hoek, Gerard ;
Pebesma, Edzer ;
Vienneau, Danielle ;
de Hoogh, Kees ;
Briggs, David J. .
SCIENCE OF THE TOTAL ENVIRONMENT, 2009, 407 (06) :1852-1867
[4]  
César Ana Cristina Gobbo, 2016, Rev. paul. pediatr., V34, P18, DOI [10.1016/j.rpped.2015.06.009, 10.1016/j.rppede.2015.12.005]
[5]  
Choe J.-I., 2015, J. Environ. Policy Adm., V23, P155, DOI 10.15301/jepa.2015.23.4.155
[6]   Recurrent neural networks for music computation [J].
Franklin, Judy A. .
INFORMS JOURNAL ON COMPUTING, 2006, 18 (03) :321-338
[7]  
Goldberg Yoav, 2017, Synthesis Lectures on Human Language Technologies, DOI [10.2200/S00762ED1V01Y201703HLT037, 10.1007/978-3-031-02165-7]
[8]  
Gong S., 2012, STUDY HLTH IMPACT MA, P1
[9]   Non-linear system modeling using LSTM neural networks [J].
Gonzalez, Jesus ;
Yu, Wen .
IFAC PAPERSONLINE, 2018, 51 (13) :485-489
[10]   Spatial and Temporal Trends of Number of Deaths Attributable to Ambient PM2.5 in the Korea [J].
Han, Changwoo ;
Kim, Soontae ;
Lim, Youn-Hee ;
Bae, Hyun-Joo ;
Hong, Yun-Chul .
JOURNAL OF KOREAN MEDICAL SCIENCE, 2018, 33 (30)