Applying PCA to Deep Learning Forecasting Models for Predicting PM2.5

被引:29
作者
Choi, Sang Won [1 ]
Kim, Brian H. S. [1 ,2 ]
机构
[1] Seoul Natl Univ, Dept Agr Econ & Rural Dev, Gwanak Ro 1, Seoul 08826, South Korea
[2] Seoul Natl Univ, Res Inst Agr & Life Sci, Program Agr & Forest Meteorol, 1 Gwanangno, Seoul 08826, South Korea
基金
新加坡国家研究基金会;
关键词
principal components analysis (PCA); PM2.5; recurrent neural network RNN); long short-term memory (LSTM); bidirectional LSTM (BiLSTM); deep learning; SHORT-TERM-MEMORY; AIR-POLLUTION; LSTM; NETWORK; CITIES;
D O I
10.3390/su13073726
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Fine particulate matter (PM2.5) is one of the main air pollution problems that occur in major cities around the world. A country's PM2.5 can be affected not only by country factors but also by the neighboring country's air quality factors. Therefore, forecasting PM2.5 requires collecting data from outside the country as well as from within which is necessary for policies and plans. The data set of many variables with a relatively small number of observations can cause a dimensionality problem and limit the performance of the deep learning model. This study used daily data for five years in predicting PM2.5 concentrations in eight Korean cities through deep learning models. PM2.5 data of China were collected and used as input variables to solve the dimensionality problem using principal components analysis (PCA). The deep learning models used were a recurrent neural network (RNN), long short-term memory (LSTM), and bidirectional LSTM (BiLSTM). The performance of the models with and without PCA was compared using root-mean-square error (RMSE) and mean absolute error (MAE). As a result, the application of PCA in LSTM and BiLSTM, excluding the RNN, showed better performance: decreases of up to 16.6% and 33.3% in RMSE and MAE values. The results indicated that applying PCA in deep learning time series prediction can contribute to practical performance improvements, even with a small number of observations. It also provides a more accurate basis for the establishment of PM2.5 reduction policy in the country.
引用
收藏
页数:30
相关论文
共 32 条
[11]  
Hochreiter S, 1997, NEURAL COMPUT, V9, P1735, DOI [10.1162/neco.1997.9.8.1735, 10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]
[12]   Forecasting stock markets using wavelet transforms and recurrent neural networks: An integrated system based on artificial bee colony algorithm [J].
Hsieh, Tsung-Jung ;
Hsiao, Hsiao-Fen ;
Yeh, Wei-Chang .
APPLIED SOFT COMPUTING, 2011, 11 (02) :2510-2525
[13]  
HUNTER JS, 1986, J QUAL TECHNOL, V18, P203
[14]  
Hwang I.C., 2018, BENEFITS MANAGEMENT, P1
[15]  
Joint Association of Related Korean Ministries of Korea, 2019, COMPR PLAN FIN DUST
[16]   Evaluation of Different Machine Learning Approaches to Forecasting PM2.5 Mass Concentrations [J].
Karimian, Hamed ;
Li, Qi ;
Wu, Chunlin ;
Qi, Yanlin ;
Mo, Yuqin ;
Chen, Gong ;
Zhang, Xianfeng ;
Sachdeva, Sonali .
AEROSOL AND AIR QUALITY RESEARCH, 2019, 19 (06) :1400-1410
[17]   Effects of short-term fine particulate matter exposure on acute respiratory infection in children [J].
Kim, Kyoung-Nam ;
Kim, Soontae ;
Lim, Youn-Hee ;
Song, In Gyu ;
Hong, Yun-Chul .
INTERNATIONAL JOURNAL OF HYGIENE AND ENVIRONMENTAL HEALTH, 2020, 229
[18]  
Kingma Diederik P., 2015, 3 INT C LEARNING REP
[19]   Distributed Deep Features Extraction Model for Air Quality Forecasting [J].
Mengara Mengara, Axel Gedeon ;
Kim, Younghak ;
Yoo, Younghwan ;
Ahn, Jaehun .
SUSTAINABILITY, 2020, 12 (19) :1-19
[20]  
Ministry of Environment, MIN ENV PRESS REL KO