Selection of key features for PM2.5 prediction using a wavelet model and RBF-LSTM

被引:44
作者
Chen, Yi-Chung [1 ]
Li, Dong-Chi [1 ]
机构
[1] Natl Yunlin Univ Sci & Technol, Dept Ind Engn & Management, Touliu, Yunlin, Taiwan
关键词
PM2; 5; prediction; Feature selection; Deep learning model; Wavelet analysis; NEURAL-NETWORKS; AIR-POLLUTION; MORTALITY; SENSITIVITY;
D O I
10.1007/s10489-020-02031-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
PM2.5 prediction has received much attention from researchers in recent years, as PM2.5 has been proven to have a major impact on human health. High-precision PM2.5 predictions would greatly benefit both the general public and governments. However, most of the existing studies on predicting PM2.5 or other air pollutants focus on model design and rarely discuss feature selection. Conventionally, researchers either simply choose a few types of meteorological data based on suggestions from experts or conduct simple correlation analyses to identify the types of meteorological data that are most correlated with the predicted pollutant. However, these methods suffer from two shortcomings. (1) Changes in PM2.5 values are influenced by low-frequency pollution from other places and high-frequency pollution from the local area. Furthermore, the meteorological data associated with the two pollution sources are generally different. In this case, changes with different frequencies in each type of meteorological data should be considered separately to precisely identify the correlated meteorological data of pollutants with different frequencies. (2) Datasets used for PM2.5 predictions generally contain high-dimensional meteorological data. Conventional correlation analysis methods are not effective at identifying features in such datasets, making it difficult to select useful features and negatively affecting prediction accuracy. This study therefore proposes two concepts to address this issue: (1) a wavelet model to decompose each type of meteorological data into multiple sub-time series with different frequencies and (2) the design of a novel radial basis function long short-term memory model that can analyze the outputs of the radial basis function to extract key features identified by deep learning models. This approach is faster and simpler than other methods using deep learning models to extract key features. Application of these concepts will greatly enhance prediction accuracy and reduce costs regardless of the prediction model used. We use three years of historical meteorological data from Central Taiwan to demonstrate the effectiveness of the proposed methods.
引用
收藏
页码:2534 / 2555
页数:22
相关论文
共 68 条
[1]   An optimized model using LSTM network for demand forecasting [J].
Abbasimehr, Hossein ;
Shabani, Mostafa ;
Yousefi, Mohsen .
COMPUTERS & INDUSTRIAL ENGINEERING, 2020, 143
[2]   Modelling the effects of meteorological variables on ozone concentration - a quantile regression approach [J].
Baur, D ;
Saisana, M ;
Schulze, N .
ATMOSPHERIC ENVIRONMENT, 2004, 38 (28) :4689-4699
[3]   Recursive neural network model for analysis and forecast of PM10 and PM2.5 [J].
Biancofiore, Fabio ;
Busilacchio, Marcella ;
Verdecchia, Marco ;
Tomassetti, Barbara ;
Aruffo, Eleonora ;
Bianco, Sebastiano ;
Di Tommaso, Sinibaldo ;
Colangeli, Carlo ;
Rosatelli, Gianluigi ;
Di Carlo, Piero .
ATMOSPHERIC POLLUTION RESEARCH, 2017, 8 (04) :652-659
[4]   A wavelet analysis of mean and volatility spillovers between oil and BRICS stock markets [J].
Boubaker, Heni ;
Raza, Syed Ali .
ENERGY ECONOMICS, 2017, 64 :105-117
[5]   Prediction of hourly air pollutant concentrations near urban arterials using artificial neural network approach [J].
Cai, Ming ;
Yin, Yafeng ;
Xie, Min .
TRANSPORTATION RESEARCH PART D-TRANSPORT AND ENVIRONMENT, 2009, 14 (01) :32-41
[6]   An LSTM-based aggregated model for air pollution forecasting [J].
Chang, Yue-Shan ;
Chiao, Hsin-Ta ;
Abimannan, Satheesh ;
Huang, Yo-Ping ;
Tsai, Yi-Ting ;
Lin, Kuan-Ming .
ATMOSPHERIC POLLUTION RESEARCH, 2020, 11 (08) :1451-1463
[7]   A Two-Layer Nonlinear Combination Method for Short-Term Wind Speed Prediction Based on ELM, ENN, and LSTM [J].
Chen, Min-Rong ;
Zeng, Guo-Qiang ;
Lu, Kang-Di ;
Weng, Jian .
IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (04) :6997-7010
[8]   Voltages prediction algorithm based on LSTM recurrent neural network [J].
Chen, Ying .
OPTIK, 2020, 220
[9]   Feature selection for time series prediction - A combined filter and wrapper approach for neural networks [J].
Crone, Sven F. ;
Kourentzes, Nikolaos .
NEUROCOMPUTING, 2010, 73 (10-12) :1923-1936
[10]  
Cybenko G., 1989, Mathematics of Control, Signals, and Systems, V2, P303, DOI 10.1007/BF02551274