A novel combined multi-variate prediction framework for air pollution based on feature selection and deep learning models

被引:0
作者
Bai, Lu [1 ]
Du, Pei [2 ]
Wang, Shubin [3 ]
Li, Hongmin [4 ]
Wang, Jianzhou [5 ]
机构
[1] Xiamen Univ Technol, Sch Math & Stat, Xiamen 361024, Fujian, Peoples R China
[2] Jiangnan Univ, Sch Business, Wuxi 214122, Jiangsu, Peoples R China
[3] Xian Univ Posts & Telecommun, Sch Econ & Management, Xian 710061, Shaanxi, Peoples R China
[4] Northeast Forestry Univ, Coll Econ & Management, Harbin 150000, Heilongjiang, Peoples R China
[5] Macau Univ Sci & Technol, Inst Syst Engn, Macau 999078, Peoples R China
关键词
PM; 2.5; concentration; Multi-variate prediction framework; Factor selection; Deep learning; PARTICLE SWARM OPTIMIZATION; POLLUTANTS; ALGORITHM;
D O I
10.1016/j.psep.2024.11.089
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Air pollution forecasting offers crucial support for pollution control and regulation. However, the non-linearity, volatility, and complexity of air pollution data pose significant challenges to accurate prediction. To address these challenges, this study develops a multi-variate combined forecasting framework based on feature selection methods, linear and nonlinear forecasting models, and intelligent optimization algorithms. Firstly, the framework analyzes the impact of each subset after feature selection on model performance. Secondly, multiple benchmark models based on different feature subsets and various prediction models are constructed as comparison models to further test the performance of the proposed prediction framework. Finally, two real-world datasets, statistical hypothesis testing, and several evaluation metrics are used to validate the prediction performance of the proposed framework. The Root Mean Square Error (RMSE) of the proposed model on the Site 1 dataset is 1.9762, representing an improvement of approximately 4 % compared to the best-performing benchmark model. Statistical testing further confirms that the proposed model significantly outperforms the benchmark model. The proposed combined multi-variate forecasting framework provides a heuristic framework for building combined prediction models with high prediction accuracy and robustness.
引用
收藏
页码:1157 / 1172
页数:16
相关论文
共 49 条
[1]  
[Anonymous], 2008, Advances in Neural Information Processing Systems
[2]   Recent advances and emerging challenges of feature selection in the context of big data [J].
Bolon-Canedo, V. ;
Sanchez-Marono, N. ;
Alonso-Betanzos, A. .
KNOWLEDGE-BASED SYSTEMS, 2015, 86 :33-45
[3]   Forecasting hourly PM2.5 concentrations based on decomposition-ensemble-reconstruction framework incorporating deep learning algorithms [J].
Cai P. ;
Zhang C. ;
Chai J. .
Data Science and Management, 2023, 6 (01) :46-54
[4]   A survey on feature selection methods [J].
Chandrashekar, Girish ;
Sahin, Ferat .
COMPUTERS & ELECTRICAL ENGINEERING, 2014, 40 (01) :16-28
[5]  
[陈华友 Chen Huayou], 2002, [中国科学技术大学学报, Journal of University of Science and Technology of China], V32, P172
[6]  
Cho KYHY, 2014, Arxiv, DOI [arXiv:1406.1078, 10.48550/ARXIV.1406.1078]
[7]   Attention enhanced hybrid model for spatiotemporal short-term forecasting of particulate matter concentrations [J].
Choudhury, Amartya ;
Middya, Asif Iqbal ;
Roy, Sarbani .
SUSTAINABLE CITIES AND SOCIETY, 2022, 86
[8]   Mean Absolute Percentage Error for regression models [J].
de Myttenaere, Arnaud ;
Golden, Boris ;
Le Grand, Benedicte ;
Rossi, Fabrice .
NEUROCOMPUTING, 2016, 192 :38-48
[9]   Comparing predictive accuracy (Reprinted) [J].
Diebold, FX ;
Mariano, RS .
JOURNAL OF BUSINESS & ECONOMIC STATISTICS, 2002, 20 (01) :134-144
[10]   An innovative interpretable combined learning model for wind speed forecasting [J].
Du, Pei ;
Yang, Dongchuan ;
Li, Yanzhao ;
Wang, Jianzhou .
APPLIED ENERGY, 2024, 358