A hybrid model for forecasting of particulate matter concentrations based on multiscale characterization and machine learning techniques

被引:14
作者
Shah, Syed Ahsin Ali [1 ]
Aziz, Wajid [1 ,2 ]
Almaraashi, Majid [2 ]
Nadeem, Malik Sajjad Ahmed [1 ]
Habib, Nazneen [3 ]
Shim, Seong-O [2 ]
机构
[1] Univ Azad Jammu & Kashmir, Dept Comp Sci & IT, King Abdullah Campus, Muzaffarabad 13100, Ajk, Pakistan
[2] Univ Jeddah, Coll Comp Sci & Engn, Jeddah 23890, Saudi Arabia
[3] Univ Azad Jammu Kashmir, Dept Sociol & Rural Dev, Muzaffarabad 13100, Ajk, Pakistan
关键词
empirical mode decomposition; forecasting; hybrid forecasting model; machine learning algorithms; particulate matter; DECOMPOSITION-ENSEMBLE MODEL; AIR-POLLUTION; TIME-SERIES; HEALTH;
D O I
10.3934/mbe.2021104
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Accurate prediction of particulate matter (PM) using time series data is a challenging task. The recent advancements in sensor technology, computing devices, nonlinear computational tools, and machine learning (ML) approaches provide new opportunities for robust prediction of PM concentrations. In this study, we develop a hybrid model for forecasting PM10 and PM2.5 based on the multiscale characterization and ML techniques. At first, we use the empirical mode decomposition (EMD) algorithm for multiscale characterization of PM10 and PM2.5 by decomposing the original time series into numerous intrinsic mode functions (IMFs). Different individual ML algorithms such as random forest (RF), support vector regressor (SVR), k-nearest neighbors (kNN), feed forward neural network (FFNN), and AdaBoost are then used to develop EMD-ML models. The air quality time series data from Masfalah air station Makkah, Saudi Arabia are utilized for validating the EMD-ML models, and results are compared with non-hybrid ML models. The PMs (PM10 and PM2.5) concentrations data of Dehli, India are also utilized for validating the EMD-ML models. The performance of each model is evaluated using root mean square error (RMSE) and mean absolute error (MAE). The average bias in the predictive model is estimated using mean bias error (MBE). Obtained results reveal that EMD-FFNN model provides the lowest error rate for both PM10 (RMSE = 12.25 and MAE = 7.43) and PM2.5 (RMSE = 4.81 and MAE = 3.02) using Misfalah, Makkah data whereas EMD-kNN model provides the lowest error rate for PM10 (RMSE = 20.56 and MAE = 12.87) and EMD-AdaBoost provides the lowest error rate for PM2.5 (RMSE = 15.29 and MAE = 9.45) using Dehli, India data. The findings also reveal that EMD-ML models can be effectively used in forecasting PM mass concentrations and to develop rapid air quality warning systems.
引用
收藏
页码:1992 / 2009
页数:18
相关论文
共 41 条
[1]  
[Anonymous], 1998, The new psychometrics: Science, psychology, and measurement
[2]  
BAILEY T, 1978, IEEE T SYST MAN CYB, V8, P311
[3]  
Breiman L., 2001, Mach. Learn., V45, P5
[4]   Air quality forecasting using a hybrid autoregressive and nonlinear model [J].
Chelani, AB ;
Devotta, S .
ATMOSPHERIC ENVIRONMENT, 2006, 40 (10) :1774-1780
[5]  
Chen Bingheng, 2008, Environmental Health and Preventive Medicine, V13, P94, DOI 10.1007/s12199-007-0018-5
[6]  
Chen D., 2019, PLOS ONE, V14
[7]   Epileptic seizure classifications using empirical mode decomposition and its derivative [J].
Cura, Ozlem Karabiber ;
Atli, Sibel Kocaaslan ;
Ture, Hatice Sabiha ;
Akan, Aydin .
BIOMEDICAL ENGINEERING ONLINE, 2020, 19 (01)
[8]  
Drucker H, 1997, ADV NEUR IN, V9, P155
[9]  
Fix E., 1952, USAF SCH AVIATION ME, P21
[10]  
Freund Y., 1999, Journal of Japanese Society for Artificial Intelligence, V14, P771