An Improved Air Quality Index Machine Learning-Based Forecasting with Multivariate Data Imputation Approach

Cited by: 29
Authors
Alkabbani, Hanin [1]
Ramadan, Ashraf [2]
Zhu, Qinqin [1]
Elkamel, Ali [1]
Affiliations
[1] Univ Waterloo, Dept Chem Engn, 200 Univ Ave West, Waterloo, ON N2L 3G1, Canada
[2] Kuwait Inst Sci Res, Environm & Life Sci Res Ctr, Environm Pollut & Climate Program, POB 24885, Safat 13109, Kuwait
Funding
Natural Sciences and Engineering Research Council of Canada
Keywords
ambient air quality observations; AQI; artificial neural network; machine learning; missForest imputation; forecasting; ARTIFICIAL NEURAL-NETWORKS; HYBRID ARIMA; PREDICTION; FINE; POLLUTION; MODEL; PARTICLES; MORTALITY; ENERGY; SAND;
DOI
10.3390/atmos13071144
Chinese Library Classification number
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
Accurate, timely air quality index (AQI) forecasting helps industries select the most suitable air pollution control measures and helps the public reduce harmful exposure to pollution. This article proposes a comprehensive method to forecast AQIs. Initially, the work focused on predicting hourly ambient concentrations of PM2.5 and PM10 using artificial neural networks. Once the method was developed, the work was extended to the prediction of the other criteria pollutants, i.e., O3, SO2, NO2, and CO, which fed into the process of estimating the AQI. Predicting the AQI requires not only a robust forecasting model but also a sequence of pre-processing steps that select predictors and handle data issues, including gaps. The presented method dealt with gaps by imputing missing entries using missForest, a machine learning-based imputation technique that employs the random forest (RF) algorithm. Unlike the usual practice of using RF at the final forecasting stage, we utilized RF at the data pre-processing stage, i.e., for missing data imputation and feature selection, and obtained promising results. The effectiveness of this imputation method was examined against a linear imputation method for the six criteria pollutants and the AQI. The proposed approach was validated against ambient air quality observations for Al-Jahra, a major city in Kuwait. The results showed that models trained on missForest-imputed data could generalize AQI forecasting, reaching a prediction accuracy of 92.41% when tested on new unseen data, which is better than earlier findings.
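To make two of the steps named in the abstract concrete, the following is a minimal sketch, not the authors' implementation. It shows (a) a simple linear gap-filling routine of the kind used as the baseline that missForest was compared against, and (b) an AQI computed from pollutant concentrations. The abstract does not give the AQI formula or breakpoints, so the sketch assumes the common convention of piecewise-linear sub-indices with the overall AQI taken as the largest sub-index. The breakpoint table and the function names `linear_impute`, `sub_index`, and `aqi` are hypothetical and chosen only for illustration. missForest itself is not reproduced here.

```python
from typing import Dict, List, Optional, Sequence

def linear_impute(series: Sequence[Optional[float]]) -> List[float]:
    """Fill None gaps by straight-line interpolation between the nearest
    observed neighbours; leading/trailing gaps copy the nearest observation."""
    values = list(series)
    known = [i for i, v in enumerate(values) if v is not None]
    if not known:
        raise ValueError("series has no observed values")
    for i, v in enumerate(values):
        if v is not None:
            continue
        left = max((k for k in known if k < i), default=None)
        right = min((k for k in known if k > i), default=None)
        if left is None:
            values[i] = values[right]
        elif right is None:
            values[i] = values[left]
        else:
            weight = (i - left) / (right - left)
            values[i] = values[left] + weight * (values[right] - values[left])
    return values

# HYPOTHETICAL breakpoints, for illustration only (not from the article):
# each tuple is (conc_lo, conc_hi, index_lo, index_hi).
BREAKPOINTS = {
    "PM10": [(0.0, 50.0, 0, 50), (50.0, 150.0, 50, 100), (150.0, 350.0, 100, 200)],
    "O3": [(0.0, 60.0, 0, 50), (60.0, 120.0, 50, 100), (120.0, 240.0, 100, 200)],
}

def sub_index(pollutant: str, conc: float) -> float:
    """Piecewise-linear mapping from a concentration to a sub-index."""
    for c_lo, c_hi, i_lo, i_hi in BREAKPOINTS[pollutant]:
        if c_lo <= conc <= c_hi:
            return i_lo + (i_hi - i_lo) * (conc - c_lo) / (c_hi - c_lo)
    raise ValueError(f"{pollutant} concentration {conc} is outside the table")

def aqi(concentrations: Dict[str, float]) -> float:
    """Overall AQI as the maximum sub-index (an assumed convention)."""
    return max(sub_index(p, c) for p, c in concentrations.items())

if __name__ == "__main__":
    pm10 = linear_impute([40.0, None, None, 100.0])
    print(pm10)
    print(aqi({"PM10": pm10[-1], "O3": 30.0}))
```

Linear interpolation uses only the neighbouring values of the same variable, whereas a multivariate method such as missForest can also draw on the other measured variables, which is the contrast the abstract evaluates.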
Pages: 26